
Pentaho business analytics & data integration Amjad.akkawi@zaponet.com

Date post: 25-Feb-2016
Upload: fleta
Transcript
Page 1: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Pentaho business analytics & data integration

Amjad.akkawi@zaponet.com

Page 2: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

About Us – Zaponet data science solutions

Zaponet is a service integrator and development shop providing solutions and professional services for building state-of-the-art data products that leverage big-data and data-science technologies.

Zaponet architects, designs, and builds big-data solutions: data warehouses, user-profile systems, recommendation engines, complex event processing, and more.

Some of our technology partners are: Pentaho, Cloudera, Infobright, Vertica, Kognitio, GigaSpaces

• More details: www.zaponet.com

• Future meetup: Pentaho Weka for data science

Page 3: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

About Me – Amjad Akkawi

Zaponet CTO

Experience in Pentaho

Page 4: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Agenda

• Pentaho in business analytics & data integration

• Pentaho BI Demo

• Pentaho PDI Demo

Page 5: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

About Pentaho

• Recognized leader in business analytics & data integration

• Subscription-based business model

• Achieved critical mass:
  – Over 1,200 commercial customers
  – Over 10,000 production deployments
  – Over 185 countries

• Stewardship of most important open source analytics projects

INDUSTRY RECOGNITION

OVER 160 PARTNERS GLOBALLY

Page 6: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Why Customers Love Pentaho

Innovation & Scalability

Superior Customer Service

Total Value

Speed of Deployment

• 8 weeks time to market
• 2 weeks time to market
• €350K+ cost saving
• 75% lower acquisition costs
• Music files from 20,000 sources
• Operational reports at all 1000 retail stores
• Less than 1 month ROI
• Analyzing buying patterns of 5 million members
• Analytics on 500,000 patient records
• …“better functionality and more support”
• …“top-notch professional support”
• “Pentaho support is as good as its software”
• …“a great partner through every phase of our project”
• …“ROI was almost immediate”.
• Fully rolled out in budget in 4 months
• Marketing dashboard in less than 1 day

Page 7: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Pentaho in the Big Data Fabric

Big Data Mgmt
• Hadoop: Java MapReduce, Pig, Pentaho MapReduce
• NoSQL Databases
• Analytic Databases

Data Integration
• Data Integration
• Job Orchestration
• Workflow
• Scheduling
• High Performance
• Visual IDE

Big Analytics
• Pentaho Business Analytics
• 3rd Party Tools: R, 3rd Party BI Tools, Applications

Page 8: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

High Level Feature/Functions

• Data Mining (Advanced Power Users & Viewers): Advanced Predictive Analysis
• Dashboards (Information Consumers): Self-service Interactive KPI & Metrics and Visualization
• Analysis (Knowledge Workers / Business Users): Self-service Interactive and Ad Hoc Analysis
• Reporting (Business Users): Ad hoc and Operational Reports
• Data (Power Users, Developers & DBAs): High Performance Data Integration, BIG DATA, Cleansing and Presentation

Components are independent

Page 10: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Dashboards

Page 11: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Dashboards & Interactive Dashboards

Page 12: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Dashboards – Geo Location-Based

Page 14: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Reports – Interactive, Static, Distributed

Page 15: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Reports – Reporting Pack & House Styles

Page 16: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Reports – Reporting Pack & House Styles

Page 18: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Enhanced In-Memory Analytics

• Enhanced in-memory caching for speed-of-thought visualization & analysis
  – More re-usability of in-memory data
  – Fewer trips to the database/disk

• Builds on existing unique extreme-scale in-memory analytics
  – Support for external data grids
    • Infinispan / JBoss Enterprise Data Grid and Memcached

• Scale to caching hundreds of GBs (potentially TBs) of data in-memory

• Competition
  – Java heap or C++ memory space (a few GB at most) (most BI products), or
  – Proprietary (hard to manage) in-memory technology (e.g. QlikView, MicroStrategy)
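
The "fewer trips to the database" idea on this slide can be illustrated with a very small cache. This is only a sketch in plain Python, not Pentaho's caching layer: `QueryCache`, `fake_database`, and the capacity of 2 are hypothetical names and values chosen for the example, and a simple least-recently-used eviction rule is assumed.

```python
from collections import OrderedDict


class QueryCache:
    """Tiny LRU cache keyed by query text (illustrative only)."""

    def __init__(self, loader, capacity=2):
        self.loader = loader            # function that actually hits the database
        self.capacity = capacity        # maximum number of cached results
        self.entries = OrderedDict()    # query text -> cached result
        self.db_trips = 0               # how often the loader was really called

    def get(self, query):
        if query in self.entries:
            # Cache hit: reuse in-memory data and mark it as recently used.
            self.entries.move_to_end(query)
            return self.entries[query]
        # Cache miss: one trip to the (hypothetical) database.
        self.db_trips += 1
        result = self.loader(query)
        self.entries[query] = result
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the least recently used entry
        return result


def fake_database(query):
    """Stand-in for a real database call (assumption for this sketch)."""
    return f"rows for {query}"


cache = QueryCache(fake_database, capacity=2)
for q in ["sales by region", "sales by region", "sales by month", "sales by region"]:
    cache.get(q)
print(cache.db_trips)
```

Four requests are served with two loader calls here; an external data grid would play the role of `entries` when the cached data no longer fits in one process.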

Page 19: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Analyzer – Table format

Page 20: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Analyzer – Chart format

Page 21: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Analyzer: Geo Location-Based Analysis

Page 23: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Scenario 1

Operational Database

Dashboard

Report

Page 24: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Scenario 2

Data Mart(s) / Warehouse

Metadata

Dashboard

Report

Analyzer

Page 25: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Metadata – Schema Workbench

Complex calculations and multi-cube requirements may need more modeling

Page 26: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Scenario 3

Unstructured Data

Data Mart(s) / Warehouse

Structured Data

BIG DATA Technology and/or Staging Area & Data Vault

Pentaho Data Integration
• Source data acquisition
• Initial consolidation as required

Pentaho Data Integration
• Cleansing
• Transformation
• Change Data Capture
• Data Warehouse Management

PDI PDI Metadata

Dashboard

Report

Analyzer
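
Change Data Capture, listed among the PDI responsibilities in this scenario, can be approximated by comparing two snapshots of a source table. The sketch below is plain Python under that assumption; it does not show how PDI implements the feature, and `capture_changes` and the sample rows are hypothetical.

```python
def capture_changes(previous, current):
    """Compare two snapshots keyed by primary key.

    Returns three dicts: rows inserted, rows updated, and rows deleted
    between the previous and the current snapshot.
    """
    inserts = {k: v for k, v in current.items() if k not in previous}
    deletes = {k: v for k, v in previous.items() if k not in current}
    updates = {
        k: v for k, v in current.items()
        if k in previous and previous[k] != v
    }
    return inserts, updates, deletes


# Hypothetical snapshots of a customer table, keyed by customer id.
previous = {
    1: {"name": "Ann", "city": "Haifa"},
    2: {"name": "Bob", "city": "Acre"},
}
current = {
    1: {"name": "Ann", "city": "Tel Aviv"},   # city changed
    3: {"name": "Cy", "city": "Eilat"},       # new row
}

inserts, updates, deletes = capture_changes(previous, current)
print(sorted(inserts), sorted(updates), sorted(deletes))
```

Only the changed rows then need to flow into the warehouse load, instead of the full table.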

Page 27: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Variations on a Theme

Unstructured Data

Ad-hoc Data

Data Mart(s) / Warehouse

Structured Data

Alerting: SMS, eMail & attachments

Pentaho Data Integration
• Source data acquisition
• Initial consolidation as required

Pentaho Data Integration
• Cleansing
• Transformation
• Change Data Capture
• Data Warehouse Management

PDI PDI Metadata

Dashboard

Report

Analyzer

BIG DATA Technology and/or Staging Area & Data Vault

Page 28: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

PDI Components

• Enterprise Edition Data Integration Server
  – Execution and remote monitoring
  – Integrated scheduling
  – Enterprise Security options
  – Enhanced content management including revision history and locking
  – Remote distributed cluster based processing

Page 29: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Kettle Conceptual Model

Page 30: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Pentaho Data Integration

Step based processing engine with instant visualization of results
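
A step-based engine passes rows from one step to the next, and each hop between steps is a place where intermediate results can be inspected. The following is a rough Python analogy using generators, not Kettle's actual engine or API; the step names (`read_rows`, `cleanse`, `add_tax`, `preview`) and the sample data are made up for illustration.

```python
def read_rows(lines):
    """Input step: parse 'name, amount' text lines into row dicts."""
    for line in lines:
        name, amount = line.split(",")
        yield {"name": name.strip(), "amount": amount.strip()}


def cleanse(rows):
    """Cleansing step: drop rows without an amount, convert the rest to float."""
    for row in rows:
        if row["amount"]:
            yield {**row, "amount": float(row["amount"])}


def add_tax(rows, rate=0.1):
    """Transformation step: add a 'total' field including a hypothetical tax rate."""
    for row in rows:
        yield {**row, "total": round(row["amount"] * (1 + rate), 2)}


def preview(rows, label, log):
    """Pass-through step that records each row, mimicking a per-step preview."""
    for row in rows:
        log.append((label, row))
        yield row


raw = ["widget, 10", "gadget, ", "gizmo, 2.5"]
log = []
result = list(add_tax(preview(cleanse(read_rows(raw)), "after cleanse", log)))
for row in result:
    print(row)
```

Because every step only consumes and yields rows, a preview step can be dropped between any two steps without changing the others.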

Page 31: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Pentaho Data Integration

Step based performance

Page 32: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Pentaho Data Integration

Integrated Metadata Creation

Page 33: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Pentaho and Big Data

Forrester Wave, Enterprise Hadoop Solutions, Q1 2012

Only vendor in strong performer category: “an impressive Hadoop integration tool”

Only business analytics vendor

Richest functionality

Most extensive integration with open source Apache Hadoop and major Hadoop distributions

Page 34: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Expanded Insight into Big and Diverse Data

• Improved support for Hadoop
  – Simpler deployment across Hadoop clusters
    • Support for the Hadoop cache
    • Debian RPM installer
  – Performance and ease of use enhancements for Pentaho MapReduce visual development
  – Support for Hadoop Security data access

• New NoSQL database support
  – Cassandra
  – MongoDB

• Growing the Pentaho big data community
  – Open sourced all big data components (Hadoop & NoSQL)
    • Apache License – same as used by leading Hadoop and NoSQL distros
  – New big data developer resources: how-to documents, videos, walk-throughs

Page 35: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Hadoop Data Management & Integration

Accessible by any ETL developer or data scientist

Pentaho MapReduce
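
For readers new to the model behind Pentaho MapReduce, the map, shuffle, and reduce phases can be sketched in a few lines of plain Python. This is a single-process word count for illustration only; it does not use Hadoop or any Pentaho component, and the function names and sample records are assumptions.

```python
from collections import defaultdict


def map_phase(record):
    """Map: emit a (word, 1) pair for every word in one input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: collapse the values of one key into a single count."""
    return key, sum(values)


records = ["Pentaho Hadoop", "hadoop pig hadoop"]
pairs = [pair for record in records for pair in map_phase(record)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())
print(counts)
```

On a cluster the map and reduce calls run in parallel on different nodes; the visual tooling on this slide is about designing those two functions without writing them by hand.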

Page 36: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

NoSQL Data Management & Integration

Accessible by any ETL developer or data scientist

Visual Job Orchestration

Any Data Source

Page 37: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Visual Job Orchestration

Any Data Source

Scheduling

Accessible to any ETL developer or data scientist

Page 38: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Pentaho Integration Options

Pentaho BI Server

Other Application

Pentaho

Custom Stuff

My Application

Pentaho Components

Page 39: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Integration: Bundled, Mashup, Extended, Embedded

Value
• Bundled: Fastest way to get analytics that have your look & feel
• Mashup: An integrated experience for your end user
• Extended: Customizing Pentaho for your experience
• Embedded: Ultimate integration and customization

What it Takes?

• Pentaho is a separate app, branded with Partner’s logo, look & feel

• Optional: Partner app may include links to Pentaho reports, analysis and dashboards (popping new window)

• Optional: Single sign-on creates a seamless experience

• Pentaho & Partner app have the same UI

• Pentaho User Console, or individual reports, analysis or dashboards are included in partner app

• Single sign-on creates a seamless experience

• Pentaho’s core functionality is extended through plug-ins. Examples:
  – Connecting to custom data sources
  – Adding new visualizations
  – Customizing security
  – Replacing Pentaho rules engine

• Integrate with Partner’s App Server

• Directly embedding Pentaho into your app

• Calling Pentaho Java APIs from your App

Skill Level
• Bundled: Limited HTML skills
• Mashup: HTML skills
• Extended: HTML skills, Java skills
• Embedded: HTML skills, Java skills, knowledge of Pentaho architecture

Page 40: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Q & A

NEXT …

• Pentaho PDI Demo

• Pentaho BI Demo

Page 41: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

“Traditional” Database Support

DATA INTEGRATION

DATA ANALYSIS

Page 42: Pentaho  business analytics & data  integration Amjad.akkawi@zaponet.com

Broadest Support for Big Data Platforms

Hadoop NoSQL Analytic Databases

