Pentaho Analytics on MongoDB

Date post: 15-Jan-2015
Upload: mark-kromer
Description:
A quick, small run-through of Pentaho Analytics 5.1 on MongoDB, providing native support for ETL, Reporting and Analytics on your MongoDB Collections.
Transcript
Page 1: Pentaho Analytics on MongoDB

© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555

Pentaho Analytics for MongoDB

Mark Kromer
Pentaho Big Data Analytics Product Manager
@kromerbigdata

Page 2: Pentaho Analytics on MongoDB

Pentaho Mission
Enabling the future of analytics

Modern, unified data integration and business analytics platform
• Broadest and deepest big data integration
• Embeddable, cloud-ready analytics
• Big data blending at the source

Fast and Broad Innovation
• Open source development model
• 100% Java, pluggable and extensible

Critical mass achieved
• Over 1,200 commercial customers
• Over 10,000 production deployments

Page 3: Pentaho Analytics on MongoDB

Blending brings the two worlds together
Evolving big data architectures

[Diagram labels: Customer, Provisioning, Billing; Existing ETL Tool or PDI; EDW, Data Marts; BI Tools; Location, Web, Social Media, Network; Existing Process or PDI; Hadoop Cluster; PDI; Analytic DB; Analytics; On-Demand Integration & Blending]

Page 4: Pentaho Analytics on MongoDB

Pentaho 5.1

Powering Big Data Analytics @ Scale

Page 5: Pentaho Analytics on MongoDB

Powering Big Data Analytics @ Scale
Meeting the demands of the big data-driven enterprise

• Unleash operational analytics on MongoDB for IT and Business Analysts
  • Unlock value of data in MongoDB for analysts with no coding required
• Offload data preparation for data scientists
  • Focus on analytics, better understand customer behavior
• Reduce complexity for big data developers
  • Leverage existing skilled resources and reduce complexity
  • Improve efficiency and performance for analytics

Page 6: Pentaho Analytics on MongoDB

Pentaho Analytics for MongoDB

[Diagram labels: INGEST, ORCHESTRATE, ANALYZE; ERP, CRM, DW, Master Data; Structured Data, Unstructured Data; Ingestion, Processing, Analysis & Reporting; Raw Data, Parsed Data, Analytic Datasets; AGG FRAMEWORK; Data Integration, Analytics]
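
The AGG FRAMEWORK label in this diagram presumably refers to MongoDB's aggregation framework, where a pipeline of stages filters and summarizes documents inside the database. A minimal sketch of the idea follows. The `orders` documents and their field names are hypothetical, and `run_pipeline` is a toy stand-in that only understands equality `$match` and `$group` with `$sum`, so the example runs without a server. It is not how MongoDB or Pentaho evaluates a pipeline.

```python
from collections import defaultdict

# Hypothetical sample documents; field names are illustrative only.
orders = [
    {"region": "east", "status": "paid", "amount": 120},
    {"region": "west", "status": "paid", "amount": 80},
    {"region": "east", "status": "open", "amount": 50},
    {"region": "east", "status": "paid", "amount": 30},
]

# Pipeline written in the stage-list shape the aggregation framework uses.
pipeline = [
    {"$match": {"status": "paid"}},
    {"$group": {"_id": "$region", "total": {"$sum": "$amount"}}},
]


def run_pipeline(docs, stages):
    """Toy evaluator: supports equality $match and $group with $sum only."""
    for stage in stages:
        if "$match" in stage:
            cond = stage["$match"]
            docs = [d for d in docs if all(d.get(k) == v for k, v in cond.items())]
        elif "$group" in stage:
            spec = stage["$group"]
            key_field = spec["_id"].lstrip("$")
            totals = defaultdict(lambda: defaultdict(int))
            for d in docs:
                for out_name, acc in spec.items():
                    if out_name == "_id":
                        continue
                    source = acc["$sum"].lstrip("$")
                    totals[d.get(key_field)][out_name] += d.get(source, 0)
            docs = [{"_id": k, **dict(v)} for k, v in totals.items()]
        else:
            raise ValueError(f"unsupported stage: {stage}")
    return docs


result = run_pipeline(orders, pipeline)
```

With a live server and the PyMongo driver, the same `pipeline` list would be passed to `collection.aggregate(pipeline)` instead of the toy evaluator.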

Page 7: Pentaho Analytics on MongoDB

Pentaho Interactive Analysis & Data Discovery
Highly Flexible Advanced Visualizations

❯ Simple, easy-to-use visual data exploration
❯ Web-based thin client; in-memory caching
❯ Rich library of interactive visualizations
  • Geo-mapping, heat grids, scatter plots, bubble charts, line over bar and more
  • Pluggable visualizations
❯ Java ROLAP engine to analyze structured and unstructured data, with SQL dialects for querying data from RDBMSs
❯ Pluggable cache integrating with leading caching architectures: Infinispan (JBoss Data Grid) and Memcached

Page 8: Pentaho Analytics on MongoDB

“The Pentaho platform is meeting unmet market needs, allowing users to directly analyze data in MongoDB. We have seen more accurate results with new analyses and are no longer constrained by having to only pull part of our data”

[Diagram labels:
• Sources: Web Services, Customer Portal Log Files, Cross Department Operations Data, Transaction and Policy Data, RDBMS
• Integration: PDI, PDI JSON transformation, Data Marts, Analyzer tuned for MongoDB
• Business User (COO): Reporting on Operations and Overhead
• End Users: Dashboards and Reports on Customer Policy Data
• Data Scientist: Data Mining and Data Governance]

Page 9: Pentaho Analytics on MongoDB

Data Integration
ETL, Scheduling, Events, Orchestration

• 100% Java engine
• Metadata-driven architecture – graphical ETL Designer
• Scale-out architecture, deployable to
  • Desktop
  • PDI clusters
  • Hadoop clusters
• Plugin architecture for extensibility
• Batch, low-latency and real-time processing
• Rapid onboarding of Analytics
• Embeddable

Page 10: Pentaho Analytics on MongoDB

Concept – Data Transformations

INPUT(S) – PROCESS(ES) – OUTPUT(S)
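
The input, process, output pattern can be sketched in a few lines of Python. This is only an illustration of the concept, not PDI's engine or API: the CSV content, column names, and the three step functions (`read_rows`, `clean`, `write_json_lines`) are all hypothetical.

```python
import csv
import io
import json

# Hypothetical input: a small CSV export; column names are illustrative.
RAW_CSV = "customer_id,name,spend\n1,jeff,10.5\n2,ben,\n3,will,7\n"


def read_rows(text):
    """INPUT: yield one dict per CSV row."""
    yield from csv.DictReader(io.StringIO(text))


def clean(rows):
    """PROCESS: cast types and drop rows that have no spend value."""
    for row in rows:
        if row["spend"] == "":
            continue
        yield {
            "customer_id": int(row["customer_id"]),
            "name": row["name"],
            "spend": float(row["spend"]),
        }


def write_json_lines(rows):
    """OUTPUT: serialize each row as one JSON document per line."""
    return "\n".join(json.dumps(r, sort_keys=True) for r in rows)


# Steps are chained so rows stream from input through process to output.
output = write_json_lines(clean(read_rows(RAW_CSV)))
```

Each step only consumes rows and emits rows, so steps can be swapped or added without touching the others, which is the property a graphical ETL designer relies on.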

Page 11: Pentaho Analytics on MongoDB

Concept – Jobs (orchestrate)

START – CHECK – WATCH – EXECUTE – NOTIFY – FINISH
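
The job sequence on this slide can be sketched as a small runner that walks the steps in order and always sends a notification before finishing. This is a conceptual sketch under assumptions, not PDI's job engine: `run_job`, `wait_for`, and the callables passed to them are hypothetical names.

```python
import time


def wait_for(predicate, attempts=3, delay=0.0):
    """WATCH helper: poll `predicate` until it is true or attempts run out."""
    for _ in range(attempts):
        if predicate():
            return True
        time.sleep(delay)
    return False


def run_job(check, watch, execute, notify):
    """Run START, CHECK, WATCH, EXECUTE, NOTIFY, FINISH and log each step."""
    log = ["START"]
    status = "success"
    try:
        for name, step in (("CHECK", check), ("WATCH", watch)):
            if not step():
                raise RuntimeError(f"{name} not satisfied")
            log.append(f"{name} ok")
        execute()
        log.append("EXECUTE ok")
    except Exception as exc:  # any failing step marks the whole job as failed
        status = "failed"
        log.append(f"failed: {exc}")
    notify(status)  # notification is sent on success and on failure
    log.append(f"NOTIFY {status}")
    log.append("FINISH")
    return status, log


sent = []
arrivals = iter([False, True])  # simulated input that appears on the second poll
status, log = run_job(
    check=lambda: True,
    watch=lambda: wait_for(lambda: next(arrivals)),
    execute=lambda: None,
    notify=sent.append,
)
```

Separating orchestration (jobs) from row-level work (transformations) keeps retry, scheduling, and alerting logic out of the data-handling code.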

Page 12: Pentaho Analytics on MongoDB

Broad Connectivity
Broad connectivity combined with powerful data integration

[Diagram labels: MongoDB cluster, PDI ETL, Analytics]

Page 13: Pentaho Analytics on MongoDB

Customer 360 – NoSQL Architecture
A Blended View to Drive Revenue Growth and Service Improvements

Why
• Ability to blend traditional data sources with Big Data
• Rapid time to value through drag/drop visual development for Big Data integration
• Adaptive Big Data layer guards system from changing Big Data versions – reduces risk
• Comprehensive analytics: visualizations, reports, dashboards, ad hoc analysis

Reference Architecture Notes
• Financial services company: Ingest data from source systems into single Big Data store, then process & summarize data at customer unique ID level
• Information is available in call center application for service, accessible by research analysts, and leveraged in predictive applications as well

[Diagram labels: CRM System, Documents & Images, Admin. Info, Claims, Online Interactions; PDI; NoSQL; PDI; Call Center View, Research Analysts, Predictive Analytics; Analyzer, Reports]

Page 14: Pentaho Analytics on MongoDB

Flexible Schema for Big Data Variety

Every document in a single collection could have different customer data

{name: "jeff", eyes: "blue", loc: [40.7, 73.4], boss: "ben"}
{name: "brendan", aliases: ["el diablo"]}
{name: "ben", hat: "yes"}
{name: "matt", pizza: "DiGiorno", height: 72, loc: [44.6, 71.3]}
{name: "will", eyes: "blue", birthplace: "NY", aliases: ["bill", "la ciacco"], loc: [32.7, 63.4], boss: "ben"}

50M Customers = 50M Documents = 1TB
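
To make the flexible-schema point concrete, the five sample documents can be loaded as plain Python dictionaries and inspected. The sketch below counts how often each field appears, reads an optional field safely, and works out the average document size implied by the slide's figures. It assumes 1 TB means 10^12 bytes, which the slide does not specify.

```python
from collections import Counter

# The five documents from the slide, written as Python dicts.
customers = [
    {"name": "jeff", "eyes": "blue", "loc": [40.7, 73.4], "boss": "ben"},
    {"name": "brendan", "aliases": ["el diablo"]},
    {"name": "ben", "hat": "yes"},
    {"name": "matt", "pizza": "DiGiorno", "height": 72, "loc": [44.6, 71.3]},
    {"name": "will", "eyes": "blue", "birthplace": "NY",
     "aliases": ["bill", "la ciacco"], "loc": [32.7, 63.4], "boss": "ben"},
]

# How many documents carry each field: only "name" is present in all five.
field_counts = Counter(field for doc in customers for field in doc)

# Optional fields must be read defensively, since most documents lack them.
reports_to_ben = [doc["name"] for doc in customers if doc.get("boss") == "ben"]

# Average size per document if 50M documents total 1 TB (assumed 10**12 bytes).
avg_doc_bytes = 10**12 / 50_000_000
```

Under that assumption each customer document averages about 20 KB, and any analytics layer reading the collection has to tolerate fields that exist in some documents and not in others.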

Page 15: Pentaho Analytics on MongoDB

Documents Accelerate Time to Market

• Reduces development effort
• Data is more useful than independent representations
• Documents make it easy to integrate data from multiple schemas into a shared representation

Page 16: Pentaho Analytics on MongoDB

Scale Like an Accordion
Automatic horizontal scaling based on customer ID
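
The idea of spreading documents across servers by customer ID can be illustrated with a simplified hash-and-modulo router. This is a sketch under assumptions: `shard_for` and `distribute` are hypothetical helpers, and MongoDB itself partitions data into chunks by shard key and rebalances them automatically rather than using a plain modulo.

```python
import hashlib


def shard_for(customer_id, shard_count):
    """Map a customer ID to a shard index using a stable hash."""
    digest = hashlib.md5(str(customer_id).encode("utf-8")).hexdigest()
    return int(digest, 16) % shard_count


def distribute(customer_ids, shard_count):
    """Group customer IDs by the shard they are routed to."""
    shards = {i: [] for i in range(shard_count)}
    for cid in customer_ids:
        shards[shard_for(cid, shard_count)].append(cid)
    return shards


# The same 1,000 hypothetical customer IDs spread over three, then four shards.
three = distribute(range(1000), 3)
four = distribute(range(1000), 4)
```

Plain modulo routing relocates many keys whenever the shard count changes, which is one reason production systems prefer chunk-based or range-based partitioning that moves only part of the data when a shard is added.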

Page 17: Pentaho Analytics on MongoDB

New Book – Pentaho Analytics for MongoDB

Page 18: Pentaho Analytics on MongoDB

Thank You

