Date post: | 15-Jan-2015 |
Category: | Technology |
Upload: | mark-kromer |
© 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555
Pentaho Analytics for MongoDB
Mark Kromer
Pentaho Big Data Analytics Product Manager
@kromerbigdata
Pentaho Mission
Enabling the future of analytics
Modern, unified data integration and business analytics platform
• Broadest and deepest big data integration
• Embeddable, cloud-ready analytics
• Big data blending at the source
Fast and Broad Innovation
• Open source development model
• 100% Java, pluggable and extensible
Critical mass achieved
• Over 1,200 commercial customers
• Over 10,000 production deployments
Blending brings the two worlds together
Evolving big data architectures
[Architecture diagram. Operational sources: Customer, Provisioning, Billing. Big data sources: Location, Web, Social Media, Network. Components: existing ETL tool or PDI, EDW and data marts, BI tools, analytics, existing processor or PDI, Hadoop cluster, analytic DB, and PDI for on-demand integration and blending.]
Pentaho 5.1
Powering Big Data Analytics @ Scale
Powering Big Data Analytics @ Scale
Meeting the demands of the big data-driven enterprise
• Unleash operational analytics on MongoDB for IT and Business Analysts
  • Unlock value of data in MongoDB for analysts with no coding required
• Offload data preparation for data scientists
  • Focus on analytics, better understand customer behavior
• Reduce complexity for big data developers
  • Leverage existing skilled resources and reduce complexity
  • Improve efficiency and performance for analytics
Pentaho Analytics for MongoDB
[Diagram. Stages: INGEST, ORCHESTRATE, ANALYZE. Sources: ERP, CRM, DW, master data, structured data, unstructured data. Data flow: ingestion, processing (raw data, parsed data, analytic datasets), AGG FRAMEWORK, analysis & reporting. Pentaho components: Data Integration, Analytics.]
Pentaho Interactive Analysis & Data Discovery
Highly Flexible Advanced Visualizations
❯ Simple, easy-to-use visual data exploration
❯ Web-based thin client; in-memory caching
❯ Rich library of interactive visualizations
  • Geo-mapping, heat grids, scatter plots, bubble charts, line over bar and more
  • Pluggable visualizations
❯ Java ROLAP engine to analyze structured and unstructured data, with SQL dialects for querying data from RDBMSs
❯ Pluggable cache integrating with leading caching architectures: Infinispan (JBoss Data Grid) & Memcached
“The Pentaho platform is meeting unmet market needs, allowing users to directly analyze data in MongoDB. We have seen more accurate results with new analyses and are no longer constrained by having to only pull part of our data”
[Customer architecture diagram. Sources: web services, customer portal log files, cross-department operations data, transaction and policy data (RDBMS). Components: PDI, PDI JSON transformation, data marts, Analyzer tuned for MongoDB. Consumers: Business User (COO), reporting on operations and overhead; End Users, dashboards and reports on customer policy data; Data Scientist, data mining and data governance.]
Data Integration
ETL, Scheduling, Events, Orchestration
• 100% Java engine
• Metadata-driven architecture with a graphical ETL designer
• Scale-out architecture, deployable to
  • Desktop
  • PDI clusters
  • Hadoop clusters
• Plugin architecture for extensibility
• Batch, low-latency and real-time processing
• Rapid onboarding of analytics
• Embeddable
Concept – Data Transformations
INPUT(S) – PROCESS(ES) – OUTPUT(S)
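The input, process, output flow of a transformation can be sketched outside PDI in a few lines of Python. This is only an illustration of the concept, not PDI's API: the step functions (`read_input`, `flatten`, `write_output`) and the sample records are hypothetical.

```python
import json
from typing import Dict, Iterable, Iterator, List

# Hypothetical sample input: one JSON document per line.
RAW_LINES = [
    '{"name": "jeff", "eyes": "blue", "loc": [40.7, 73.4]}',
    '{"name": "ben", "hat": "yes"}',
]


def read_input(lines: Iterable[str]) -> Iterator[Dict]:
    """INPUT step: parse each line into a document."""
    for line in lines:
        yield json.loads(line)


def flatten(docs: Iterable[Dict]) -> Iterator[Dict]:
    """PROCESS step: turn the optional loc array into two flat columns."""
    for doc in docs:
        loc = doc.get("loc") or [None, None]
        yield {"name": doc["name"], "lat": loc[0], "lon": loc[1]}


def write_output(rows: Iterable[Dict]) -> List[Dict]:
    """OUTPUT step: collect rows (a real step would write to a table or file)."""
    return list(rows)


# Steps are chained so that rows stream from one step to the next.
rows = write_output(flatten(read_input(RAW_LINES)))
print(rows)
```

Because each step is a generator, rows stream through the chain one at a time rather than being materialized between steps, which loosely mirrors how rows pass between steps in a transformation.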
Concept – Jobs (orchestrate)
START – CHECK – WATCH – EXECUTE – NOTIFY – FINISH
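A job sequences whole tasks rather than rows. The sketch below walks the same START, CHECK, WATCH, EXECUTE, NOTIFY, FINISH path in plain Python; `run_job` and its callback parameters are hypothetical names chosen for illustration and do not correspond to PDI job entries.

```python
from typing import Callable, List


def run_job(
    check: Callable[[], bool],
    execute: Callable[[], None],
    notify: Callable[[str], None],
    max_polls: int = 3,
) -> List[str]:
    """Run the job once and return the sequence of stages it passed through."""
    trail = ["START"]
    ready = False
    for _ in range(max_polls):
        trail.append("CHECK")
        if check():
            ready = True
            break
        # Precondition not met yet; a real job would wait before polling again.
        trail.append("WATCH")
    if ready:
        trail.append("EXECUTE")
        execute()
        notify("job succeeded")
    else:
        notify("precondition never met")
    trail.append("NOTIFY")
    trail.append("FINISH")
    return trail


# Example run: the precondition (say, a file arriving) holds on the second poll.
polls = iter([False, True])
executed: List[str] = []
messages: List[str] = []
trail = run_job(
    check=lambda: next(polls),
    execute=lambda: executed.append("load"),
    notify=messages.append,
)
print(" -> ".join(trail))
```

The job always reaches NOTIFY and FINISH, so a failed precondition is reported instead of silently skipped.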
Broad Connectivity
Broad connectivity combined with powerful data integration
[Diagram: MongoDB cluster, PDI ETL, Analytics]
Customer 360 – NoSQL Architecture
A Blended View to Drive Revenue Growth and Service Improvements
Reference Architecture Notes
• Financial services company: Ingest data from source systems into single Big Data store, then process & summarize data at customer unique ID level
• Information is available in call center application for service, accessible by research analysts, and leveraged in predictive applications as well
[Architecture diagram. Sources: CRM System, Documents & Images, Admin. Info, Claims, Online Interactions. Store: NoSQL, loaded and read through PDI. Consumers: Call Center View, Research Analysts, Predictive Analytics, Analyzer, Reports.]
Why
• Ability to blend traditional data sources with Big Data
• Rapid time to value through drag/drop visual development for Big Data integration
• Adaptive Big Data layer guards system from changing Big Data versions – reduces risk
• Comprehensive analytics: visualizations, reports, dashboards, ad hoc analysis
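Summarizing data at the customer unique ID level amounts to a group-by on the customer key. The sketch below shows that step in plain Python on made-up records; the field names (`customer_id`, `source`, `amount`) are assumptions, and `GROUP_STAGE` shows how the same grouping might be written as a MongoDB aggregation `$group` stage (it is defined for comparison only and is not executed here).

```python
from collections import defaultdict
from typing import Dict, List

# Hypothetical events ingested from several source systems.
EVENTS = [
    {"customer_id": "C1", "source": "claims", "amount": 120.0},
    {"customer_id": "C2", "source": "crm", "amount": 0.0},
    {"customer_id": "C1", "source": "online", "amount": 0.0},
    {"customer_id": "C1", "source": "claims", "amount": 80.0},
]


def summarize_by_customer(events: List[Dict]) -> Dict[str, Dict]:
    """Roll events up to one summary record per customer ID."""
    acc = defaultdict(lambda: {"events": 0, "total_amount": 0.0, "sources": set()})
    for event in events:
        entry = acc[event["customer_id"]]
        entry["events"] += 1
        entry["total_amount"] += event["amount"]
        entry["sources"].add(event["source"])
    # Convert the source sets to sorted lists so the result is stable and serializable.
    return {
        cid: {
            "events": e["events"],
            "total_amount": e["total_amount"],
            "sources": sorted(e["sources"]),
        }
        for cid, e in acc.items()
    }


# Comparable grouping as an aggregation pipeline stage (illustrative, not run here).
GROUP_STAGE = {
    "$group": {
        "_id": "$customer_id",
        "events": {"$sum": 1},
        "total_amount": {"$sum": "$amount"},
        "sources": {"$addToSet": "$source"},
    }
}

summary = summarize_by_customer(EVENTS)
print(summary["C1"])
```

One summary record per customer is what lets a call center view, an analyst report, and a predictive model all read from the same key.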
Flexible Schema for Big Data Variety
Every document in a single collection could have different customer data
{name: "jeff", eyes: "blue", loc: [40.7, 73.4], boss: "ben"}
{name: "brendan", aliases: ["el diablo"]}
{name: "ben", hat: "yes"}
{name: "matt", pizza: "DiGiorno", height: 72, loc: [44.6, 71.3]}
{name: "will", eyes: "blue", birthplace: "NY", aliases: ["bill", "la ciacco"], loc: [32.7, 63.4], boss: "ben"}
50M Customers = 50M Documents = 1TB
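To make the flexible-schema point concrete, the five documents above can be held in one list of Python dictionaries and queried without declaring any schema. This is an in-memory sketch only: `find` and `field_coverage` are hypothetical helpers, not a MongoDB driver API.

```python
from typing import Any, Dict, List

# The documents from the slide, as one "collection" with no shared schema.
CUSTOMERS: List[Dict[str, Any]] = [
    {"name": "jeff", "eyes": "blue", "loc": [40.7, 73.4], "boss": "ben"},
    {"name": "brendan", "aliases": ["el diablo"]},
    {"name": "ben", "hat": "yes"},
    {"name": "matt", "pizza": "DiGiorno", "height": 72, "loc": [44.6, 71.3]},
    {"name": "will", "eyes": "blue", "birthplace": "NY",
     "aliases": ["bill", "la ciacco"], "loc": [32.7, 63.4], "boss": "ben"},
]


def find(collection: List[Dict[str, Any]], **criteria: Any) -> List[Dict[str, Any]]:
    """Return documents matching every criterion; a missing field never matches."""
    return [
        doc for doc in collection
        if all(key in doc and doc[key] == value for key, value in criteria.items())
    ]


def field_coverage(collection: List[Dict[str, Any]]) -> Dict[str, int]:
    """Count how many documents carry each field."""
    counts: Dict[str, int] = {}
    for doc in collection:
        for key in doc:
            counts[key] = counts.get(key, 0) + 1
    return counts


blue_eyed = [doc["name"] for doc in find(CUSTOMERS, eyes="blue")]
coverage = field_coverage(CUSTOMERS)
print(blue_eyed)
print(coverage)
```

Only `name` appears in every document; an analytics layer therefore has to tolerate absent fields, which is why the query helper treats a missing field as a non-match instead of an error.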
Documents Accelerate Time to Market
• Reduces development effort
• Data is more useful than independent representations
• Documents make it easy to integrate data from multiple schemas into a shared representation
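As an illustration of integrating several schemas into one shared representation, the sketch below folds three differently shaped records into a single customer document. All record layouts and field names here are invented for the example; they are not taken from the presentation.

```python
from typing import Any, Dict, List

# Hypothetical records from three systems, each with its own key name for the customer.
CRM_ROW = {"cust_id": 42, "full_name": "Will Example", "segment": "gold"}
CLAIMS_ROWS = [
    {"customer": 42, "claim_no": "A-1", "amount": 300},
    {"customer": 42, "claim_no": "A-2", "amount": 50},
    {"customer": 7, "claim_no": "B-9", "amount": 10},
]
WEB_EVENTS = [{"uid": 42, "page": "/quote"}, {"uid": 7, "page": "/home"}]


def build_customer_document(
    crm: Dict[str, Any],
    claims: List[Dict[str, Any]],
    events: List[Dict[str, Any]],
) -> Dict[str, Any]:
    """Embed related rows from each source under one customer document."""
    cid = crm["cust_id"]
    return {
        "_id": cid,
        "name": crm["full_name"],
        "segment": crm["segment"],
        # Child rows become embedded arrays instead of separate joined tables.
        "claims": [
            {"claim_no": c["claim_no"], "amount": c["amount"]}
            for c in claims if c["customer"] == cid
        ],
        "web_pages": [e["page"] for e in events if e["uid"] == cid],
    }


customer_doc = build_customer_document(CRM_ROW, CLAIMS_ROWS, WEB_EVENTS)
print(customer_doc)
```

The three sources never had to agree on a common table layout; each contributes a field or an embedded array to the shared document.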
Scale Like an Accordion
Automatic horizontal scaling based on customer ID
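The idea of spreading documents across machines by customer ID can be sketched as a hash of the ID mapped to a shard number. This is a simplified stand-in written for illustration and is not MongoDB's actual routing algorithm; `shard_for` and the generated customer IDs are hypothetical.

```python
import hashlib
from collections import Counter


def shard_for(customer_id: str, num_shards: int) -> int:
    """Map a customer ID to a shard index using a stable hash."""
    digest = hashlib.md5(customer_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards


# Distribute 1,000 made-up customer IDs over 4 shards and count the spread.
customer_ids = [f"cust-{n:05d}" for n in range(1000)]
distribution = Counter(shard_for(cid, 4) for cid in customer_ids)
print(sorted(distribution.items()))
```

A stable hash keeps every document for one customer on the same shard while spreading different customers roughly evenly. Plain modulo hashing reassigns most keys when the shard count changes, which is one reason production systems manage key ranges and rebalance them rather than using this direct mapping.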
New Book – Pentaho Analytics for MongoDB
Thank You