Hadoop Reporting and Analysis - Jaspersoft

Date post: 10-May-2015
Upload: hortonworks
Description:
Hadoop is deployed for a variety of uses, including web analytics, fraud detection, security monitoring, healthcare, environmental analysis, social media monitoring, and other purposes.
Transcript
Page 1: Hadoop Reporting and Analysis - Jaspersoft

Hadoop Reporting & Analysis: What Architecture is Best for Me?

Page 2: Hadoop Reporting and Analysis - Jaspersoft

©2013 Jaspersoft Corporation. 2

Jim Walker, Director Product Marketing, Hortonworks

Twenty years of experience building products and bringing them to market. His expertise includes data loss prevention, master data management, and now big data.

Ben Connors, Worldwide Head of Alliances, Jaspersoft

Prior to Jaspersoft, Ben was at HP, Oracle, Viador, and other BI companies. He has over 20 years of experience in databases and business intelligence.

Matt Dahlman, Technical Director of Alliances, Jaspersoft

Prior to Jaspersoft, Matt was with Oracle, Netonomy, and Sybase. He brings over 15 years of database and business intelligence experience to his role.

Presenters

Page 3: Hadoop Reporting and Analysis - Jaspersoft

Agenda

• Hadoop in the Modern Data Architecture
• Hadoop Usage Patterns
• Jaspersoft
  • Company
  • BI Suite
• Jaspersoft/Hortonworks Integration Demo
• The Future of Interactive Hadoop
• Q&A

©2013 Jaspersoft Corporation. Proprietary and Confidential 3

Page 4: Hadoop Reporting and Analysis - Jaspersoft

© Hortonworks Inc. 2013

A Brief History of Apache Hadoop

Timeline axis: 2004, 2006, 2008, 2010, 2012, 2013

• Focus on INNOVATION. 2005: Yahoo! creates team under E14 to work on Hadoop
• Apache Project Established
• Focus on OPERATIONS. 2008: Yahoo team extends focus to operations to support multiple projects & growing clusters
• Yahoo! begins to operate at scale
• STABILITY. 2011: Hortonworks created to focus on “Enterprise Hadoop”. Starts with 24 key Hadoop engineers from Yahoo
• Enterprise Hadoop; Hortonworks Data Platform

Page 5: Hadoop Reporting and Analysis - Jaspersoft

Existing Data Architecture

• APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
• DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP)
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); OLTP, POS SYSTEMS
• OPERATIONAL TOOLS: MANAGE & MONITOR
• DEV & DATA TOOLS: BUILD & TEST

Page 6: Hadoop Reporting and Analysis - Jaspersoft

An Emerging Data Architecture

• APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
• DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP); HORTONWORKS DATA PLATFORM
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); OLTP, POS SYSTEMS; New Sources (web logs, email, sensor data, social media); MOBILE DATA
• OPERATIONAL TOOLS: MANAGE & MONITOR
• DEV & DATA TOOLS: BUILD & TEST

Page 7: Hadoop Reporting and Analysis - Jaspersoft

Interoperating With Your Tools

• APPLICATIONS: apps
• DATA SYSTEMS: TRADITIONAL REPOS; HORTONWORKS DATA PLATFORM
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); OLTP, POS SYSTEMS; New Sources (web logs, email, sensor data, social media); MOBILE DATA
• OPERATIONAL TOOLS: MANAGE & MONITOR
• DEV & DATA TOOLS: BUILD & TEST

Page 8: Hadoop Reporting and Analysis - Jaspersoft

HDP: Enterprise Hadoop Distribution

Hortonworks Data Platform (HDP): Enterprise Hadoop

• The ONLY 100% open source and complete distribution
• Enterprise grade, proven and tested at scale
• Ecosystem endorsed to ensure interoperability

HORTONWORKS DATA PLATFORM (HDP) layers:

• HADOOP CORE: Distributed Storage & Processing
• PLATFORM SERVICES: Enterprise Readiness: HA, DR, Snapshots, Security, …
• DATA SERVICES: Store, Process and Access Data
• OPERATIONAL SERVICES: Manage & Operate at Scale

Components shown: HDFS, YARN (in 2.0), WEBHDFS, MAP REDUCE, HCATALOG, HIVE, PIG, HBASE, SQOOP, FLUME, OOZIE, AMBARI

Deployment options: OS, Cloud, VM, Appliance

Page 9: Hadoop Reporting and Analysis - Jaspersoft

Operational Data Refinery

Refine, Explore, Enrich

Collect data and apply a known algorithm to it in trusted operational process

1. Capture: Capture all data
2. Process: Parse, cleanse, apply structure & transform
3. Exchange: Push to existing data warehouse for use with existing analytic tools

• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media)
• DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP); HORTONWORKS DATA PLATFORM
• APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
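
The capture, process, and exchange steps on this slide can be illustrated with a minimal Python sketch. It is not taken from the presentation and does not use any Hadoop API: the sample log lines, the `LOG_PATTERN` regular expression, and the function names `capture`, `process`, and `exchange` are all hypothetical, chosen only to show the shape of the flow on a handful of in-memory records.

```python
import re
from collections import Counter

# Hypothetical raw web-log lines standing in for "new sources"; not from the slides.
RAW_LINES = [
    '10.0.0.1 - - [10/May/2013:10:00:01] "GET /home HTTP/1.1" 200',
    'garbage line that does not parse',
    '10.0.0.2 - - [10/May/2013:10:00:05] "GET /cart HTTP/1.1" 500',
    '10.0.0.1 - - [10/May/2013:10:00:09] "GET /home HTTP/1.1" 200',
]

# Assumed log layout for this sketch only.
LOG_PATTERN = re.compile(
    r'^(?P<ip>\S+) - - \[(?P<ts>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3})$'
)


def capture(lines):
    """Step 1 (Capture): keep every raw record, including ones that may not parse."""
    return list(lines)


def process(raw):
    """Step 2 (Process): parse, cleanse (drop unparseable lines), apply structure."""
    rows = []
    for line in raw:
        match = LOG_PATTERN.match(line)
        if match:
            row = match.groupdict()
            row["status"] = int(row["status"])  # transform text into a typed field
            rows.append(row)
    return rows


def exchange(rows):
    """Step 3 (Exchange): shape a per-path summary that a warehouse load could consume."""
    hits = Counter(row["path"] for row in rows)
    return sorted(hits.items())


summary = exchange(process(capture(RAW_LINES)))
print(summary)
```

In a real deployment each step would run on the cluster rather than in one process; the sketch only shows that raw data is kept whole at capture time and structure is applied afterwards.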

Page 10: Hadoop Reporting and Analysis - Jaspersoft

Application Enrichment

Refine, Explore, Enrich

Collect data, analyze and present salient results for online apps

1. Capture: Capture all data
2. Process: Parse, cleanse, apply structure & transform
3. Exchange: Incorporate data directly into applications

• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media)
• DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP); NOSQL; HORTONWORKS DATA PLATFORM
• APPLICATIONS: Custom Applications, Enterprise Applications

Page 11: Hadoop Reporting and Analysis - Jaspersoft

Big Data Exploration & Visualization

Refine, Explore, Enrich

Collect data and perform iterative investigation for value

1. Capture: Capture all data
2. Process: Parse, cleanse, apply structure & transform
3. Exchange: Explore and visualize with analytics tools supporting Hadoop

• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media)
• DATA SYSTEMS: TRADITIONAL REPOS (RDBMS, EDW, MPP); HORTONWORKS DATA PLATFORM
• APPLICATIONS: Business Analytics

Page 12: Hadoop Reporting and Analysis - Jaspersoft

The Intelligence Inside

Page 13: Hadoop Reporting and Analysis - Jaspersoft

Competing on Time and Information

“The New Factors of Production: Time and Information”
Brian Gentile, Jaspersoft

But business users don’t have access to timely, actionable data.

Why?

Most don’t spend their day inside a BI tool … nor do they want to!

Page 14: Hadoop Reporting and Analysis - Jaspersoft

We Need “Intelligence Inside”

We want information to FIND US, not the other way round

“We need Intelligence Inside the applications and business processes we use every day.”

• Pipeline dashboard inside SaaS CRM app
• Performance report inside partner portal
• Salary data visualizations inside HR intranet
• Portfolio analytics inside client website
• Tickets crosstab inside custom helpdesk app
• Interactive charts inside native mobile app

Page 15: Hadoop Reporting and Analysis - Jaspersoft

Jaspersoft: The Intelligence Inside

Self-Service BI + Embeddable + Affordable

“We empower millions of people every day to make decisions faster by delivering timely, actionable data to them inside their apps and business process through an embeddable, cost-effective reporting and analytics platform.”

Page 16: Hadoop Reporting and Analysis - Jaspersoft

Intelligence Inside

Example Customers

Commercial Apps

Customer Portals

Cloud Apps

Internal Apps

Big Data Analytics

The Intelligence Inside Business

Page 17: Hadoop Reporting and Analysis - Jaspersoft

The Intelligence Inside the New IT Stack

Inaugural BI service:

• On VMware Cloud Foundry
• On Red Hat OpenShift
• Jaspersoft Certified Amazon Redshift and RDS
• To connect directly (no ETL) to non-SQL sources like MongoDB and HBase

“Our mission is to become the de facto reporting and analytic service in the New IT Stack, enabling BI Builders to build the Intelligence Inside internal and commercial apps on the leading Cloud platforms, powered by the new Big Data stores.”

Page 18: Hadoop Reporting and Analysis - Jaspersoft

Broad Recognition, Strong Partnerships

50%+ ACV Growth Every Year

Magic Quadrants

World’s Most Widely Deployed BI

• Commercial Open Source BI Suite
• Nearly 200 people in US, EMEA, APAC
• 16,000,000 downloads
• 325,000 community members
• 130,000 embedded applications
• 15,000 paying customers
• 1,800 subscription customers

Jaspersoft: High Growth and Momentum

Page 19: Hadoop Reporting and Analysis - Jaspersoft

Product Overview

Page 20: Hadoop Reporting and Analysis - Jaspersoft

Design Any Report . . .

Page 21: Hadoop Reporting and Analysis - Jaspersoft

… Dashboard

Page 22: Hadoop Reporting and Analysis - Jaspersoft

… or Analytic View

Page 23: Hadoop Reporting and Analysis - Jaspersoft

… using Any Data Type

Data type categories: Relational, Big Data, Files

Labeled sources: POJO files, Redshift, BigQuery

Page 24: Hadoop Reporting and Analysis - Jaspersoft

… bringing Intelligence to Any App

Page 25: Hadoop Reporting and Analysis - Jaspersoft

… with a World-Class BI Platform

• Reporting, Dashboards, Visualization, OLAP Analysis
• Columnar-Based In-Memory Engine
• Business Metadata Layer
• Data Connectivity to Any Data: Data Integration, Data Virtualization, Direct
• Data sources: Hadoop, RDBMS, Other Data
• HTML5 Browser, Native Mobile Apps
• 100% Web Standards: CSS, .JS, .JSP, Java
• Extensive APIs: HTTP, SOAP, REST

Page 26: Hadoop Reporting and Analysis - Jaspersoft

Three Approaches to Big Data Analysis

Approach: Data Exploration
• Use Case: For data analysts and data scientists who want to discover real-time patterns as they emerge from their Big Data content
• Latency: Low
• Big Data: HBase, NoSQL, Analytic DBMS
• Connectivity: Native
• Architecture: BI Platform, In-Memory Engine, Native (Multi-Dimensional Analysis)

Approach: Operational Reporting
• Use Case: For executives and operational managers who want summarized, pre-built daily reports on Big Data content
• Latency: Medium
• Big Data: Hive, NoSQL, Analytic DBMS
• Connectivity: Native, SQL
• Architecture: BI Platform, Native SQL (Reports & Dashboards)

Approach: Analytics
• Use Case: For data analysts and operational managers who want to analyze historical trends based upon pre-defined questions in their Big Data content
• Latency: High
• Big Data: Hadoop, NoSQL, Analytic DBMS
• Connectivity: ETL
• Architecture: BI Platform, OLAP Engine, Data Mart, ETL (Multi-Dimensional Analysis)
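
As a rough illustration of how the comparison on this slide might be used to pick an approach, the sketch below encodes the latency, Big Data, and connectivity rows as plain Python data. The `Approach` class and the `approaches_for` helper are hypothetical names invented for this example; they are not part of any Jaspersoft product or API, and the selection rule (data store listed for the approach, latency no higher than a stated ceiling) is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Approach:
    name: str
    latency: str        # "Low", "Medium", or "High", as listed on the slide
    stores: tuple       # entries from the "Big Data" row
    connectivity: str   # entry from the "Connectivity" row


# Values copied from the slide's comparison; the structure itself is illustrative.
APPROACHES = (
    Approach("Data Exploration", "Low", ("HBase", "NoSQL", "Analytic DBMS"), "Native"),
    Approach("Operational Reporting", "Medium", ("Hive", "NoSQL", "Analytic DBMS"), "Native, SQL"),
    Approach("Analytics", "High", ("Hadoop", "NoSQL", "Analytic DBMS"), "ETL"),
)

LATENCY_ORDER = {"Low": 0, "Medium": 1, "High": 2}


def approaches_for(store, max_latency):
    """Return approach names that list `store` and whose latency is within `max_latency`."""
    limit = LATENCY_ORDER[max_latency]
    return [
        a.name
        for a in APPROACHES
        if store in a.stores and LATENCY_ORDER[a.latency] <= limit
    ]


print(approaches_for("NoSQL", "Medium"))
print(approaches_for("Hive", "High"))
```

The helper only filters the slide's own rows; a real choice would also weigh the use-case descriptions for each approach.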

Page 27: Hadoop Reporting and Analysis - Jaspersoft

Jaspersoft’s Hadoop Difference

Advanced Hadoop integration

• Only BI provider that can support 3 approaches to Hadoop analytics: Live Exploration, Batch Analysis, Batch Reporting
• Direct, native connectors to Hive and HBase

Broad partnerships

Deep knowledge and ecosystem

Page 28: Hadoop Reporting and Analysis - Jaspersoft

Jaspersoft 5 Demo

“We've taken the desktop power of data visualization tools, built it to scale on the HTML5 web, and made it embeddable within any app, device or portal”

Page 29: Hadoop Reporting and Analysis - Jaspersoft

Hortonworks Snapshot

• We distribute the only 100% Open Source Enterprise Hadoop Distribution: Hortonworks Data Platform

• We engineer, test & certify HDP for enterprise usage

• We employ the core architects, builders and operators of Apache Hadoop

• We drive innovation within Apache Software Foundation projects

• We are uniquely positioned to deliver the highest quality of Hadoop support

• We enable the ecosystem to work better with Hadoop

Develop Distribute Support

We develop, distribute and support the ONLY 100% open source Enterprise Hadoop distribution

Endorsed by Strategic Partners

Headquarters: Palo Alto, CA
Employees: 180+ and growing
Investors: Benchmark, Index, Yahoo

Page 30: Hadoop Reporting and Analysis - Jaspersoft

Hortonworks Approach

Identify and introduce enterprise requirements into the public domain

Work with the community to advance and incubate open source projects

Apply Enterprise Rigor to provide the most stable and reliable distribution

Community Driven Enterprise Apache Hadoop

Page 31: Hadoop Reporting and Analysis - Jaspersoft

The Intelligence Inside

Thank You

[email protected]

