+ All Categories
Home > Technology > Fire in the Hole: How a Spark-Powered Platform Charges Analytics

Fire in the Hole: How a Spark-Powered Platform Charges Analytics

Date post: 16-Jul-2015
Category:
Upload: inside-analysis
View: 151 times
Download: 0 times
Share this document with a friend
Popular Tags:
43
Grab some coffee and enjoy the pre-show banter before the top of the hour!
Transcript

Grab some coffee and enjoy the pre-show banter

before the top of the

hour!

The Briefing Room

Fire in the Hole: How a Spark-Powered Platform Charges Analytics

Twitter Tag: #briefr

The Briefing Room

Welcome

Host: Eric Kavanagh

[email protected] @eric_kavanagh

Twitter Tag: #briefr

The Briefing Room

!   Reveal the essential characteristics of enterprise software, good and bad

!   Provide a forum for detailed analysis of today’s innovative technologies

!  Give vendors a chance to explain their product to savvy analysts

!   Allow audience members to pose serious questions... and get answers!

Mission

Twitter Tag: #briefr

The Briefing Room

Topics

2014 Editorial Calendar at www.insideanalysis.com/webcasts/the-briefing-room

This Month: ANALYTIC PLATFORMS

November: DISCOVERY & VISUALIZATION

December: INNOVATORS

Twitter Tag: #briefr

The Briefing Room

Executive Summary

Ø The earth is SHAKING

Ø Remain FLEXIBLE Ø EXPECT the future Ø Prepare for CHANGE

Twitter Tag: #briefr

The Briefing Room

Analyst: Robin Bloor

Robin Bloor is Chief Analyst at The Bloor Group

[email protected] @robinbloor

Twitter Tag: #briefr

The Briefing Room

Platfora

! Platfora is a big data analytics company

!   Its Big Data Analytics platform runs natively on the open source Apache Hadoop framework, and it delivers analytics over petabyte-scale data

!   The recently released Platfora 4.0, built on Apache Spark, includes Geo Analytics capabilities and added Advanced Visualizations

Twitter Tag: #briefr

The Briefing Room

Guest: Denise Hemke

As a Director of Product Management, Denise Hemke focuses on building BI so business analysts have self-service access to petabyte scale data. Denise has been building enterprise products for the last 13 years in a variety of different industries. She is passionate about partnering with customers, engineering and design. She enjoys building enterprise products that solve real customer use cases with a consumer-quality aesthetic. Early in her career, she focused on the development of BI and management applications for AT&T data centers and their customers. At Salesforce, Denise managed teams

responsible for building monitoring & management, debugging, and productivity tools for use by R&D, Operations and customers. Denise also served as the Director of Engineering at Platfora. In that role, she was responsible for innovation and delivery of the customer surface area, which includes rendering large-scale visualizations.

@denisehemke @platfora

Disrupting the traditional analyst workflow with Platfora and Spark. Denise Hemke, Director of Products October 28, 2014

10

@denisehemke @platfora

Introducing Platfora

11

LEAD THE INDUSTRY TRANSITION FROM BUSINESS INTELLIGENCE TO BIG DATA ANALYTICS.

#1 Big Data Analytics platform native on Hadoop

MISSION

End-to-end platform built for Multi-Structured Data

Self-service, iterative, interactive, and fast

@denisehemke @platfora

Data Preparation

Analysis 10%

100%

Analyst

TODAY: “MULTI-STRUCTURED DATA ANALYSIS”

IT

2005 - TODAY “DATA

DISCOVERY”

Analyst

Platfora is the Only End-to-End Big Data Analytics Platform

@denisehemke @platfora 13

“Apache Spark is Hadoop's speedy Swiss Army knife.”

“Spark is making waves because it’s putting MapReduce on the endangered species list.”

“Apache lights a fire under Hadoop with Spark.”

@denisehemke @platfora

Spark: The New Foundation of Big Data Analytics

14

“Out of the box” advanced analytics

Beyond SQL only

Easier to access

Vendor neutral and open source

In-memory speed

Spark is winning

And, why it matters for your business

@denisehemke @platfora

Spark: The New Foundation of Big Data Analytics

15

“Out of the box” advanced analytics

Beyond SQL only

Easier to access

Vendor neutral and open source

In-memory speed

Spark is winning

And, why it matters for your business

@denisehemke @platfora

Spark: The New Foundation of Big Data Analytics

16

“Out of the box” advanced analytics

Beyond SQL only

Easier to access

Vendor neutral and open source

In-memory speed

Spark is winning

And, why it matters for your business

@denisehemke @platfora

Spark: The New Foundation of Big Data Analytics

17

“Out of the box” advanced analytics

Beyond SQL only

Easier to access

Vendor neutral and open source

In-memory speed

Spark is winning

And, why it matters for your business

@denisehemke @platfora

Spark: The New Foundation of Big Data Analytics

18

“Out of the box” advanced analytics

Beyond SQL only

Easier to access

Vendor neutral and open source

In-memory speed

Spark is winning

And, why it matters for your business

@denisehemke @platfora

Spark: The New Foundation of Big Data Analytics

19

“Out of the box” advanced analytics

Beyond SQL only

Easier to access

Vendor neutral and open source

In-memory speed

Spark is winning

And, why it matters for your business

@denisehemke @platfora

Let’s Avoid the Pitfalls of the Past

20

DATA ADMIN He’s

overwhelmed by the number

of data preparation requests.

DATA SCIENTIST

She’s focused on mundane work instead of high value projects.

C-SUITE

He’s falling behind the

competition.

BUSINESS ANALYST

He can’t answer his questions fast

enough.

The big data analytics workflow is broken and needs to be fixed

@denisehemke @platfora

Let’s Avoid the Pitfalls of the Past

21

DATA ADMIN He’s

overwhelmed by the number

of data preparation requests.

DATA SCIENTIST

She’s focused on mundane work instead of high value projects.

C-SUITE

He’s falling behind the

competition.

BUSINESS ANALYST

He can’t answer his questions fast

enough.

 

The big data analytics workflow is broken and needs to be fixed

@denisehemke @platfora

Let’s Avoid the Pitfalls of the Past

22

DATA ADMIN He’s

overwhelmed by the number

of data preparation requests.

DATA SCIENTIST

She’s focused on mundane work instead of high value projects.

C-SUITE

He’s falling behind the

competition.

BUSINESS ANALYST

He can’t answer his questions fast

enough.

 

The big data analytics workflow is broken and needs to be fixed

@denisehemke @platfora

Let’s Avoid the Pitfalls of the Past

23

DATA ADMIN He’s

overwhelmed by the number

of data preparation requests.

DATA SCIENTIST

She’s focused on mundane work instead of high value projects.

C-SUITE

He’s falling behind the

competition.

BUSINESS ANALYST

He can’t answer his questions fast

enough.

 

The big data analytics workflow is broken and needs to be fixed

@denisehemke @platfora

Let’s Avoid the Pitfalls of the Past

24

DATA ADMIN He’s

overwhelmed by the number

of data preparation requests.

DATA SCIENTIST

She’s focused on mundane work instead of high value projects.

C-SUITE

He’s falling behind the

competition.

BUSINESS ANALYST

He can’t answer his questions fast

enough.

 

The big data analytics workflow is broken and needs to be fixed

@denisehemke @platfora

Platfora is Laying the Foundation for the Future of Big Data Analytics

25

Built on Spark The definitive

end-to-end Business Analyst workflow

built on Spark

Next-gen Data Preparation Fast, smart, and powerful

data preparation seamlessly integrated into

the full-stack

Platfora Platform Extensions

Adapt the platform to your data and questions while amplifying the work

of developers

@denisehemke @platfora

The Definitive End-to-End Business Analyst Workflow

26

You can use data science to get better answers • Access to advanced analytics processing models

You don’t have to write the code yourself • Integrated into the full-stack platform

You’re not stuck in history • Always running on the latest technology

Built on Spark

@denisehemke @platfora

Data Preparation Integrated into an End-to-End Platform

27

You get revolutionary time to value • Natively integrated full-stack solution

So simple your business users can do it • Making data prep visual, safe, and intelligent

Powerful for your enterprise • Built to handle enterprise scale big data projects

Next-gen data preparation

@denisehemke @platfora

Configurable Platform Extensions That Meet the Needs of any Business

28

Platfora Platform Extensions (PPE)

You can stop trying to find a BI system that can answer your questions out of the box • Adapt the platform to your data and questions

You don’t have to repeat the process, every time your questions change • Reusable and configurable

You have all the power in the world • Utilize all of Spark, including access any data source

@denisehemke @platfora 29

DEMO

@denisehemke @platfora 30

Questions?

Denise Hemke Director of Products Platfora [email protected] @denisehemke

@denisehemke @platfora 31

You should know.

Twitter Tag: #briefr

The Briefing Room

Perceptions & Questions

Analyst: Robin Bloor

Lightning Analytics? Is it Real?

Robin Bloor, PhD

Data Analytics: A Process

u  Data Analytics is a multi-disciplinary end-to-end repetitive process

u  It changed, because of: • Data availability ++ • Parallel technology •  Scalable software • Open source tools • M/C Learning

Data Access

Data Prep

Model

Analyze

Deploy

Execute

Does it Matter if it’s Not the Fastest?

u  It is ITERATIVE

u  The speed of the END-TO-END PROCESS matters

u  And, the impact of the technology on the ANALYTICAL PROCESS matters

The situation is COMPLEX rather than simple. Analytics is not Formula 1 racing, but:

The Full Monty

Analytical Latencies

1.   Data access

2.   Data preparation

3.   Model development

4.   Execution

5.   Implementation

6.   Model audit & update

This is where the rubber meets the road:

SPEED = VALUE

The Impending Reality

Technology can speed up analytics by two orders of magnitude

(on the IT side)

THIS WILL CHANGE ANALYTICS

u  Are we really short on data scientists? Or are we short on fast analytics tools?

u  Is the data really big?

u  Please comment on analytical workloads: • What do you see as the natural IT bottlenecks? • What do you see as the natural business bottlenecks?

u  Do we want business analysts to become ersatz data scientists?

u  In respect to scale, what is your largest implementation by data volume, and what was the industry sector/problem space?

u  What do you see as the largest barrier to adoption of Platfora?

Twitter Tag: #briefr

The Briefing Room

Twitter Tag: #briefr

The Briefing Room

Upcoming Topics

www.insideanalysis.com

2014 Editorial Calendar at www.insideanalysis.com/webcasts/the-briefing-room

2015 Editorial Calendar coming soon!

This Month: ANALYTIC PLATFORMS

November: DISCOVERY & VISUALIZATION

December: INNOVATORS

Twitter Tag: #briefr

The Briefing Room

THANK YOU for your

ATTENTION!

Some images provided courtesy of Wikimedia Commons and Wikipedia


Recommended