+ All Categories
Home > Data & Analytics > Spark Summit Keynote by Shaun Connolly

Spark Summit Keynote by Shaun Connolly

Date post: 11-Jan-2017
Category:
Upload: spark-summit
View: 2,164 times
Download: 0 times
Share this document with a friend
11
Accelerating Enterprise Spark Shaun Connolly Hortonworks Strategy @shaunconnolly
Transcript
Page 1: Spark Summit Keynote by Shaun Connolly

Accelerating Enterprise Spark

Shaun ConnollyHortonworks Strategy

@shaunconnolly

Page 2: Spark Summit Keynote by Shaun Connolly

Apache Spark Unlocks Enormous Potential of Data in

the Enterprise

Page 3: Spark Summit Keynote by Shaun Connolly

Page 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved

Personalized Online Ads

Petabytes of Weblogs Analyzed with Spark at Scale• Data streams from a vast array of

desktop and mobile devices• 13 billion daily events processed

latency as low as 40 milliseconds • No data cleansing necessary prior

to analysis with Apache Spark• 2 clusters consolidated into 1

YARN-based HDP cluster• Launched new product Webtrends

Explore™ -- powered by HDP

Per-Customer Click Path

Web LogAnalysis

SQL Server Offload

“We’re able to…look at this data set and process it and do predictions, behavioral analysis. We can do things that allow us to determine ROI for different actions and behavioral patterns.”

Peter Crossley, Chief Architect

Behavioral Segmentation

Ad Click Predictions

LCV Analysis

Page 4: Spark Summit Keynote by Shaun Connolly

Page 4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

New Use Cases

Cable Company: Optimize Advertising• Monitor channel changes with Spark Streaming• Correlate changes with Ads/Programming• Allocate Ads real time: Show ads to user who are

watching a show and will stay for > over 20 seconds

Railroad Company: Real-time View of State of Track• Optimize the track and train maintenance • Large volume and granularity of track data• GeoSpatial analytics is critical

Page 5: Spark Summit Keynote by Shaun Connolly

Spark TrendsImplications for the Enterprise

Data API Enterprise Ready /”Hardened”

Data Science is still the Frontier

Page 6: Spark Summit Keynote by Shaun Connolly

Page 6 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

ETL, Streaming, Reporting, Analytics

Must Integrate into Existing Environments

A Critical Tool in the Enterprise Tool Box

The Data API

Page 7: Spark Summit Keynote by Shaun Connolly

Page 7 © Hortonworks Inc. 2011 – 2015. All Rights Reserved

HA, DR, Tooling, Debugging, Operations

Security, Encryption, Governance Models

Scale

Implications of Enterprise-Ready / “Hardened”

Page 8: Spark Summit Keynote by Shaun Connolly

Agile Analytics & Data Science

Need to Democratize

Easy and Better Tooling

Train and Encourage More People to Join Us

Page 9: Spark Summit Keynote by Shaun Connolly

Hortonworks Strategy for Enterprise Spark at Scale

Agile Analytics & Data Science

Accelerate Capabilities for the Enterprise

Innovate at the Core

Page 10: Spark Summit Keynote by Shaun Connolly

Stay tuned…. March 1

Page 11: Spark Summit Keynote by Shaun Connolly

Thank You!Shaun Connolly

@shaunconnolly


Recommended