+ All Categories
Home > Technology > DataEngConf SF16 - Bridging the gap between data science and data engineering

DataEngConf SF16 - Bridging the gap between data science and data engineering

Date post: 13-Feb-2017
Category:
Upload: hakka-labs
View: 1,215 times
Download: 0 times
Share this document with a friend
27
Data Engineering and Data Science: Bridging the Gap
Transcript
Page 1: DataEngConf SF16 - Bridging the gap between data science and data engineering

Data Engineering and Data Science:Bridging the Gap

Page 2: DataEngConf SF16 - Bridging the gap between data science and data engineering

About Me● Slack’s Head of Data

Engineering

● Used to work at Cloudera, Google, other places

● Wrote popular tweet that I’m sort of tired of talking about

● Only owns one hat

Page 3: DataEngConf SF16 - Bridging the gap between data science and data engineering

First Question: Why Bother?

Page 4: DataEngConf SF16 - Bridging the gap between data science and data engineering

In The Beginning...

Page 5: DataEngConf SF16 - Bridging the gap between data science and data engineering

What Do We Both Want?

Page 6: DataEngConf SF16 - Bridging the gap between data science and data engineering

Infinite Loop of Sadness

Data Eng

Ops

Data Science

Business

Page 7: DataEngConf SF16 - Bridging the gap between data science and data engineering

Alone Together

Page 8: DataEngConf SF16 - Bridging the gap between data science and data engineering

Back To First Principles

Page 9: DataEngConf SF16 - Bridging the gap between data science and data engineering

Deploying Kafka

Page 10: DataEngConf SF16 - Bridging the gap between data science and data engineering

Infinite Loop of Sadness Empathy

Data Eng

Ops

Data Science

Business

Page 11: DataEngConf SF16 - Bridging the gap between data science and data engineering

Rage In Support Of The Machine

Page 12: DataEngConf SF16 - Bridging the gap between data science and data engineering

Everybody ETLs

Page 13: DataEngConf SF16 - Bridging the gap between data science and data engineering

Option 1: SQL-Centric ETL

Page 14: DataEngConf SF16 - Bridging the gap between data science and data engineering

Option 2: JVM-Centric ETL

Page 15: DataEngConf SF16 - Bridging the gap between data science and data engineering

A Third Way

Page 16: DataEngConf SF16 - Bridging the gap between data science and data engineering

#1: The Rise of Spark

Page 17: DataEngConf SF16 - Bridging the gap between data science and data engineering

#2: Too Many Streaming Engines

Page 18: DataEngConf SF16 - Bridging the gap between data science and data engineering

#3: Streaming Design Patterns

Page 19: DataEngConf SF16 - Bridging the gap between data science and data engineering

#4: A Focus On The Real Problem

Page 20: DataEngConf SF16 - Bridging the gap between data science and data engineering

Inspiration From Deep Learning

Page 21: DataEngConf SF16 - Bridging the gap between data science and data engineering

Not Quite Static Typing

Page 22: DataEngConf SF16 - Bridging the gap between data science and data engineering

Glitch

Page 23: DataEngConf SF16 - Bridging the gap between data science and data engineering

Table Declarations

Page 24: DataEngConf SF16 - Bridging the gap between data science and data engineering

Scripting >> UDFs...

Page 25: DataEngConf SF16 - Bridging the gap between data science and data engineering

...but SQL when you need it

Page 26: DataEngConf SF16 - Bridging the gap between data science and data engineering

A Brave New World

Page 27: DataEngConf SF16 - Bridging the gap between data science and data engineering

Thanks!(oh and we’re hiring)


Recommended