Page 1
data science @ The New York Times
[email protected] @nytimes.com@chrishwiggins
references: bit.ly/icerm
Page 2
data science: mindset & toolset
drew conway, 2010
references: bit.ly/icerm
Page 3
modern history:2009
references: bit.ly/icerm
Page 4
modern history:2009
references: bit.ly/icerm
Page 5
data science @ The New York Times
[email protected] @nytimes.com@chrishwiggins
references: bit.ly/icerm
Page 6
biology: 1892 vs. 1995
references: bit.ly/icerm
Page 7
biology: 1892 vs. 1995
references: bit.ly/icerm
new toolset, new mindset
Page 8
1851
references: bit.ly/icerm
Page 9
news: 20th century
church state
references: bit.ly/icerm
Page 10
church
references: bit.ly/icerm
Page 11
church
references: bit.ly/icerm
Page 13
news: 20th century
church state
references: bit.ly/icerm
Page 14
news: 21st century
church state
engineering
references: bit.ly/icerm
Page 15
news: 21st century
church state
engineering: DSE
references: bit.ly/icerm
Page 16
1851 1996
newspapering: 1851 vs. 1996
references: bit.ly/icerm
Page 17
example:
millions of views per hour2015
Page 18
references: bit.ly/icerm
Page 19
references: bit.ly/icerm
"...social activities generate large quantities of potentially valuable data...The data were not generated for the purpose of learning; however, the potential for learning is great’’
Page 20
references: bit.ly/icerm
"...social activities generate large quantities of potentially valuable data...The data were not generated for the purpose of learning; however, the potential for learning is great’’ - J Chambers, Bell Labs,1993
Page 21
data science: the web
references: bit.ly/icerm
Page 22
data science: the web
is your “online presence”
references: bit.ly/icerm
Page 23
data science: the web
is a microscope
references: bit.ly/icerm
Page 24
data science: the web
is an experimental tool
references: bit.ly/icerm
Page 25
data science: the web
is an optimization tool
references: bit.ly/icerm
Page 26
1851 1996
newspapering: 1851 vs. 1996 vs. 2008
2008
references: bit.ly/icerm
Page 27
“a startup is a temporary organization in search of a repeatable and scalable business model” —Steve Blank
references: bit.ly/icerm
Page 28
every publisher is now a startup
references: bit.ly/icerm
Page 29
every publisher is now a startup
Page 31
news: 21st century
church state
engineering
references: bit.ly/icerm
Page 32
news: 21st century
church state
engineering
references: bit.ly/icerm
Page 33
learnings
references: bit.ly/icerm
Page 34
learnings
- predictive analytics- descriptive analytics- prescriptive analytics
references: bit.ly/icerm
Page 35
learnings
- predictive analytics- descriptive analytics- prescriptive analytics
cf. modelingsocialdata.org
references: bit.ly/icerm
Page 36
predictive analytics, e.g.,
cf. modelingsocialdata.org
Page 37
predictive analytics, e.g.,
“the funnel”
cf. modelingsocialdata.org
Page 38
interpretable predictive analytics
supe
r co
ol s
tuff
cf. modelingsocialdata.org
Page 39
interpretable predictive analytics
supe
r co
ol s
tuff
cf. modelingsocialdata.org
arxiv.org/abs/q-bio/0701021
Page 40
optimization & learning, e.g.,
“How The New York Times Works “popular mechanics, 2015
Page 41
optimization & prediction, e.g.,
“How The New York Times Works “popular mechanics, 2015
(some models)
(som
e mo
neys
)
Page 42
recommendation as predictive analytics
Page 43
recommendation as predictive analytics
bit.ly/AlexCTM
Page 44
descriptive analytics, e.g,
cf. daeilkim.com ; import bnpy
Page 45
prescriptive analytics, e.g,
Page 46
prescriptive analytics, e.g,
Page 47
prescriptive analytics, e.g,
Page 48
Reporting
Learning
Test
Optimizing
Exploredescriptive:
predictive:
prescriptive:
Page 49
Reporting
Learning
Test
Optimizing
Exploredescriptive:
predictive:
prescriptive:
Page 50
common requirements in data science:
Page 51
common requirements in data science:
1. people2. ideas3. things
cf. USAF
Page 52
things:what does DS team deliver?
Page 53
things:what does DS team deliver?
- build data prototypes- build APIs- impact roadmaps
Page 54
- build data prototypes
Page 55
- build data prototypes
cf. daeilkim.com
Page 56
- build data prototypes
cf. daeilkim.com
Page 57
- in puppet, w/python2.7- collaboration w/pers. team
- build APIs
Page 58
- impact roadmaps
flickr/McJex
Page 59
data science: ideas
Page 60
data skills
- data engineering- data science- data visualization- data product- data multiliteracies- data embeds
cf. “data scientists at work”, ch 1
Page 61
data skills
- data engineering- data science- data visualization- data product- data multiliteracies- data embeds
cf. “data scientists at work”, ch 1
Page 62
data skills
- data engineering- data science- data visualization- data product- data multiliteracies- data embeds
cf. “data scientists at work”, ch 1
Page 63
data science: people
- new mindset > new toolset
Page 64
data science: people
Page 65
summary:pay attention to:
1. people2. ideas3. things
cf. USAF
Page 66
thanks to the data science team!
Page 67
data science @ The New York Times
[email protected] @nytimes.com@chrishwiggins