+ All Categories
Home > Documents > Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5....

Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5....

Date post: 29-Aug-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
49
David Tarrant · @davetaz Finding Stories in Data http://training.theodi.org/Malaysia
Transcript
Page 1: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

David Tarrant · @davetaz

Finding Stories in Data

http://training.theodi.org/Malaysia

Page 2: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Aim

Improve understanding of how to source, analyse and visualise data to discover insight and tell stories.

Page 3: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Aims

Identify a number of data driven stories. Understand the stages in creating an open data story. Create your own open data story using analysis tools. Visualise your findings using an interactive graphic.

Page 4: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Examples: Telling stories with data

Page 5: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating
Page 6: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

http://ampp3d.mirror.co.uk/

Page 7: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Data is a source

Page 8: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Two main methods of using data as a source

1.  Story first – data used to enhance, fact check, dig deeper

2.  Data first – story found/presented through data analysis

Page 9: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Story, then data

Page 10: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Data – then story

Page 11: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Thanks to David Ottewell Head of Data Journalism Trinity Mirror (Regionals) for permission in using this slide

Page 12: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Thanks to David Ottewell Head of Data Journalism Trinity Mirror (Regionals) for permission in using this slide

Page 13: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Thanks to David Ottewell Head of Data Journalism Trinity Mirror (Regionals) for permission in using this slide

Page 14: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Using data as a source ≠ must have visualisation

http://www.ft.com/cms/s/0/4b1a2f64-2048-11e3-9a9a-00144feab7de.html

Page 15: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Using data as a source ≠ (necessarily) big investigation

http://ampp3d.mirror.co.uk/2014/02/26/the-eu-could-ban-roaming-charges-completely-this-year/

Page 16: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Data for education

Page 17: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

http://aviation.live.kiln.it/

Page 18: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

h"p://www.ny*mes.com/newsgraphics/2013/08/18/reshaping-­‐new-­‐york/  

Page 19: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Self discovery

Page 20: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Show me the money

http://smtm.labs.theodi.org/

Page 21: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

LFB Fire Station Closures

http://london-fire.labs.theodi.org/

Page 22: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Data discovery patterns

Page 23: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Finding data on the web (of documents)

•  Government data

•  Google advanced

•  Aggregators and portals

•  Scraping

Page 24: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Government data

Page 25: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

data.gov.XX

Page 26: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Google advanced

site:  Get  results  only  from  certain  sites  or  domains    link:  Find  pages  that  link  to  a  certain  page      related:  Find  sites  similar  to  one  you  already  know    filetype:  Find  certain  file  types  only    

Page 27: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Aggregators and portals

Collect together data from across the web into one place.

enigma.io   transportAPI  

Page 28: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Scraping If you can’t obtain usable data (csv, xls) then you may have to resort to scraping.

scraperwiki.com   import.io  

Page 29: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Finding data on the web (of data) 1.  Add random extensions (.xml, .json, .csv etc) 2.  Look for alternative links (rss feeds etc)

3.  Look for embedded data 4.  Do some content negotiation 5.  Spot the API 6.  Scrape (or search google again)

How  the  web  should  work,  but  people  forgot  that  Tim  put  this  in  when  he  invented  it!  

Page 30: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Duck typed data

If it looks like a duck and quacks like a duck,then it’s probably a duck. Basically, keep an eye out for tables, lists and other stuff that looks like data.

Page 31: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

1. Adding random extensions

UK  Trade  Tariff  

Try  using  the  following:      .csv      .json      .xml      .rss      .rdf  

BBC  Music  and  Programmes  

Page 32: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

2. Look for alternative links

Scroll  down!  

Page 33: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

2. Look for alternative links

RSS

Page 34: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Finding data on the web (of data) 1.  Add random extensions (.xml, .json, .csv etc) 2.  Look for alternative links (rss feeds etc)

3.  Look for embedded data 4.  Do some content negotiation 5.  Spot the API 6.  Scrape (or search google again)

Techniques  3-­‐5  are  not  covered  in  this  session.  Please  

ask  your  trainer  for  more  informa*on  if  there  is  *me.  

Page 35: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Exercise: Find a story http://bit.ly/odi-stories3

http://en.wikipedia.org/wiki/File:Stöwer_Titanic.jpg

Page 36: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

1.  Go through (some of) the Checklist for Exploring Data 2.  Create a pivot table 3.  Five ideas for finding a story

q  Choose one story you want to tellq  Create a headlineq  (Bonus: Create a chart, e.g. with

datawrapper.de)

Exercise

Page 37: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

http://www.theguardian.com/news/datablog/2012/may/24/data-journalism-punk

Page 38: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

The data percolator

Page 39: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Data percolation: A model of data preparation and analysis

Gather

Prepare

Produce

See also: the Data Journalism Handbook

Page 40: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Gather

Prepare

Produce

Data percolation: A model of data preparation and analysis

Page 41: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Explore

How should I budget my time?

Gather

Prepare

Produce

Page 42: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating
Page 43: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating
Page 44: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating
Page 45: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

How should I budget my time?

Gather

Prepare

Produce

1.1 FIND reliable data sources

1.2 Understand your RIGHTS

1.3 Visualise and UNDERSTAND your data

2.1 CLEAN your data

2.2 TRANSFORM it where useful

2.3 COMBINE it with other data sets

3.1 REDUCE and find the story

3.2 Think and understand the CONTEXT

3.3 Do your results pass a SENSE-CHECK?

Page 46: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Time planning

Gather Produce

Prepare

2.1 CLEAN

2.2 TRANSFORM

2.3 COMBINE

2.4 ENRICH 2.5 ANALYSE

Page 47: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Tools

Excel Refine Datawapper.ie Plot.ly CartoDB d3.js

Page 48: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

Chose one or more of the following to try: q  Data cleaning in Refineq  Enriching data to discover insight (in Refine)q  Creating interactive graphics in Plot.lyq  Creating interactive maps in cartoDB.q  Building you own interactive website and d3.js

visualisation.

Exercises

Page 49: Finding Stories in Datatraining.theodi.org/resources/FindingStories_Malaysia.pdf · 2020. 5. 27. · Aims Identify a number of data driven stories. Understand the stages in creating

David Tarrant · @davetaz

Thank You

http://training.theodi.org/Malaysia


Recommended