
Beginner's Guide to Getting Public Data into the Classroom

Date post: 27-Jan-2017
Upload: shawn-handran
BEGINNER’S GUIDE TO GETTING PUBLIC DATA INTO THE CLASSROOM PRESENTED OCTOBER 17, 2015 SOCIETY FOR SCIENCE AND THE PUBLIC TEACHER CONFERENCE WASHINGTON DC Shawn Handran, Ph.D.
Transcript
Page 1: Beginner's Guide to Getting Public Data into the Classroom

BEGINNER’S GUIDE TO GETTING PUBLIC DATA INTO THE CLASSROOM
PRESENTED OCTOBER 17, 2015
SOCIETY FOR SCIENCE AND THE PUBLIC TEACHER CONFERENCE
WASHINGTON DC

Shawn Handran, Ph.D.

Page 2: Beginner's Guide to Getting Public Data into the Classroom

About me
10 years in academic research
  Montana State BS, Washington Univ. in St. Louis PhD, Harvard Medical School post-doc
7 years in biotechnology
  Genomics, bioinformatics, HT screening & imaging
4 years in Non-Profit sector
  Foundation/fundraising database research
4th year of teaching at FCS
  AP Biology, AP Statistics, Biotechnology

Page 3: Beginner's Guide to Getting Public Data into the Classroom

Getting public data into the classroom

Stimulate intrinsic interest
Keep barriers to entry low
Get to a good comfort level
Gradual release
[Chart: % of content devoted to each step]

Page 4: Beginner's Guide to Getting Public Data into the Classroom

Getting Public Data into the Classroom

STEP 1: STIMULATE INTRINSIC INTEREST

Page 5: Beginner's Guide to Getting Public Data into the Classroom

Stimulate intrinsic interest
Teachers often mandate the parameters
  Too much control stifles creativity
Give students ownership
  Ownership drives interest level and engagement
Provide guidance
  Yes, some parameters are still required!

Page 6: Beginner's Guide to Getting Public Data into the Classroom

Some recurring themes
Music
Sports
Whatever example you just showed in class

Page 7: Beginner's Guide to Getting Public Data into the Classroom

Getting Public Data into the Classroom

STEP 2: KEEP BARRIERS TO ENTRY LOW

Page 8: Beginner's Guide to Getting Public Data into the Classroom

Keep barriers to entry low
Datasets
  Ease of access
  Dataset format
Data analysis
  Cost
  Ease of use

Page 9: Beginner's Guide to Getting Public Data into the Classroom

Dataset barriers to entry
Ease of access (arranged along the slide's "Difficulty" scale, in slide order)
  HTML tables
  Downloadable files
  Copy and paste
  Query database/forms
  PDF

Page 10: Beginner's Guide to Getting Public Data into the Classroom

Dataset barriers to entry
Dataset format (arranged along the slide's "Difficulty" scale, in slide order)
  HTML: use the Import HTML Table function
  Text format (csv, tab) or Excel (xls, xlsx)
  Query forms
  Simple database files (e.g., Access)
  Complex database files
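The text formats named on this slide (csv and tab-delimited) are easy to read because each is a header line followed by rows with a single separator character. The following is a minimal Python sketch of that idea, using only the standard library. The function names and the sample rows are hypothetical, invented for illustration; they are not from the presentation.

```python
import csv
import io


def load_delimited(text, delimiter=","):
    """Parse delimited text (csv or tab) into a list of row dictionaries."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    return [dict(row) for row in reader]


def to_number(value):
    """Convert a cell to float when possible; otherwise return it unchanged."""
    try:
        return float(value)
    except ValueError:
        return value


# Hypothetical sample rows, invented for illustration only.
SAMPLE_CSV = "team,wins,losses\nAlpha,90,72\nBeta,81,81\n"
SAMPLE_TAB = SAMPLE_CSV.replace(",", "\t")

csv_rows = load_delimited(SAMPLE_CSV)
tab_rows = load_delimited(SAMPLE_TAB, delimiter="\t")
```

Both calls yield the same rows; only the separator differs. Every cell arrives as text, so numeric columns need a conversion step such as `to_number` before any arithmetic.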

Page 11: Beginner's Guide to Getting Public Data into the Classroom

Keep Barriers to Entry Low

TUTORIAL 1: IMPORT HTML TABLE INTO GOOGLE SHEETS

Page 12: Beginner's Guide to Getting Public Data into the Classroom

MLB 2014 AL team summary stats
Baseball-Reference.com http://goo.gl/5RU0Gt

Page 13: Beginner's Guide to Getting Public Data into the Classroom
Page 14: Beginner's Guide to Getting Public Data into the Classroom
Page 15: Beginner's Guide to Getting Public Data into the Classroom

Import HTML Table template
Google Sheets https://goo.gl/PTv7vl

Page 16: Beginner's Guide to Getting Public Data into the Classroom

Import HTML Table

Page 17: Beginner's Guide to Getting Public Data into the Classroom

Import HTML Table
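In Google Sheets, the built-in function for this task is IMPORTHTML(url, query, index), for example =IMPORTHTML(A1, "table", 1) with the page URL in cell A1. The transcript does not show the template's contents, so it is an assumption that the template uses this function. As a rough sketch of what such an import does, the Python below pulls the Nth table out of an HTML string using only the standard library. The class name, function name, and sample HTML are hypothetical, and nested tables are not handled.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect every <table> in a page as a list of rows of cell text."""

    def __init__(self):
        super().__init__()
        self.tables = []   # one list of rows per table found
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Keep text only while inside a cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def import_html_table(html, index=1):
    """Return the index-th table (1-based, as in Sheets) from an HTML string."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    return parser.tables[index - 1]


# Hypothetical page content, invented for illustration only.
SAMPLE_HTML = """
<table><tr><th>Tm</th><th>W</th></tr>
<tr><td>AAA</td><td>90</td></tr></table>
<table><tr><th>x</th></tr></table>
"""

first_table = import_html_table(SAMPLE_HTML, 1)
```

The index matters on pages that hold several tables: students often need to try 1, 2, 3 and so on until the table they want appears.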

Page 18: Beginner's Guide to Getting Public Data into the Classroom

Two functions of data analysis
  Data handling
  Data visualization
Most programs do both but some not well
You’ll often use multiple programs

Page 19: Beginner's Guide to Getting Public Data into the Classroom

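Data analysis: cost vs. ease of use
[Chart: programs plotted by cost (Free to $$$) against ease of use (Hard to Easy); individual positions are not recoverable from the transcript]
Programs shown: R, Stata, SAS, OpenOffice, StatCrunch, Google Sheets, Numbers, Gapminder, JMP, Excel, Publisher, Minitab, Fathom*, Tableau Public, SPSS, Illustrator, Tableau
*discontinued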

Page 20: Beginner's Guide to Getting Public Data into the Classroom

Spreadsheet/graphing programs
Advantages
  Free or close to free (except Excel)
  Good selection of canned graphs
Disadvantages
  Challenging for students to learn
  Requires a lot of wizard-level hacking/tweaking
Winners: Google Sheets, MS Excel

Page 21: Beginner's Guide to Getting Public Data into the Classroom

Statistical programs
Advantages
  Handles large datasets faster and better than Excel
  Designed for statistical analysis
  Handles variables seamlessly
  More graph options and better graph editing tools than Excel
  About the same learning curve as Excel for simple functions
Disadvantages
  Moderate to high cost, even with academic pricing
  More sophisticated graphs or analyses require mad skills
  Poor graphic export options
Winners: JMP, Minitab

Page 22: Beginner's Guide to Getting Public Data into the Classroom

Minitab

StatCrunch

Dataset size: 286K

Page 23: Beginner's Guide to Getting Public Data into the Classroom

Graphic design programs
Advantages
  Perfect control over every graphic element
  Final output looks stunning and is scalable
Disadvantages
  Zero data handling and analysis capability
  Huge learning curve
  Expensive
Winner: Adobe Illustrator
Runner up: Microsoft Publisher (poor man’s Illustrator)

Page 24: Beginner's Guide to Getting Public Data into the Classroom

Tableau Public
Advantages
  Free including 10GB online storage
  Handles humongous datasets
  Interactive with mouse-over information
  Easy to use for simple datasets and graphs
Disadvantages
  Everything you create is public
  Data handling is limited and removing variables can be tedious (but not always)
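One way around the tedious variable removal is to trim the file before uploading it, so only the needed columns ever reach Tableau Public. The Python below is a minimal sketch of that workaround using only the standard library. It is a suggestion, not the presenter's method, and the function name, column names, and sample row are hypothetical.

```python
import csv
import io


def keep_columns(csv_text, wanted):
    """Return csv_text reduced to the columns named in `wanted`, in that order."""
    reader = csv.DictReader(io.StringIO(csv_text))
    out = io.StringIO()
    # extrasaction="ignore" silently drops every column not listed in `wanted`.
    writer = csv.DictWriter(
        out, fieldnames=wanted, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for row in reader:
        writer.writerow(row)
    return out.getvalue()


# Hypothetical data, invented for illustration only.
SAMPLE = "country,year,cause,deaths,notes\nX,2000,A,10,n/a\n"

trimmed = keep_columns(SAMPLE, ["country", "deaths"])
```

In a classroom, the same function could be applied to a downloaded file by reading its text first and writing the trimmed result to a new file for upload.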

Page 25: Beginner's Guide to Getting Public Data into the Classroom

Keep Barriers to Entry Low

TUTORIAL 2: TABLEAU PUBLIC

Page 26: Beginner's Guide to Getting Public Data into the Classroom

Tableau Public
Sign up and download desktop/mobile app https://public.tableau.com/s/
Upload a data file
Start tinkering!

Page 27: Beginner's Guide to Getting Public Data into the Classroom
Page 28: Beginner's Guide to Getting Public Data into the Classroom

WHO Mortality by Cause & Age
World Health Organization https://goo.gl/pYTHS4

Page 29: Beginner's Guide to Getting Public Data into the Classroom

Dataset size: 5.5K

Page 30: Beginner's Guide to Getting Public Data into the Classroom

Getting Public Data into the Classroom

STEP 3: GET TO A GOOD COMFORT LEVEL

Page 31: Beginner's Guide to Getting Public Data into the Classroom

Get to a good comfort level
Getting started: Survey of public datasets
Getting help: Learn from data experts
Getting acquainted: Make new friends

Disclaimer: these lists are by no means exhaustive!

Page 32: Beginner's Guide to Getting Public Data into the Classroom

Getting started: public datasets
Data.gov (186,000+ data sets) http://www.data.gov/
Big Machine Learning (BigML) blog post http://blog.bigml.com/list-of-public-data-sources-fit-for-machine-learning/

Page 33: Beginner's Guide to Getting Public Data into the Classroom

Getting started: public datasets
Gapminder Offline software (free) http://www.gapminder.org/downloads/
  Pre-loaded data!
  Cake walk easy to use!
  Dynamic and awesome looking!

Page 34: Beginner's Guide to Getting Public Data into the Classroom

Getting started: public datasets
HTML tables
  http://www.baseball-reference.com/
  http://www.billboard.com/archive/charts
  http://apps.who.int/gho/data/?theme=home

Page 35: Beginner's Guide to Getting Public Data into the Classroom

Getting started: public datasets
Download files (text, Excel)
  http://www.seanlahman.com/baseball-archive/statistics
  http://www.gapminder.org/data/
  https://data.cdc.gov/browse

Page 36: Beginner's Guide to Getting Public Data into the Classroom

Getting started: public datasets
Copy and Paste
  https://gssdataexplorer.norc.org/ (easy)
  http://espn.go.com/mlb/statistics (tedious)

Page 37: Beginner's Guide to Getting Public Data into the Classroom

Getting help: Learn from data experts
David McCandless http://www.informationisbeautiful.net/
Andy Kirk http://www.visualisingdata.com/blog/
Hans Rosling http://www.gapminder.org/videos/
Edward Tufte http://www.edwardtufte.com/tufte/

Page 38: Beginner's Guide to Getting Public Data into the Classroom

Getting acquainted: make friends
Here at this conference
On social media networks
  You’ll have better luck on LinkedIn and G+
Don’t be afraid to reach out

Page 39: Beginner's Guide to Getting Public Data into the Classroom

Getting Public Data into the Classroom

STEP 4: GRADUAL RELEASE

Page 40: Beginner's Guide to Getting Public Data into the Classroom

Gradual release
Model
  Don’t just show it; demo it live
Encourage
  Preferably in-class computer time/activities
Release and nudge
  More nudging leads to a higher-quality final product

Page 41: Beginner's Guide to Getting Public Data into the Classroom

Student Project: Billboard Top100
Student level: 12 (AP Statistics)
  International student
Public sources:
  Billboard Top 100, Radio, Digital 2014 http://www.billboard.com/archive/charts/2014
Moderate amount of nudging
  Mostly for language and cultural help

Page 42: Beginner's Guide to Getting Public Data into the Classroom
Page 43: Beginner's Guide to Getting Public Data into the Classroom

Student Project: MLB Hitting Stats
Student level: 12 (AP Statistics)
  Local student
Public source:
  http://espn.go.com/mlb/statistics
Low amount of nudging
  Student had an excellent grasp on statistical analysis

Page 44: Beginner's Guide to Getting Public Data into the Classroom

Less nudging, less complexity
[Charts: "MLB 2013 Home vs. Away Statistics" (Batting Average and On Base Percentage, Home vs. Away, axis 0.000 to 0.400) and "Home vs. Away On Base Percentage" (Home and Away, axis 0.21 to 0.48)]

Dataset size: 808
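A home vs. away comparison like this one comes down to grouping records by a category and averaging a statistic within each group. The Python below is a minimal sketch of that calculation. The function name and every number in it are hypothetical, invented for illustration; none are the student's data or results.

```python
from statistics import mean


def group_means(records, group_key, value_key):
    """Average `value_key` within each distinct value of `group_key`."""
    groups = {}
    for rec in records:
        groups.setdefault(rec[group_key], []).append(rec[value_key])
    return {name: mean(values) for name, values in groups.items()}


# Hypothetical on-base percentages, invented for illustration only.
RECORDS = [
    {"split": "Home", "obp": 0.330},
    {"split": "Home", "obp": 0.350},
    {"split": "Away", "obp": 0.310},
    {"split": "Away", "obp": 0.320},
]

means = group_means(RECORDS, "split", "obp")
```

A spreadsheet pivot table or a statistics package performs the same group-and-average step; the sketch only makes that step explicit.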

Page 45: Beginner's Guide to Getting Public Data into the Classroom

Tableau possibilities…
I recreated the same dataset in ~2 hours of work on Tableau Public and visualized 11K data points
https://goo.gl/ffreyb

Page 46: Beginner's Guide to Getting Public Data into the Classroom

Dataset size: 11K

Page 47: Beginner's Guide to Getting Public Data into the Classroom

Student Project: Nearest Stars
Student level: 8 (Energy Science)
Public sources:
  Nearby Stars Observatory http://nbso.org
  List of Nearest Stars (Wikipedia)
  Stellar Database http://stellar-database.com
  Hubble Space Telescope http://hubblesite.org
Extensive nudging
  Full disclosure: my daughter’s project

Page 48: Beginner's Guide to Getting Public Data into the Classroom

Dataset size: 528

Sirius star system image from Hubble Space Telescope

Page 49: Beginner's Guide to Getting Public Data into the Classroom

Re-envisioned in Tableau
Same dataset recreated on Tableau in ~1 hour
https://goo.gl/tR0Wvl

Page 50: Beginner's Guide to Getting Public Data into the Classroom
Page 51: Beginner's Guide to Getting Public Data into the Classroom

Getting Public Data into the Classroom

WRAPPING UP

Page 52: Beginner's Guide to Getting Public Data into the Classroom

Getting public data into the classroom
Get the students interested in something important to them (not to you)
Keep the barriers to entry low
  GapMinder, Google Sheets, Excel, Tableau
Get yourself trained and prepared
  You don’t need to be an expert!
Model it for them, then let them do it

Page 53: Beginner's Guide to Getting Public Data into the Classroom

Contact and links
LinkedIn: www.linkedin.com/in/shawnhandran

