+ All Categories
Home > Technology > Street Fighting Data Science

Street Fighting Data Science

Date post: 07-Dec-2014
Category:
Upload: benedikt-koehler
View: 2,055 times
Download: 3 times
Share this document with a friend
Description:
 
27
Street Fighting Data Science von @furukama (Benedikt Köhler, d.core) @jbenno (Jörg Blumtritt, Datarella) #rp13
Transcript
Page 1: Street Fighting Data Science

Street Fighting Data Science

von @furukama (Benedikt Köhler, d.core) @jbenno (Jörg Blumtritt, Datarella)

#rp13

Page 2: Street Fighting Data Science

Street Fighting Data Science

• Umnutzen vorhandener Daten (Tweets -> Bewe- gungsgeschwindigkeit)

• Umwidmen von Methoden (BioTech -> Sozialwissenschaften)

• Agile Ad-hoc-Analysen

• Improvisation

http://en.wikipedia.org/wiki/File:Fightingmanstones.jpg

Page 3: Street Fighting Data Science

Wir glauben an Gauß!

Page 6: Street Fighting Data Science

Data Science 101

• Crawling / Scraping

• APIs

• Datenbanken, Hadoop, Stream Processing

• „Data Munging“: Bereinigen / Formatieren / Konvertieren

• Machine Learning (Python Scikit-Learn / NumPy, SciPy, R, Mahout)

• Textanalyse (NLTK, R)

• Network Analysis (Gephi, NodeXL)

• Statistik (R, Python)

Page 7: Street Fighting Data Science

N-Gramme

• N-Gramme zerlegen Texte in kleinere Fragmente. 1-Gramm = „Street“, 2-Gramm = „Street Fighter“ -> Google Corpus 2006/12

Google Ngram Viewer http://books.google.com/ngrams + DB http://books.google.com/ngrams/datasets

Page 8: Street Fighting Data Science

WordNet

• WordNet: semantische und lexikalische Bedeutung von Wörtern

• Daraus z.B. Wörter mit Stimmungen identifizierbar (WN Affect)

WordNet http://wordnet.princeton.edu/ WordNet Affect http://wndomains.fbk.eu/wnaffect.html

Page 9: Street Fighting Data Science

N-Gramme + WordNet

• Emotionen im Zeitverlauf

Acerbi et al 2013 http://www.plosone.org/article/info:doi/10.1371/journal.pone.0059030

Page 10: Street Fighting Data Science

Food Pairings

Ahn et al 2011 http://www.nature.com/srep/2011/111215/srep00196/full/srep00196.html

Page 11: Street Fighting Data Science

Food Pairings

Ahn et al 2011 http://www.nature.com/srep/2011/111215/srep00196/full/srep00196.html

Page 12: Street Fighting Data Science

Food Pairings

Ahn et al 2011 http://www.nature.com/srep/2011/111215/srep00196/full/srep00196.html

Page 15: Street Fighting Data Science

Sandy: Meteorologie für alle

http://rpubs.com/JoFrhwld/sandy

Page 16: Street Fighting Data Science

Windmap

US Wind Patterns www.senchalabs.org/philogl/PhiloGL/examples/winds/

Page 17: Street Fighting Data Science

Google Correlate

Google Correlate www.google.com/trends/correlate

Page 18: Street Fighting Data Science

NodeXL – Twitter-Netzwerk #rp13

NodeXL http://nodexl.codeplex.com/

Page 19: Street Fighting Data Science

Netvizz – Facebook-Daten

NetVizz https://apps.facebook.com/netvizz/

Page 20: Street Fighting Data Science

Gephi – Visualisierungstool

Gephi http://gephi.org

Page 21: Street Fighting Data Science

Das Ergebnis

Facebook-Netzwerk von https://www.facebook.com/benediktkoehler

Page 22: Street Fighting Data Science

Twitter - Bewegungsdaten

Eric Fischer: Travel Patterns http://www.flickr.com/photos/walkingsf/6794335193

Page 23: Street Fighting Data Science

Der Passive Wahlomat

Piraten 0,14108935

Gruene 0,12956345

SPD 0,08088609

CDU 0,06258422

Linke 0,09733024

FDP 0,04376875

http://blog.metaroll.de/2012/03/23/der-passive-wahlomat-textmining-mit-politischen-programmen-und-konversationen-teil-1/

Page 25: Street Fighting Data Science

Web-Crawler

• HTTrack Website Copier etc.

• Simple Web Crawler in Python etc.

Page 27: Street Fighting Data Science

Danke!

http://beautifuldata.net


Recommended