+ All Categories
Transcript
Page 1: Mining Spatio-temporal Clusters for Event Correlation and ... … · ladygaga justinbieber ka rihanna shakira britneyspears KimKardashian barackobama taylorswift13 selenagomez WORTH?

Mining Spatio-temporal Clusters forEvent Correlation and Visualization

Rahul Potharaju, Andrew Newell, Cristina Nita-RotaruDepartment of Computer Science, Purdue University

Finding the Story in the TweetStack

Early event detection carriessubstantial value in various domains

Tremendous real-time capabilities in social media

Limitations of Conventional TechniquesNatural Language Processing is not very effective for tweets! Short words,

new abbreviations and word disambiguation.

Key IdeaLeverage signal processing to pre-process tweets into clusters. Subsequently

improve semantic interpretation using natural language processing.

Egypt

Google

Mubarak

Twitter

1: Construct Time Series

2: Cluster similar patterns

3: Semantic Correlation

Egypt

Mubarak

Google

ARE TWEETS RELATING REAL-WORLD ENTITIES CORRELATED?Seasonal Trend Decomposition based on Loess Smoother

Extract Trend Lines Cross Correlation

CURRENTLY IN THE PIPELINEClustering Time Series• Computing cross correlation is expensive!• Convert time series into another representation• Cluster (k-means or hierarchical) this representation• Verify cluster utility

abababc aaabbbababbbabba

5AA-2E8.pdf 1 3/19/2012 2:25:05 PM

mfocosi
Typewritten Text
2012 - 5AA-2E8 - Finding the Story in the TweetStack: Mining Spatio-temporal Clusters for Event Correlation and Visualization - Rahul Potharaju PDR
Top Related