+ All Categories
Home > Documents > Mining Spatio-temporal Clusters for Event Correlation and ... … · ladygaga justinbieber ka...

Mining Spatio-temporal Clusters for Event Correlation and ... … · ladygaga justinbieber ka...

Date post: 26-May-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
1
Mining Spatio-temporal Clusters for Event Correlation and Visualization Rahul Potharaju, Andrew Newell, Cristina Nita-Rotaru Department of Computer Science, Purdue University Finding the Story in the TweetStack Early event detection carries substantial value in various domains Tremendous real-time capabilities in social media Limitations of Conventional Techniques Natural Language Processing is not very effective for tweets! Short words, new abbreviations and word disambiguation. Key Idea Leverage signal processing to pre-process tweets into clusters. Subsequently improve semantic interpretation using natural language processing. Egypt Google Mubarak Twitter 1: Construct Time Series 2: Cluster similar patterns 3: Semantic Correlation Egypt Mubarak Google ARE TWEETS RELATING REAL-WORLD ENTITIES CORRELATED? Seasonal Trend Decomposition based on Loess Smoother Extract Trend Lines Cross Correlation CURRENTLY IN THE PIPELINE Clustering Time Series Computing cross correlation is expensive! Convert time series into another representation Cluster (k-means or hierarchical) this representation Verify cluster utility abababc aaabbbab abbbabba 5AA-2E8.pdf 1 3/19/2012 2:25:05 PM
Transcript
Page 1: Mining Spatio-temporal Clusters for Event Correlation and ... … · ladygaga justinbieber ka rihanna shakira britneyspears KimKardashian barackobama taylorswift13 selenagomez WORTH?

Mining Spatio-temporal Clusters forEvent Correlation and Visualization

Rahul Potharaju, Andrew Newell, Cristina Nita-RotaruDepartment of Computer Science, Purdue University

Finding the Story in the TweetStack

Early event detection carriessubstantial value in various domains

Tremendous real-time capabilities in social media

Limitations of Conventional TechniquesNatural Language Processing is not very effective for tweets! Short words,

new abbreviations and word disambiguation.

Key IdeaLeverage signal processing to pre-process tweets into clusters. Subsequently

improve semantic interpretation using natural language processing.

Egypt

Google

Mubarak

Twitter

1: Construct Time Series

2: Cluster similar patterns

3: Semantic Correlation

Egypt

Mubarak

Google

ARE TWEETS RELATING REAL-WORLD ENTITIES CORRELATED?Seasonal Trend Decomposition based on Loess Smoother

Extract Trend Lines Cross Correlation

CURRENTLY IN THE PIPELINEClustering Time Series• Computing cross correlation is expensive!• Convert time series into another representation• Cluster (k-means or hierarchical) this representation• Verify cluster utility

abababc aaabbbababbbabba

5AA-2E8.pdf 1 3/19/2012 2:25:05 PM

mfocosi
Typewritten Text
2012 - 5AA-2E8 - Finding the Story in the TweetStack: Mining Spatio-temporal Clusters for Event Correlation and Visualization - Rahul Potharaju PDR
Recommended