+ All Categories
Home > Documents > Mining information from social media

Mining information from social media

Date post: 08-Jan-2018
Category:
Upload: meagan-thornton
View: 218 times
Download: 0 times
Share this document with a friend
Description:
Classification of Social Behavior User Preference or Behavior can be represented as class labels In a given Web-Page, we can classify user behavior as: Whether or not clicking on an ad Whether or not interested in certain topics Like/Dislike a product In a social network sites Comments by friends Like/dislike of any posts/status In commercial web sites Like/dislike of product Views toward the product etc.
12
MINING INFORMATION FROM SOCIAL MEDIA 1 Contains: 1.Classification of Social Behavior 2.Link Prediction 3.Viral Marketing/Outbreak Detection 4.Network Modeling 5.Social Dimensions
Transcript
Page 1: Mining information from social media

MINING INFORMATION FROM SOCIAL MEDIA

1

Contains:1.Classification of Social Behavior2.Link Prediction3.Viral Marketing/Outbreak Detection4.Network Modeling5.Social Dimensions

Page 2: Mining information from social media

Classification of Social Behavior• User Preference or Behavior can be represented as class

labels• In a given Web-Page, we can classify user behavior as:

• Whether or not clicking on an ad• Whether or not interested in certain topics• Like/Dislike a product

• In a social network sites– Comments by friends– Like/dislike of any posts/status

• In commercial web sites– Like/dislike of product– Views toward the product etc.

Page 3: Mining information from social media

Visualization after Prediction

: Smoking: Non-Smoking: ? Unknown

Predictions6: Non-Smoking7: Non-Smoking8: Smoking9: Non-Smoking10: Smoking

Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 4: Mining information from social media

Link Prediction• Given a social network, predict which nodes are likely to get

connected• Output a list of (ranked) pairs of nodes• Example: Friend recommendation in Facebook

Link Prediction

(2, 3)(4, 12)(5, 7)(7, 13)

Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 5: Mining information from social media

Viral Marketing/Outbreak Detection

• Users have different social capital (or network values) within a social network, hence, how can one make best use of this information?

• Viral Marketing: find out a set of users to provide coupons and promotions to influence other people in the network so my benefit is maximized

• Outbreak Detection: monitor a set of nodes that can help detect outbreaks or interrupt the infection spreading (e.g., H1N1 flu)

• Goal: given a limited budget, how to maximize the overall benefit?

Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 6: Mining information from social media

An Example of Viral Marketing• Find the coverage of the whole network of nodes with the

minimum number of nodes• How to realize it – an example

– Basic Greedy Selection: Select the node that maximizes the utility, remove the node and then repeat

• Select Node 1• Select Node 8• Select Node 7

Node 7 is not a node with high centrality!

Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 7: Mining information from social media

Network Modeling• Large Networks demonstrate statistical patterns:

– Small-world effect (e.g., 6 degrees of separation)– Power-law distribution (a.k.a. scale-free distribution)– Community structure (high clustering coefficient)

• Model the network dynamics– Find a mechanism such that the statistical patterns observed in large-

scale networks can be reproduced.– Examples: random graph, preferential attachment process

• Used for simulation to understand network properties– Thomas Shelling’s famous simulation: What could cause the

segregation of white and black people– Network robustness under attack

Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 8: Mining information from social media

Comparing Network Models

observations over various real-word large-scale networks

outcome of a network model

(Figures borrowed from “Emergence of Scaling in Random Networks”)

Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 9: Mining information from social media

Social Dimensions

• Affiliations of actors are represented as social dimensions• Each Dimension represents one potential affiliation• Social dimensions capture prominent interaction patterns

presented in the network

1 32 1

Affiliation 1 Affiliation 2

Actor Affiliation 1 Affiliation 2

1 1 1

2 1 0

3 0 1

… …… ……

Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 10: Mining information from social media

Approach II: Social-Dimension Approach (SocDim)

• Training: – Extract social dimensions to represent potential affiliations of actors

• Any community detection methods is applicable (block model, spectral clustering)

– Build a classifier to select those discriminative dimensions• Any discriminative classifier is acceptable (SVM, Logistic Regression)

• Prediction:– Predict labels based on one actor’s latent social dimensions– No collective inference is necessary

ExtractPotential

Affiliations

Trainingclassifier

Prediction

Labels

Predicted Labels

Social Dimensions

Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 11: Mining information from social media

Underlying Assumption

• Assumption: – the label of one node is determined by its social dimension – P(yi|A) = P(yi|Si)

• Community membership serves as latent features

11Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Page 12: Mining information from social media

Reference• Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection

and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

• Lei Tang and Huan Liu. Community Detection and Mining in Social Media, Morgan & Claypool Publishers, 2010


Recommended