
Lecture4 Social Web

Date post: 18-Jun-2015
Upload: marieke-van-erp
Description:
How can we mine, analyse and visualise the Social Web? In this lecture, you will learn about mining social web data for analysis. Data preparation and gathering basic statistics on your data.
Transcript
Page 1: Lecture4 Social Web

Social Web Lecture 4

How can we MINE, ANALYSE and VISUALISE the Social Web? (1)

Marieke van Erp

The Network Institute

VU University Amsterdam

Page 2: Lecture4 Social Web

Why?

• User-generated content (UGC) provides an enormous wealth of data

• insights into users’ daily lives

• insights into communities

• insights into trends

Page 3: Lecture4 Social Web

To whom it may concern

• Politicians

• Companies

• Governmental institutions

• You?

Page 4: Lecture4 Social Web
Page 5: Lecture4 Social Web

The Age of Big Data

• 25 billion tweets on Twitter in 2010, by 175 million users

• 360 billion pieces of content on Facebook in 2010, by 600 million different users

• 35 hours of video uploaded to YouTube every minute

• 130 million photos uploaded to Flickr per month
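
To put these figures on a comparable scale, here is a small back-of-the-envelope sketch. It only uses the numbers quoted above; the 30-day month is an assumption made for the Flickr estimate.

```python
# Figures as quoted on the slide (2010).
tweets_per_year = 25_000_000_000
twitter_users = 175_000_000
facebook_items_per_year = 360_000_000_000
facebook_users = 600_000_000
youtube_hours_per_minute = 35
flickr_photos_per_month = 130_000_000

# Average activity per user per year.
tweets_per_user = tweets_per_year / twitter_users
items_per_facebook_user = facebook_items_per_year / facebook_users

# Upload volume per day.
youtube_hours_per_day = youtube_hours_per_minute * 60 * 24
flickr_photos_per_day = flickr_photos_per_month / 30  # assumption: 30-day month

print(f"Tweets per Twitter user per year: {tweets_per_user:.0f}")
print(f"Items per Facebook user per year: {items_per_facebook_user:.0f}")
print(f"Hours of YouTube video per day:   {youtube_hours_per_day}")
print(f"Flickr photos per day (approx.):  {flickr_photos_per_day:,.0f}")
```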

Page 6: Lecture4 Social Web

Questions to Ask

• Who uploads/talks? (age, gender, nationality, community)

• What are the trending topics?

• What else do these users like?

• Who are the most/least active users?

• etc.
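
Two of these questions (most/least active users, trending topics) can already be answered by simple counting. A minimal sketch, assuming posts are available as (user, text) pairs; the sample data and the use of hashtags as a stand-in for "topics" are assumptions for illustration only.

```python
from collections import Counter

# Hypothetical sample: (user, text) pairs standing in for real posts.
posts = [
    ("anna", "Loving the #socialweb lecture"),
    ("bob", "Reading about #bigdata and #socialweb"),
    ("anna", "More #bigdata numbers today"),
    ("carol", "Coffee first"),
    ("anna", "#socialweb assignment time"),
]

# Who talks the most / the least?
posts_per_user = Counter(user for user, _ in posts)
most_active = posts_per_user.most_common(1)[0]
least_active = min(posts_per_user.items(), key=lambda item: item[1])

# What are the trending topics? (approximated here by hashtag frequency)
hashtags = Counter(
    word.lower()
    for _, text in posts
    for word in text.split()
    if word.startswith("#")
)
trending = hashtags.most_common(2)

print(most_active, least_active, trending)
```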

Page 7: Lecture4 Social Web

What do you prefer?

Image: http://www.co.olmsted.mn.us/prl/propertyrecords/RecordingDocuments/PublishingImages/forms.jpg

Page 8: Lecture4 Social Web
Page 9: Lecture4 Social Web

The Rise of the Data Scientist

http://radar.oreilly.com/2010/06/what-is-data-science.html

Page 10: Lecture4 Social Web

The Rise of the Data Scientist

• Data Science enables the creation of data products

• Data products are applications that acquire their value from the data, and create more data as a result.

• Users are in a feedback loop: they constantly provide information about the products they use, which gets used in the data product.

Page 11: Lecture4 Social Web

Popular Data Products

Page 12: Lecture4 Social Web

Data Mining 101

(Inspired by George Tziralis’ FOSS Conf’09, John Elder IV’s Salford Systems Data Mining Conf. and Toon Calders’ slides)

Data mining is the exploration and analysis of large quantities of data in order to discover valid, novel, potentially useful, and ultimately understandable patterns in data.

http://www.freefoto.com/images/33/12/33_12_7---Pebbles_web.jpg

Page 13: Lecture4 Social Web

Data Mining 101

• Databases

• Statistics

• Artificial Intelligence

Page 14: Lecture4 Social Web

Steps

• Data input & exploration

• Preprocessing

• Data mining algorithms

• Evaluation & Interpretation

Page 15: Lecture4 Social Web

Data Input & Exploration

• What data do I need to answer question X?

• What variables are in the data?

• Basic stats of my data?
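
A first exploration pass could look like the sketch below: list the variables, then compute a few basic statistics per numeric variable. The records and field names are hypothetical; real data would be loaded from an API or a file.

```python
import statistics

# Hypothetical records; the field names are assumptions for illustration.
records = [
    {"user": "anna", "age": 24, "likes": 10},
    {"user": "bob", "age": 31, "likes": 3},
    {"user": "carol", "age": None, "likes": 7},
    {"user": "dave", "age": 45, "likes": 0},
]

# What variables are in the data?
variables = sorted({key for record in records for key in record})

def summarise(field):
    """Basic statistics for one numeric field, ignoring missing values."""
    values = [r[field] for r in records if r.get(field) is not None]
    return {
        "count": len(values),
        "missing": len(records) - len(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
    }

age_summary = summarise("age")
likes_summary = summarise("likes")
print(variables, age_summary, likes_summary, sep="\n")
```

Counting missing values this early is a deliberate choice: it shows how much cleanup the preprocessing step will need.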

Page 16: Lecture4 Social Web
Page 17: Lecture4 Social Web

Input & Exploration in ‘LikeMiner’

Page 18: Lecture4 Social Web

Preprocessing

• Cleanup!

• Choose a suitable data model

• What happens if you integrate data from multiple sources?

• Reformat your data
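
The sketch below illustrates these steps on two hypothetical sources that use different field names and date formats. The `Post` data model, the field names and the assumption that source A timestamps are in UTC are all made up for this example.

```python
from dataclasses import dataclass
from datetime import datetime, timezone

@dataclass(frozen=True)
class Post:
    """Common data model that both sources are mapped onto."""
    source: str
    user: str
    text: str
    created: datetime

# Hypothetical raw records from two sources.
source_a = [
    {"screen_name": " Anna ", "body": "Hello   world", "ts": "2010-06-01T12:00:00"},
    {"screen_name": "anna", "body": "Hello world", "ts": "2010-06-01T12:00:00"},
]
source_b = [
    {"author": "BOB", "message": "Nice photo!\n", "epoch": 1275393600},
]

def clean_text(text):
    """Cleanup: collapse runs of whitespace and trim the ends."""
    return " ".join(text.split())

def from_a(raw):
    # Assumption: source A timestamps are ISO 8601 strings in UTC.
    created = datetime.fromisoformat(raw["ts"]).replace(tzinfo=timezone.utc)
    return Post("a", raw["screen_name"].strip().lower(), clean_text(raw["body"]), created)

def from_b(raw):
    # Source B stores seconds since the Unix epoch.
    created = datetime.fromtimestamp(raw["epoch"], tz=timezone.utc)
    return Post("b", raw["author"].strip().lower(), clean_text(raw["message"]), created)

# Integrate both sources and drop exact duplicates, keeping the original order.
merged = [from_a(r) for r in source_a] + [from_b(r) for r in source_b]
posts = list(dict.fromkeys(merged))
print(posts)
```

After cleanup the two records from source A become identical, so only one survives; this is the kind of effect to watch for when integrating data from multiple sources.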

Page 19: Lecture4 Social Web

Preprocessing in ‘LikeMiner’

Page 20: Lecture4 Social Web

Data mining algorithms

• Classification: Generalising a known structure and applying it to new data

• Association: Finding relationships between variables

• Clustering: Discovering groups and structures in data
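
As one concrete instance, association mining can be sketched with the two standard measures, support and confidence, computed over pairs of items. The "likes" data below is hypothetical, and this is a plain counting sketch rather than a full algorithm such as Apriori.

```python
from collections import Counter
from itertools import combinations

# Hypothetical data: the set of topics each user liked.
likes = [
    {"music", "film", "books"},
    {"music", "film"},
    {"music", "sports"},
    {"film", "books"},
]

# Count how often each item and each (sorted) pair of items occurs.
item_counts = Counter(item for basket in likes for item in basket)
pair_counts = Counter(
    pair for basket in likes for pair in combinations(sorted(basket), 2)
)

def support(pair):
    """Fraction of users who liked both items of the pair."""
    return pair_counts[tuple(sorted(pair))] / len(likes)

def confidence(antecedent, consequent):
    """Of the users who liked `antecedent`, the fraction who also liked `consequent`."""
    both = pair_counts[tuple(sorted((antecedent, consequent)))]
    return both / item_counts[antecedent]

print(support(("music", "film")))
print(confidence("music", "film"))
print(confidence("books", "film"))
```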

Page 21: Lecture4 Social Web

Mining in ‘LikeMiner’

• Filter users by interests

• Construct user graphs

• PageRank on graphs to mine representativeness

• Result: set of influential users

• Compare page topics to user interests to find the pages most representative of each topic
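
The PageRank step can be sketched as a textbook power iteration over a directed user graph. The slides do not say how LikeMiner builds its graphs or which parameters it uses, so the graph, the damping factor of 0.85 and the iteration count below are assumptions for illustration.

```python
def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank; `graph` maps each node to the nodes it links to."""
    nodes = set(graph) | {t for targets in graph.values() for t in targets}
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing edges is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if not graph.get(node))
        new_rank = {
            node: (1.0 - damping) / n + damping * dangling / n for node in nodes
        }
        # Every node passes its rank on in equal shares to the nodes it links to.
        for node, targets in graph.items():
            for target in targets:
                new_rank[target] += damping * rank[node] / len(targets)
        rank = new_rank
    return rank

# Hypothetical user graph: an edge u -> v means "u interacts with v".
user_graph = {
    "anna": ["carol"],
    "bob": ["carol"],
    "carol": ["anna"],
    "dave": ["carol", "anna"],
}

ranks = pagerank(user_graph)
influential = sorted(ranks, key=ranks.get, reverse=True)
print(influential)
```

Users that many others point to end up with a high rank, which is one way to read "influential" in the list above.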

Page 22: Lecture4 Social Web

Interpreting your results

Page 23: Lecture4 Social Web
Page 24: Lecture4 Social Web

Data Mining is not easy

Page 25: Lecture4 Social Web
Page 26: Lecture4 Social Web
Page 29: Lecture4 Social Web

Populations

http://www.brandrants.com/brandrants/obama/

Page 30: Lecture4 Social Web

Brand Sentiment via Twitter

http://flowingdata.com/2011/07/25/brand-sentiment-showdown/

Page 32: Lecture4 Social Web

Final Assignment: Your SocWeb App

• Create a Social Web app with your group

• Use structured data, relationships between entities, data analysis, visualisation

• Write an individual research report on one of the main aspects of your app

Image Source: http://blog.compete.com/wp-content/uploads/2012/03/Like.jpg

Page 33: Lecture4 Social Web

Hands-on Teaser

• Build your own recommender system 101

• Recommend pages on del.icio.us

• Recommend pages to your Facebook friends
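
To give a flavour of what such a recommender might look like, here is a minimal user-based sketch: users are compared by the overlap of their bookmarks (Jaccard similarity), and unseen pages are scored by the similarity of the users who saved them. The data, page names and scoring scheme are assumptions; the hands-on session may use a different method.

```python
# Hypothetical bookmark data: user -> set of saved pages.
bookmarks = {
    "anna": {"python-tutorial", "d3-gallery", "data-blog"},
    "bob": {"python-tutorial", "d3-gallery", "networkx-docs"},
    "carol": {"data-blog", "oreilly-radar"},
    "dave": {"cooking-tips"},
}

def jaccard(a, b):
    """Overlap of two sets: size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def recommend(user, data, top_n=3):
    """Recommend pages saved by similar users that `user` has not saved yet."""
    seen = data[user]
    scores = {}
    for other, pages in data.items():
        if other == user:
            continue
        similarity = jaccard(seen, pages)
        if similarity == 0:
            continue  # users with nothing in common contribute nothing
        for page in pages - seen:
            scores[page] = scores.get(page, 0.0) + similarity
    # Highest score first; ties broken alphabetically for a stable result.
    return sorted(scores, key=lambda page: (-scores[page], page))[:top_n]

anna_recs = recommend("anna", bookmarks)
print(anna_recs)
```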

image source: http://www.flickr.com/photos/bionicteaching/1375254387/
