Date post: | 20-Aug-2015 |
Category: |
Technology |
Upload: | axel-bruns |
View: | 2,032 times |
Download: | 2 times |
Tracking Social Media Participation: New Approaches to Studying User-Generated Content
Dr Axel BrunsAssociate Professor
ARC Centre of Excellence for Creative Industries and InnovationQueensland University of Technology
[email protected] http://snurb.info/ – @snurb_dot_info
Researching Social Media
• Social Media:
Websites which build on Web 2.0 technologies to provide space for in-depth social interaction, community formation, and the tackling of collaborative projects.
Axel Bruns and Mark Bahnisch. "Social Drivers behind Growing Consumer Participation in User-Led Content Generation: Volume 1 - State of the Art." Sydney: Smart Services CRC, 2009.
Researching Social Media
• Various existing research approaches:
– Qualitative:
• Processes and practices How? What?
• Content generated by users What?
• Sites and organisational structures How? In what context?
– Quantitative:
• User surveys (demographics, practices, motivations) Who? Why?
• Content coding (usually small-scale) What?
– Mostly small-scale – limited applicability?
Known (Un)knowns
• What we know:
– Behaviour of small social media communities
– Practices of lead users
– Structural frameworks for selected sites / site genres
– Broad demographics of social media users
• Some things we want to know:
– How does all of this work at scale?
– What about ‘average’ users?
– How do communities overlap / interact?
– Can we track developments over time?
(Kelly & Etling, 2009)
Mining and Mapping
• New research materials:
– Massive amounts of data and metadata generated by social media
– Mostly freely available online (Web / RSS / API access)
– Clear, standardised formats
• New research tools:
– Network crawlers
– Website scrapers
– Network analysers / visualisers
– Large-scale text analysers
• What timeframe?
• Crawler approach: anything posted in the last 20 years
• Resulting in one static map – but what’s happening now?
• What map?
• Other ways to categorise these sites?
• Differences in activity, consistency
• Known unknowns – dynamics in the Iranian blogosphere:
• Sites appearing / disappearing?
• Increased / decreased activity?
• New linkage patterns:
• Stronger / weaker clustering?
• Move from one cluster to another?
• Change in topics, shift in emphasis, spread of information?
Asking Sophisticated Questions
Asking Sophisticated Questions
• Problems with current research approaches:– Crawlers don’t distinguish site genres or link types– Scrapers gather all text (including headers, footers, comments, …)– Very few attempts to trace the dynamics of participation– Many different ways to visualise these data– Assumptions often built into the software, and difficult to change
• Alternative approaches:– Gather large population of RSS feeds (and keep growing it)– Track for new posts, and scrape posts only (retain timestamp)– Extract links and keywords for further analysis– Develop ways of identifying and visualising change over time
• Needs to be appropriate to research questions
Applications: Blogosphere
• Questions:– (How) does the ‘A-List’
change over time?– (How) does political
alignment change over time?– How strong is cross-
connection across clusters?– What topics are discussed
– e.g. compared with MSM?
– What happens when power (Adamic & Glance, 2005)
changes hands – is bloggingan oppositional practice?
– Beyond left and right (beyond politics!): identification of blog genres based on textual / linkage patterns (qualitative follow-up necessary)
0
100
200
300
400
500
600
2009
.01.
1220
09.0
1.14
2009
.01.
1620
09.0
1.18
2009
.01.
2020
09.0
1.22
2009
.01.
2420
09.0
1.26
2009
.01.
2820
09.0
1.30
2009
.02.
0120
09.0
2.03
2009
.02.
0520
09.0
2.07
2009
.02.
0920
09.0
2.11
2009
.02.
1320
09.0
2.15
2009
.02.
1720
09.0
2.19
2009
.02.
2120
09.0
2.23
2009
.02.
2520
09.0
2.27
2009
.03.
0120
09.0
3.03
2009
.03.
0520
09.0
3.07
2009
.03.
0920
09.0
3.11
2009
.03.
1320
09.0
3.15
2009
.03.
1720
09.0
3.19
2009
.03.
2120
09.0
3.23
2009
.03.
2520
09.0
3.27
2009
.03.
2920
09.0
3.31
2009
.04.
0220
09.0
4.04
2009
.04.
0620
09.0
4.08
2009
.04.
1020
09.0
4.12
2009
.04.
1420
09.0
4.16
2009
.04.
1820
09.0
4.20
2009
.04.
2220
09.0
4.24
2009
.04.
2620
09.0
4.28
2009
.04.
3020
09.0
5.02
2009
.05.
0420
09.0
5.06
2009
.05.
0820
09.0
5.10
2009
.05.
1220
09.0
5.14
2009
.05.
1620
09.0
5.18
2009
.05.
2020
09.0
5.22
2009
.05.
2420
09.0
5.26
2009
.05.
2820
09.0
5.30
2009
.06.
0120
09.0
6.03
2009
.06.
0520
09.0
6.07
2009
.06.
0920
09.0
6.11
2009
.06.
1320
09.0
6.15
2009
.06.
1720
09.0
6.19
2009
.06.
2120
09.0
6.23
2009
.06.
2520
09.0
6.27
2009
.06.
2920
09.0
7.01
2009
.07.
0320
09.0
7.05
2009
.07.
0720
09.0
7.09
2009
.07.
1120
09.0
7.13
2009
.07.
1520
09.0
7.17
2009
.07.
1920
09.0
7.21
2009
.07.
2320
09.0
7.25
2009
.07.
2720
09.0
7.29
2009
.07.
3120
09.0
8.02
2009
.08.
0420
09.0
8.06
2009
.08.
0820
09.0
8.10
Australian News
Australian News
MSM Patterns of Activity (Jan.-Aug. 2009)
BushfiresBudget
Artefact
Qld Election
Utegate Pt. 2?
0
10
20
30
40
50
60
70
80
2009
.01.
1220
09.0
1.14
2009
.01.
1620
09.0
1.18
2009
.01.
2020
09.0
1.22
2009
.01.
2420
09.0
1.26
2009
.01.
2820
09.0
1.30
2009
.02.
0120
09.0
2.03
2009
.02.
0520
09.0
2.07
2009
.02.
0920
09.0
2.11
2009
.02.
1320
09.0
2.15
2009
.02.
1720
09.0
2.19
2009
.02.
2120
09.0
2.23
2009
.02.
2520
09.0
2.27
2009
.03.
0120
09.0
3.03
2009
.03.
0520
09.0
3.07
2009
.03.
0920
09.0
3.11
2009
.03.
1320
09.0
3.15
2009
.03.
1720
09.0
3.19
2009
.03.
2120
09.0
3.23
2009
.03.
2520
09.0
3.27
2009
.03.
2920
09.0
3.31
2009
.04.
0220
09.0
4.04
2009
.04.
0620
09.0
4.08
2009
.04.
1020
09.0
4.12
2009
.04.
1420
09.0
4.16
2009
.04.
1820
09.0
4.20
2009
.04.
2220
09.0
4.24
2009
.04.
2620
09.0
4.28
2009
.04.
3020
09.0
5.02
2009
.05.
0420
09.0
5.06
2009
.05.
0820
09.0
5.10
2009
.05.
1220
09.0
5.14
2009
.05.
1620
09.0
5.18
2009
.05.
2020
09.0
5.22
2009
.05.
2420
09.0
5.26
2009
.05.
2820
09.0
5.30
2009
.06.
0120
09.0
6.03
2009
.06.
0520
09.0
6.07
2009
.06.
0920
09.0
6.11
2009
.06.
1320
09.0
6.15
2009
.06.
1720
09.0
6.19
2009
.06.
2120
09.0
6.23
2009
.06.
2520
09.0
6.27
2009
.06.
2920
09.0
7.01
2009
.07.
0320
09.0
7.05
2009
.07.
0720
09.0
7.09
2009
.07.
1120
09.0
7.13
2009
.07.
1520
09.0
7.17
2009
.07.
1920
09.0
7.21
2009
.07.
2320
09.0
7.25
2009
.07.
2720
09.0
7.29
2009
.07.
3120
09.0
8.02
2009
.08.
0420
09.0
8.06
2009
.08.
0820
09.0
8.10
Blog
Blog
Blog Patterns of Activity (Jan.-Aug. 2009)
Bushfires
Budget
Artefact
Qld ElectionObama
Utegate Pt. 1 Utegate Pt. 2
0
10
20
30
40
50
60
70
80
90
2009
.01.
12
2009
.01.
14
2009
.01.
16
2009
.01.
18
2009
.01.
20
2009
.01.
22
2009
.01.
24
2009
.01.
26
2009
.01.
28
2009
.01.
30
2009
.02.
02
2009
.02.
04
2009
.02.
06
2009
.02.
08
2009
.02.
10
2009
.02.
12
2009
.02.
14
2009
.02.
16
2009
.02.
18
2009
.02.
20
2009
.02.
22
2009
.02.
24
2009
.02.
26
2009
.02.
28
2009
.03.
02
2009
.03.
04
2009
.03.
06
2009
.03.
08
2009
.03.
10
2009
.03.
12
2009
.03.
14
2009
.03.
16
2009
.03.
18
2009
.03.
20
2009
.03.
22
2009
.03.
24
2009
.03.
26
2009
.03.
28
2009
.03.
30
2009
.04.
01
2009
.04.
03
2009
.04.
05
2009
.04.
07
2009
.04.
09
2009
.04.
11
2009
.04.
13
2009
.04.
15
2009
.04.
17
2009
.04.
19
2009
.04.
21
2009
.04.
23
2009
.04.
25
2009
.04.
27
2009
.04.
29
2009
.05.
01
2009
.05.
03
2009
.05.
05
2009
.05.
07
2009
.05.
09
2009
.05.
11
2009
.05.
13
2009
.05.
15
2009
.05.
17
2009
.05.
19
2009
.05.
21
2009
.05.
23
2009
.05.
25
2009
.05.
27
2009
.05.
29
2009
.05.
31
2009
.06.
02
2009
.06.
04
2009
.06.
06
2009
.06.
08
2009
.06.
10
2009
.06.
12
2009
.06.
14
2009
.06.
16
2009
.06.
18
2009
.06.
20
2009
.06.
22
2009
.06.
24
2009
.06.
26
2009
.06.
28
2009
.06.
30
2009
.07.
02
2009
.07.
04
2009
.07.
06
2009
.07.
08
2009
.07.
10
2009
.07.
12
2009
.07.
14
2009
.07.
16
2009
.07.
18
2009
.07.
20
2009
.07.
22
2009
.07.
24
2009
.07.
26
2009
.07.
28
2009
.07.
30
2009
.08.
01
2009
.08.
03
2009
.08.
05
2009
.08.
07
2009
.08.
09
2009
.08.
11
Opinion
Opinion
Opinion Patterns of Activity (Jan.-Aug. 2009)
Australia Day
Budget
Qld Election
Obama Utegate Pt. 2
Artefact
Utegate in the Australian Blogosphere
19-24 June 2009
19 June 2009: Opposition Senator Abetz reads from alleged email from PM advisor to Grech during Senate enquiry
19 June 2009: Turnbull accuses Rudd of corruption and lying to parliament
22 June 2009: Federal Police raid Grech’s house and find email
22 June 2009: Email found to be fake, created by Grech
Utegate in the Australian Blogosphere
4-5 August 2009
4 Aug. 2009: Grech admits forging email
4 Aug. 2009: Auditor-General’s report finds no wrongdoing by PM or Treasurer
Acknowledgements:
Data gathering and processing by Lars Kirchhoff and Thomas Nicolai (Sociomantic Labs, Berlin)Concept maps by Tim Highfield (QUT)
(Preliminary stage for ARC Discovery project, 2010-12)
Applications: last.fm vs. Billboard
• Tracking listening patterns:
– Billboard = sales charts
– last.fm = listening activity
– Comparing sales and use of new releases
– Identifying brief flashes andslow burners
– Distinguishing casual listenersand committed fan groups
– Providing market informationto the music industry
(Adjei & Holland-Cunz, 2008)
Application: Wikipedia Content Dynamics
• Tracking editing patterns:
– Identifying stable/unstable content in Wikipedia
– Highlighting controversy, vandalism, sneaky edits
– Tracking consensus development– Tracking responses to developing
stories (http://www.research.ibm.com/visual/projects/history_flow/capitalism1.htm)
– Establishing trustworthiness based (http://trust.cse.ucsc.edu/)
on extent of peer review
– Highlighting most hotly debated(edited) sections of text
_______ Science Emerges
• Web Science Research Initiative (Tim Berners-Lee et al.)– Science, technology, computer engineering, …– Limited inclusion of media, cultural, and communication studies– Strong focus on Semantic Web, artificial ontologies
• Cultural Science + Cultural Science Journal (John Hartley et al.)– Media & cultural studies, evolutionary economics, anthropology, …– Limited inclusion of computer sciences, technology– Strong focus on culture, innovation, evolutionary dynamics
• Data mining and visualisation– Substantial commercial work on data mining– Visualisation experiments in communication
design and visual arts
Looking Ahead
• Critical, interdisciplinary approaches
– Need to better connect cultural studies, computer science, research technology developments
– Need to interrogate in-built assumptions of existing technologies
– Need to explore and investigate visualisation and analysis methods
– Need to develop cross-platform approaches and connect with more conventional research
• Open questions
– Ethics of working with technically public, but notionally private data
– Potential (ab)use of data mining techniques and/or research results by corporate and government interests
– What new knowledge can such research contribute?