+ All Categories
Home > Technology > Measuring Wiki viability

Measuring Wiki viability

Date post: 08-May-2015
Category:
Upload: dario-taraborelli
View: 680 times
Download: 0 times
Share this document with a friend
Description:
Paper presented at WikiSym 2008, showing what factors are likely to boost or hinder the growth of a wiki-based community. Full paper available at http://nitens.org/docs/wikidyn.pdf and in the forthcoming WikiSym 2008 proceedings.
17
Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions Measuring Wiki Viability An empirical assessment of the social dynamics of a large sample of wikis Camille Roth , Dario Taraborelli*, Nigel Gilbert* Centre National pour la Recherche Scientifique, Paris * Centre for Research in Social Simulation, University of Surrey WikiSym 2008 Porto, September 10, 2008 Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert
Transcript
Page 1: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Measuring Wiki ViabilityAn empirical assessment of the social dynamics

of a large sample of wikis

Camille Roth◦, Dario Taraborelli*, Nigel Gilbert*

◦ Centre National pour la Recherche Scientifique, Paris* Centre for Research in Social Simulation, University of Surrey

WikiSym 2008Porto, September 10, 2008

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 2: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

What factors affect the growth and viability of a wiki?

I Widespread interest in policies and recommendations foreffective wiki governance

I Best practices: how far can they be generalized?

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 3: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

What factors affect the growth and viability of a wiki?

Empirical evidence

I Wikipedia-centered literature on wiki dynamics

I Scarcity of data to study wiki evolution over time

I Lack of tools to measure wiki evolution for different wikiengines

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 4: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Overview

Goal: Identify factors that affect the dynamics of a large sampleof Mediawiki-based communities.

1. Wiki dynamics: methodology

2. Results: growth enhancers and regulators

3. Research directions and conclusions

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 5: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Method: dataset

I Data source

Growth data tracked over an 8-month span from 11, 500+MediaWikis

I Selected sample

360 wikis, with an initial population between 400 and20, 000 users, restricted to hosters with reliable data, andwith no major discontinuity in daily change rates.

I VariablesI population size (U)I content size (P)I admin population (A)I edits (E)I access control (R)

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 6: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Method: indicators considered

descriptive indicators

I user activity, i.e. the proportion of edits per user (E/U)I user density, i.e. the proportion of users per page (U/P)I edit density, i.e. the proportion of edits per page (E/P).

governance factors

I administrator ratio, i.e. the proportion of users with adminstatus (A/U)

I administrator density, i.e. the proportion of admins perpage (A/P)

I editing permission (R).

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 7: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Method: analysis

1. Sort wikis according to each of these indicators (e.g. editdensity)

edit density

2. Aggregate wikis in similar “chunks” (quantiles) of identicalsize (e.g. wikis with low edit density vs. wikis with high editdensity)

Q1 Q2 Q3 Q4 Q5

edit density

3. Measure how each quantile performs with respect to pageand population growth

Q1 Q2 Q3 Q4 Q5

edit density

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 8: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Growth enhancers (1): user activity

of their population or content.

3. WIKI DYNAMICSWe assessed wiki dynamics by comparing their diverse paths withrespect to a set of independent variables. ‘Growth’ is defined interms of population and content size change: user growth GU (resp.page growth GP ) is the ratio between final and initial populations(resp. content sizes): GU = Ulast/Ufirst (resp. GP = Plast/Pfirst).Wiki dynamics were studied as a function of the variables listed inSection 2.2:

(I) DESCRIPTIVE INDICATORS, i.e. variables on which wiki ad-ministrators have no direct control: (a) user activity, i.e. theproportion of edits per user (E/U ), (b) user density, i.e. theproportion of users per page (U/P ), and (c) edit density, i.e.the proportion of edits per page (E/P ).

(II) GOVERNANCE FACTORS, variables that wiki administratorscan directly control: (a) administrator ratio, i.e. the propor-tion of users who are granted administrator status (A/U ),(b) administrator density, i.e. the proportion of administra-tors per page (A/P ), (c) editing permission (R).

For each continuous variable, instead of carrying out a delicateanalysis by dealing with clouds of points, we adopted a more in-sightful approach by dividing wikis into five quantiles, each includ-ing exactly 20% of all wikis in the clean dataset (see Table 1). We

1 2 3 4 5

Quantile of

edits per user1.0

1.1

1.2

1.3

1.4

1.5

1.6

Growth ratio

User growth

1 2 3 4 5

Quantile of

edits per user

0.95

1.00

1.05

1.10

1.15

1.20

1.25

1.30

Growth ratio

Page growth

1.1

1.11.1

1.2

1.2

1.3

1.3

1.4

1.4

1 2 3 4 51

2

3

4

Quantiles of 'edits per user'

Populationquantiles

User growth

1

1

1

1.1

1.1

1.1

1.1

1.11.2

1.2

1 2 3 4 51

2

3

4

Quantiles of 'edits per user'

Populationquantiles

Page growth

Figure 1: Growth landscape with respect to the proportion ofedits per user.

1 2 3 4 5

Quantile of

users per good page1.0

1.1

1.2

1.3

1.4

1.5

1.6

Growth ratio

User growth

1 2 3 4 5

Quantile of

users per good page1.00

1.05

1.10

1.15

1.20

1.25

1.30

Growth ratio

Page growth

1

1.1

1.1

1.2

1.2

1.3

1.3

1.41.4 1.5

1 2 3 4 51

2

3

4

Quantiles of 'users per good page'

Populationquantiles

User growth

1

1

1

1.1

1.1

1.1

1.1

1.2

1 2 3 4 51

2

3

4

Quantiles of 'users per good page'

Populationquantiles

Page growth

Figure 2: Growth landscape with respect to the proportion ofusers per good page.

then computed and compared growth ratio means over all wikis foreach quantile. Additionally, we distinguished population quantilesin order to control for user size-related effects. To this end, weplotted a growth landscape that consists of a two-dimensional rep-resentation of the various growth ratios. This representation wasapplied to all the above-mentioned variables, except for R wherethere are only two “quantiles” (0 or 1). For each variable except R,the upper graphs indicate the mean values and confidence intervals(p < 0.05) of each quantile on the variable considered, while thelower graphs show contour plots for the same variable with brighterareas corresponding to higher growth ratios.

3.1 Significant descriptive indicatorsWe found significant correlations between a number of descriptiveindicators of wiki structure and their content and population growthrates.

Figure 1 shows the effect of user activity (measured as the pro-portion of edits per user) on growth rates. The results suggest thatuser activity correlates very strongly with wiki growth, not only interms of content production (which is to a certain extent unsurpris-ing) but also new member recruitment. The effect becomes strongerwith initially more populated wikis: the more users are activelyediting, the more a wiki grows in content and population.

Figure 2 shows the impact of user density on growth. The re-sults suggest that a higher number of contributors per page does notnecessarily indicate mushrooming wikis: for an identical contentsize, we found a significant correlation between a lower number ofusers and higher growth ratios, both in content and new members.

To better visualize the effect of user density on growth, we rep-resented the dependent variables GU and GP , independent vari-

1

2

3

4

5

6

7

8

U!P

400 1000 10000 20000

10

100

1000

10000

100000

Population !users"

Content

!goodpages"

Figure 3: Phase diagram in the content/population space ofwikis belonging to the clean dataset.

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 9: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Growth regulators (1): user density

of their population or content.

3. WIKI DYNAMICSWe assessed wiki dynamics by comparing their diverse paths withrespect to a set of independent variables. ‘Growth’ is defined interms of population and content size change: user growth GU (resp.page growth GP ) is the ratio between final and initial populations(resp. content sizes): GU = Ulast/Ufirst (resp. GP = Plast/Pfirst).Wiki dynamics were studied as a function of the variables listed inSection 2.2:

(I) DESCRIPTIVE INDICATORS, i.e. variables on which wiki ad-ministrators have no direct control: (a) user activity, i.e. theproportion of edits per user (E/U ), (b) user density, i.e. theproportion of users per page (U/P ), and (c) edit density, i.e.the proportion of edits per page (E/P ).

(II) GOVERNANCE FACTORS, variables that wiki administratorscan directly control: (a) administrator ratio, i.e. the propor-tion of users who are granted administrator status (A/U ),(b) administrator density, i.e. the proportion of administra-tors per page (A/P ), (c) editing permission (R).

For each continuous variable, instead of carrying out a delicateanalysis by dealing with clouds of points, we adopted a more in-sightful approach by dividing wikis into five quantiles, each includ-ing exactly 20% of all wikis in the clean dataset (see Table 1). We

1 2 3 4 5

Quantile of

edits per user1.0

1.1

1.2

1.3

1.4

1.5

1.6

Growth ratio

User growth

1 2 3 4 5

Quantile of

edits per user

0.95

1.00

1.05

1.10

1.15

1.20

1.25

1.30

Growth ratio

Page growth

1.1

1.11.1

1.2

1.2

1.3

1.3

1.4

1.4

1 2 3 4 51

2

3

4

Quantiles of 'edits per user'

Populationquantiles

User growth

1

1

1

1.1

1.1

1.1

1.1

1.11.2

1.2

1 2 3 4 51

2

3

4

Quantiles of 'edits per user'

Populationquantiles

Page growth

Figure 1: Growth landscape with respect to the proportion ofedits per user.

1 2 3 4 5

Quantile of

users per good page1.0

1.1

1.2

1.3

1.4

1.5

1.6

Growth ratio

User growth

1 2 3 4 5

Quantile of

users per good page1.00

1.05

1.10

1.15

1.20

1.25

1.30

Growth ratio

Page growth

1

1.1

1.1

1.2

1.2

1.3

1.3

1.41.4 1.5

1 2 3 4 51

2

3

4

Quantiles of 'users per good page'

Populationquantiles

User growth

1

1

1

1.1

1.1

1.1

1.1

1.2

1 2 3 4 51

2

3

4

Quantiles of 'users per good page'

Populationquantiles

Page growth

Figure 2: Growth landscape with respect to the proportion ofusers per good page.

then computed and compared growth ratio means over all wikis foreach quantile. Additionally, we distinguished population quantilesin order to control for user size-related effects. To this end, weplotted a growth landscape that consists of a two-dimensional rep-resentation of the various growth ratios. This representation wasapplied to all the above-mentioned variables, except for R wherethere are only two “quantiles” (0 or 1). For each variable except R,the upper graphs indicate the mean values and confidence intervals(p < 0.05) of each quantile on the variable considered, while thelower graphs show contour plots for the same variable with brighterareas corresponding to higher growth ratios.

3.1 Significant descriptive indicatorsWe found significant correlations between a number of descriptiveindicators of wiki structure and their content and population growthrates.

Figure 1 shows the effect of user activity (measured as the pro-portion of edits per user) on growth rates. The results suggest thatuser activity correlates very strongly with wiki growth, not only interms of content production (which is to a certain extent unsurpris-ing) but also new member recruitment. The effect becomes strongerwith initially more populated wikis: the more users are activelyediting, the more a wiki grows in content and population.

Figure 2 shows the impact of user density on growth. The re-sults suggest that a higher number of contributors per page does notnecessarily indicate mushrooming wikis: for an identical contentsize, we found a significant correlation between a lower number ofusers and higher growth ratios, both in content and new members.

To better visualize the effect of user density on growth, we rep-resented the dependent variables GU and GP , independent vari-

1

2

3

4

5

6

7

8

U!P

400 1000 10000 20000

10

100

1000

10000

100000

Population !users"

Content

!goodpages"

Figure 3: Phase diagram in the content/population space ofwikis belonging to the clean dataset.Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 10: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Wiki landscape (user density)

1

2

3

4

5

6

7

8U!P

400 1000 10000 20000

10

100

1000

10000

100000

Population !users"

Content

!goodpages"

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 11: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Growth enhancers (2): editing permission

able U/P , and initial wiki positions Ufirst and Pfirst altogether onthe same graph, yielding a phase diagram as plotted on Fig. 3.6 Inthis diagram, each dot (light color) corresponds to a wiki in thedatabase. Each arrow corresponds to a pair of quantiles “users perpage, population”. Widths and heights are proportional to user andcontent growth ratios, respectively. The size of the arrow representsthe strength of the observed growth in content and population forwikis in a given region of the wikisphere.

This graph should be regarded as a map of a portion of the wiki-sphere, showing the expected destiny of a wiki in terms of contentand population growth as a function of its initial position in thesame space. This diagram broadly suggests that a wiki’s positionis correlated with its subsequent fate. More precisely, it illustratesthat wikis in the upper/upper-right portion of the diagram are grow-ing faster, and more interestingly it provides an overview of demo-graphic dynamics in this region of the wikisphere.

3.2 Significant governance factorsTurning to governance features, we first analyzed the effects of theadministrator density on wiki dynamics by looking at the overallproportion of administrators per page.

Figure 4 shows that having a relatively high number of adminis-trators for a given content size is likely to reduce growth. There is astrong effect of the proportion of admins per page both on user andpage growth. For instance, while the last quantile of admins/pageratio enjoys near-zero growth rates over 8 months, the first quantiletops overall rates (!+50% for users, !+25% for pages). This ef-fect may be interpreted as the impact of strong governance activityon the proliferation of content and users.

6For this diagram, an increased level of detail called for a larger grid, here of 8 ! 7quantiles; U/P quantile means are represented by diagonal straight lines labelled “1–8”.

1 2 3 4 5

Quantile of

admins per good page1.0

1.1

1.2

1.3

1.4

1.5

1.6

Growth ratio

User growth

1 2 3 4 5

Quantile of

admins per good page0.95

1.00

1.05

1.10

1.15

1.20

1.25

1.30

Growth ratio

Page growth

1.1

1.1

1.2

1.3

1.3

1.4

1 2 3 4 51

2

3

4

Quantiles of 'admins per good page'

Populationquantiles

User growth

1

1

1

1

1.1

1.1

1.1

1.1

1.2

1 2 3 4 51

2

3

4

Quantiles of 'admins per good page'

Populationquantiles

Page growth

Figure 4: Growth landscape with respect to the proportion ofadmins per good page.

1 2 3 41.0

1.1

1.2

1.3

1.4

1.5

Population quantiles

Growthrate

User growth

1 2 3 41.00

1.05

1.10

1.15

1.20

1.25

1.30

1.35

1.40

Population quantiles

Growthrate

Page growth

Figure 5: Growth landscape with respect to editing permission:red dashed refers to anonymously editable wikis, while bluesolid to wikis editable by registered users only.

1 2 3 4 5

Quantile of

edits per good page1.0

1.1

1.2

1.3

1.4

1.5

Growth ratio

User growth

1 2 3 4 5

Quantile of

edits per good page1.00

1.05

1.10

1.15

1.20

1.25

Growth ratio

Page growth

1.2

1.2

1.2

1.3

1.3 1.4

1 2 3 4 51

2

3

4

Quantiles of 'edits per good page'

Populationquantiles

User growth

1.1

1.1

1.1

1.1

1.1

1.1

1 2 3 4 51

2

3

4

Quantiles of 'edits per good page'

Populationquantiles

Page growth

Figure 6: Growth landscape with respect to the proportion ofedits per good page.

1 2 3 4 5

Quantile of

admins per user1.0

1.1

1.2

1.3

1.4

1.5

Growth ratio

User growth

1 2 3 4 5

Quantile of

admins per user1.00

1.05

1.10

1.15

1.20

1.25

Growth ratio

Page growth

1.2

1.2

1.3

1.3 1.4

1 2 3 4 51

2

3

4

Quantiles of 'admins per user'

Populationquantiles

User growth

1 11

1.1

1.1

1.1

1.1

1.1

1.1

1.1

1.1

1.2

1.2

1.2

1 2 3 4 51

2

3

4

Quantiles of 'admins per user'

Populationquantiles

Page growth

Figure 7: Growth landscape with respect to the proportion ofadmins per user.

We identified another significant effect when we considered edit-ing permission. As a binary variable, the editing permission vari-able generates only two groups of wikis (wikis that allow anony-mous editing versus wikis that restrict editing to registered usersonly). The growth landscape is consequently limited to a one-dimensional comparison over population quantiles. The results inFigure 5 show that for both dimensions—population and content—having no access control is likely to favor growth. While a strongerpage growth is quite unsurprising in wikis where no registrationis required, the fact that this factor also fuels user registration ismore puzzling. One might expect that if users can participate with-out the need of registration, few would be inclined to register. Ourresults suggest on the contrary that wikis with unrestricted registra-tion trigger participation more easily than wikis that restrict access.

3.3 Neutral indicatorsFinally, we consider two indicators that showed a markedly mildercorrelation with wiki dynamics.

On the one hand, we found that edit density (i.e. edits/page) cor-relates in a moderately negative way with user growth—with a rel-atively stronger effect depending on initial population size—whilethere is surprisingly no significant correlation with page growth(Figure 6).

On the other hand, higher administrator ratios (i.e. admins/user)have no significant effect on content or population growth, as evi-denced by the contour plot on Figure 7.

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 12: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Growth regulators (2): admin density

able U/P , and initial wiki positions Ufirst and Pfirst altogether onthe same graph, yielding a phase diagram as plotted on Fig. 3.6 Inthis diagram, each dot (light color) corresponds to a wiki in thedatabase. Each arrow corresponds to a pair of quantiles “users perpage, population”. Widths and heights are proportional to user andcontent growth ratios, respectively. The size of the arrow representsthe strength of the observed growth in content and population forwikis in a given region of the wikisphere.

This graph should be regarded as a map of a portion of the wiki-sphere, showing the expected destiny of a wiki in terms of contentand population growth as a function of its initial position in thesame space. This diagram broadly suggests that a wiki’s positionis correlated with its subsequent fate. More precisely, it illustratesthat wikis in the upper/upper-right portion of the diagram are grow-ing faster, and more interestingly it provides an overview of demo-graphic dynamics in this region of the wikisphere.

3.2 Significant governance factorsTurning to governance features, we first analyzed the effects of theadministrator density on wiki dynamics by looking at the overallproportion of administrators per page.

Figure 4 shows that having a relatively high number of adminis-trators for a given content size is likely to reduce growth. There is astrong effect of the proportion of admins per page both on user andpage growth. For instance, while the last quantile of admins/pageratio enjoys near-zero growth rates over 8 months, the first quantiletops overall rates (!+50% for users, !+25% for pages). This ef-fect may be interpreted as the impact of strong governance activityon the proliferation of content and users.

6For this diagram, an increased level of detail called for a larger grid, here of 8 ! 7quantiles; U/P quantile means are represented by diagonal straight lines labelled “1–8”.

1 2 3 4 5

Quantile of

admins per good page1.0

1.1

1.2

1.3

1.4

1.5

1.6

Growth ratio

User growth

1 2 3 4 5

Quantile of

admins per good page0.95

1.00

1.05

1.10

1.15

1.20

1.25

1.30

Growth ratio

Page growth

1.1

1.1

1.2

1.3

1.3

1.4

1 2 3 4 51

2

3

4

Quantiles of 'admins per good page'

Populationquantiles

User growth

1

1

1

1

1.1

1.1

1.1

1.1

1.2

1 2 3 4 51

2

3

4

Quantiles of 'admins per good page'

Populationquantiles

Page growth

Figure 4: Growth landscape with respect to the proportion ofadmins per good page.

1 2 3 41.0

1.1

1.2

1.3

1.4

1.5

Population quantiles

Growthrate

User growth

1 2 3 41.00

1.05

1.10

1.15

1.20

1.25

1.30

1.35

1.40

Population quantiles

Growthrate

Page growth

Figure 5: Growth landscape with respect to editing permission:red dashed refers to anonymously editable wikis, while bluesolid to wikis editable by registered users only.

1 2 3 4 5

Quantile of

edits per good page1.0

1.1

1.2

1.3

1.4

1.5

Growth ratio

User growth

1 2 3 4 5

Quantile of

edits per good page1.00

1.05

1.10

1.15

1.20

1.25

Growth ratio

Page growth

1.2

1.2

1.2

1.3

1.3 1.4

1 2 3 4 51

2

3

4

Quantiles of 'edits per good page'

Populationquantiles

User growth

1.1

1.1

1.1

1.1

1.1

1.1

1 2 3 4 51

2

3

4

Quantiles of 'edits per good page'

Populationquantiles

Page growth

Figure 6: Growth landscape with respect to the proportion ofedits per good page.

1 2 3 4 5

Quantile of

admins per user1.0

1.1

1.2

1.3

1.4

1.5

Growth ratio

User growth

1 2 3 4 5

Quantile of

admins per user1.00

1.05

1.10

1.15

1.20

1.25

Growth ratio

Page growth

1.2

1.2

1.3

1.3 1.4

1 2 3 4 51

2

3

4

Quantiles of 'admins per user'

Populationquantiles

User growth

1 11

1.1

1.1

1.1

1.1

1.1

1.1

1.1

1.1

1.2

1.2

1.2

1 2 3 4 51

2

3

4

Quantiles of 'admins per user'

Populationquantiles

Page growth

Figure 7: Growth landscape with respect to the proportion ofadmins per user.

We identified another significant effect when we considered edit-ing permission. As a binary variable, the editing permission vari-able generates only two groups of wikis (wikis that allow anony-mous editing versus wikis that restrict editing to registered usersonly). The growth landscape is consequently limited to a one-dimensional comparison over population quantiles. The results inFigure 5 show that for both dimensions—population and content—having no access control is likely to favor growth. While a strongerpage growth is quite unsurprising in wikis where no registrationis required, the fact that this factor also fuels user registration ismore puzzling. One might expect that if users can participate with-out the need of registration, few would be inclined to register. Ourresults suggest on the contrary that wikis with unrestricted registra-tion trigger participation more easily than wikis that restrict access.

3.3 Neutral indicatorsFinally, we consider two indicators that showed a markedly mildercorrelation with wiki dynamics.

On the one hand, we found that edit density (i.e. edits/page) cor-relates in a moderately negative way with user growth—with a rel-atively stronger effect depending on initial population size—whilethere is surprisingly no significant correlation with page growth(Figure 6).

On the other hand, higher administrator ratios (i.e. admins/user)have no significant effect on content or population growth, as evi-denced by the contour plot on Figure 7.

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 13: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Summary of results

Editsper user

Editionpermission

Adminsper user

Edits pergood page

Users pergood page

Admins pergood page

0.9

1.0

1.1

1.2

Page growth

Editsper user

Editionpermission

Adminsper user

Edits pergood page

Users pergood page

Admins pergood page

0.7

0.8

0.9

1.0

1.1

1.2

1.3

User growth

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 14: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Summary of results

Variable Growth ratePopulation Content

STRUCTURAL

INDICATORS

User activity (E/U ) + + + +Edit density (E/P ) - —User density (U/P ) - - - -

GOVERNANCE

FACTORS

Editing permission (R) + + + +Admin ratio (A/U ) — —Admin density (A/P ) - - - -

Table 2: Effect of different factors on wiki growth rates.

3.4 Summary of findingsThe results of this study suggest that different structural and gover-nance-related factors have significant effects on the content andpopulation dynamics of a wiki. Table 2 and Figure 8 summarizethe correlations found between growth ratios and each of the vari-ables we considered, by comparing the gain in the population andcontent sizes between the last and the first quantile for each vari-able (variables in Figure 8 are ranked from the most positively tothe most negatively correlated).

If we focus on structural aspects of wikis, we note that the higherthe ratio of edits per user the faster the wiki grows, both in terms ofcontent and population. Wikis with very active user communitiesare not only likely to grow in content, but also to attract a largenumber of new contributors. This result contrasts with the oppositeeffect produced by high user density per page.

Edits

per user

Edition

permission

Admins

per user

Edits per

good page

Users per

good page

Admins per

good page

0.7

0.8

0.9

1.0

1.1

1.2

1.3

User growth

Edits

per user

Edition

permission

Admins

per user

Edits per

good page

Users per

good page

Admins per

good page

0.9

1.0

1.1

1.2

Page growth

Figure 8: Comparison of growth ratios between the last andfirst quantiles, for each variable considered.

As far as governance factors are concerned, we observed thesingular fact that population growth is in average more than 20%faster for anonymously editable wikis. This seems to support theintuition that less barriers favor population growth. Furthermorewe observed that, while too many administrators per page may hin-der the growth of a wiki (in terms of content size), the proportionof administrators per user does not appear to show a significant in-fluence on growth. In all the above cases, we observed a strikingcorrelation between content and population growth.

4. CONCLUDING REMARKSThe main outcome of this study is an account of the factors thatwiki communities should take into account in order to control theirdemographics. In this respect, we showed the remarkable dynam-ical intertwinement of population and content growth, which sug-gests that models of wiki dynamics will probably need to focus onthe strong interrelations between these two variables.

Representing via phase diagrams the impact of specific variableson wiki dynamics can be a valuable solution for wiki administra-tors for monitoring purposes and for social scientists as a first steptowards modeling. However, in order to develop accurate modelsof wiki dynamics, further empirical evidence is needed. To makethe data tractable for this study, we restricted the dataset in sev-eral ways. A more comprehensive study, beyond the scope of thepresent paper, should endeavor to investigate a larger spectrum ofwiki-based communities.

5. REFERENCES[1] U. Brandes and J. Lerner. Visual analysis of controversy in

user-generated encyclopedias. In Visual Analytics Scienceand Technology, 2007. VAST 2007. IEEE Symposium on,pages 179–186, 2007.

[2] S. L. Bryant, A. Forte, and A. Bruckman. Becomingwikipedian: Transformation of participation in acollaborative online encyclopedia. In Group’05, SanibelIsland, FL, USA, Nov 6-9 2005.

[3] A. Capocci, V. Servedio, F. Colaiori, L. Buriol, D. Donato,S. Leonardi, and G. Caldarelli. Preferential attachment in thegrowth of social networks: the internet encyclopediawikipedia. PRE, 74(3):036116, 2006.

[4] J. Giles. Internet encyclopaedias go head to head. Nature,438(7070):900–901, 2005.

[5] M. W. Godfrey and Q. Tu. Evolution in open sourcesoftware: A case study. ICSM, 00:131, 2000.

[6] H.-J. Happel and M. Treitz. Proliferation in enterprise wikis.In Proceedings of the 8th International Conference on theDesign of Cooperative Systems (COOP 08), Carry-le-Rouet,France, May 2008.

[7] A. Kittur, E. Chi, B. A. Pendleton, B. Suh, andT. Mytkowicz. Power of the few vs. wisdom of the crowd:Wikipedia and the rise of the bourgeoisie. In ALT.CHI, 2007.

[8] C. Roth. Viable wikis: struggle for life in the wikisphere. InWikiSym ’07: Proceedings of the 2007 internationalsymposium on Wikis, pages 119–124, New York, NY, USA,2007. ACM.

[9] B. Suh, E. H. Chi, A. Kittur, and B. A. Pendleton. Lifting theveil: improving accountability and social transparency inwikipedia with wikidashboard. In CHI ’08: Proceeding ofthe twenty-sixth annual SIGCHI conference on Humanfactors in computing systems, pages 1037–1040, New York,NY, USA, 2008. ACM.

[10] F. B. Viégas, M. Wattenberg, and K. Dave. Studyingcooperation and conflict between authors with history flowvisualizations. In CHI ’04: Proceedings of the 2004conference on Human factors in computing systems, pages575–582. ACM Press, 2004.

[11] D. Wilkinson and B. Huberman. Assessing the value ofcooperation in Wikipedia. First Monday, 12(4), 2007.

[12] V. Zlatic, M. Bozicevic, H. Stefancic, and M. Domazet.Wikipedias: Collaborative web-based encyclopedias ascomplex networks. PRE, 74(1):016115, 2006.

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 15: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Wiki viability: Beyond raw growth indicators

I Are population- and content- growth rates good indicatorsof the viability of a wiki-based community?

I Need of more fine-grained indicators:

I active users vs. lurkers

I metacontent production (talk pages, comments)

I wiki structure (categories, namespaces)

I social network indicators

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 16: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Wiki viability: Beyond MediaWiki

I Beyond the Wikipedia-centric paradigm andplatform-dependent data.

I Methodological issue: how to extract reliable empirical datafrom a variety of wiki platforms?

http://wikitracer.com/docs/Help us build a standard framework to track the evolution ofwiki-based online communities!

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert

Page 17: Measuring Wiki viability

Viable wikis Wiki dynamics Growth enhancers and regulators Research directions and conclusions

Acknowledgments

FundingI PATRES: Pattern Resilience (NEST-043268)

Special thanks to:I s23.org for giving access to their largest mediawiki

databaseI WikiSym reviewers for valuable feedback

Measuring Wiki Viability C. Roth, D. Taraborelli, N. Gilbert


Recommended