ABSTRACT
Wikipedia has millions of articles, many of which receive little attention. One group of Wikipedians believes these obscure entries should be removed because they are uninteresting and neglected; these are the deletionists. Other Wikipedians disagree, arguing that this long tail of articles is precisely Wikipedia's advantage over other encyclopedias; these are the inclusionists. This paper looks at two overarching questions on the debate between deletionists and inclusionists: (1) What are the implications to the long tail of the evolving standards for article birth and death? (2) How is viewership affected by the decreasing notability of articles in the long tail? The answers to five detailed research questions that are inspired by these overarching questions should help better frame this debate and provide insight into how Wikipedia is evolving.
- N. Agarwal, H. Liu, L. Tang, and P. S. Yu. Identifying the influential bloggers in a community. In Proc. WSDM 2008, pages 207--218, Palo Alto, CA, USA, 2008. ACM. Google ScholarDigital Library
- C. Anderson. The Long Tail: Why the Future of Business is Selling Less of More. Hyperion, July 2006. Google ScholarDigital Library
- I. Beschastnikh, T. Kriplean, and D. W. McDonald. Wikipedian self-governance in action: Motivating the policy lens. In Proc. ICWSM 2008, Seattle,WA, USA, 2008. AAAI.Google Scholar
- S. L. Bryant, A. Forte, and A. Bruckman. Becoming Wikipedian: Transformation of participation in a collaborative online encyclopedia. In Proc. GROUP 2005, pages 1--10, Sanibel Island, FL, USA, 2005. ACM. Google ScholarDigital Library
- A. Clauset, C. R. Shalizi, and M. E. J. Newman. Power-law distributions in empirical data. 0706.1062, June 2007.Google Scholar
- A. Forte and A. Bruckman. Scaling consensus: Increasing decentralization in Wikipedia governance. In Proc. HICSS 2008, page 157, 2008. Google ScholarDigital Library
- J. Giles. Internet encyclopaedias go head to head. Nature, 438(7070):900--901, Dec. 2005.Google ScholarCross Ref
- S. A. Golder and B. A. Huberman. Usage patterns of collaborative tagging systems. J. Inf. Sci., 32(2):198--208, 2006. Google ScholarDigital Library
- M. L. Goldstein, S. A. Morris, and G. G. Yen. Problems with fitting to the power-law distribution. The European Physical Journal B - Condensed Matter and Complex Systems, 41(2):255--258, 2004.Google Scholar
- A. Kittur, E. H. Chi, B. A. Pendleton, B. Suh, and T. Mytkowicz. Power of the few vs. wisdom of the crowd: Wikipedia and the rise of the bourgeoisie. In Proc. CHI 2007, Montreal, Quebec, Canada, 2007. ACM.Google Scholar
- A. Kittur, B. Suh, B. A. Pendleton, and E. H. Chi. He says, she says: Conflict and coordination in Wikipedia. In Proc. CHI 2007, pages 453--462, San Jose, CA, USA, 2007. ACM. Google ScholarDigital Library
- T. Kriplean, I. Beschastnikh, D. W. McDonald, and S. A. Golder. Community, consensus, coercion, control: CS*W or how policy mediates mass participation. In Proc. GROUP 2007, pages 167--176, Sanibel Island, FL, USA, 2007. ACM. Google ScholarDigital Library
- D. Milne, O. Medelyan, and I. H. Witten. Mining domain-specific thesauri from Wikipedia: A case study. In Proc. WI 2006, pages 442--448. IEEE CS, 2006. Google ScholarDigital Library
- R. Priedhorsky, J. Chen, S. T. K. Lam, K. Panciera, L. Terveen, and J. Riedl. Creating, destroying, and restoring value in wikipedia. In Proc. GROUP 2007, pages 259--268, Sanibel Island, FL, USA, 2007. ACM. Google ScholarDigital Library
- THEwikiStics. Yearly wikimedia page hits comparison. http://wikistics.falsikon.de/2008/, 2008.Google Scholar
- F. B. Viegas, M. Wattenberg, and K. Dave. Studying cooperation and conflict between authors with history flow visualizations. In Proc. CHI 2004, pages 575--582, Vienna, Austria, 2004. ACM. Google ScholarDigital Library
- B.-Q. Vuong, E.-P. Lim, A. Sun, M.-T. Le, and H. W. Lauw. On ranking controversies in Wikipedia: Models and evaluation. In Proc. WSDM 2008, pages 171--182, Palo Alto, CA, USA, 2008. ACM. Google ScholarDigital Library
- D. M. Wilkinson and B. A. Huberman. Cooperation and quality in Wikipedia. In Proc. WikiSym 2007, pages 157--164, Montreal, Quebec, Canada, 2007. ACM. Google ScholarDigital Library
- F. Wu, R. Hoffmann, and D. S. Weld. Information extraction from Wikipedia: Moving down the long tail. In Proc. KDD 2008, pages 731--739, Las Vegas, NV, USA, 2008. ACM. Google ScholarDigital Library
Index Terms
- Is Wikipedia growing a longer tail?
Recommendations
Wikipedia's “Neutral Point of View”: Settling Conflict through Ambiguity
This article discusses how one of the most important Wikipedia policies, the “neutral point of view” (NPOV), is appropriated and interpreted by the participants in the Wikipedia project. By analyzing a set of constitutive documents for the Wikipedian ...
Learning multilingual named entity recognition from Wikipedia
We automatically create enormous, free and multilingual silver-standard training annotations for named entity recognition (ner) by exploiting the text and structure of Wikipedia. Most ner systems rely on statistical models of annotated data to identify ...
Two-stage approach to named entity recognition using Wikipedia and DBpedia
IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and CommunicationIn natural language understanding, extraction of named entity (NE) mentions in given text and classification of the mentions into pre-defined NE types are important processes. Most NE recognition (NER) relies on resources such as a training corpus or NE ...
Comments