Abstract
Our reliance on networked, collectively built information is a vulnerability when the quality or reliability of this information is poor. Wikipedia, one such collectively built information source, is often our first stop for information on all kinds of topics; its quality has stood up to many tests, and it prides itself on having a “neutral point of view.” Enforcement of neutrality is in the hands of comparatively few, powerful administrators. In this article, we document that a surprisingly large number of editors change their behavior and begin focusing more on a particular controversial topic once they are promoted to administrator status. The conscious and unconscious biases of these few, but powerful, administrators may be shaping the information on many of the most sensitive topics on Wikipedia; some may even be explicitly infiltrating the ranks of administrators in order to promote their own points of view. In addition, we ask whether administrators who change their behavior in this suspicious manner can be identified in advance. Neither prior history nor vote counts during an administrator’s election are useful in doing so, but we find that an alternative measure, which gives more weight to influential voters, can successfully reject these suspicious candidates. This second result has important implications for how we harness collective intelligence: even if wisdom exists in a collective opinion (like a vote), that signal can be lost unless we carefully distinguish the true expert voter from the noisy or manipulative voter.
- Sinan Aral, Lev Muchnik, and Arun Sundararajan. 2009. Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks. Proceedings of the National Academy of Sciences 106, 51 (2009), 21544--21549. Google ScholarCross Ref
- David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research 3 (March 2003), 993--1022.Google ScholarDigital Library
- Moira Burke and Robert Kraut. 2008. Mopping up: Modeling Wikipedia promotion decisions. In Proceedings of the ACM Conference on Computer Supported Cooperative Work. 27--36. Google ScholarDigital Library
- 2008. Candid CAMERA. Harper’s Magazine. July 2008. 23--24.Google Scholar
- Sanmay Das, Allen Lavoie, and Malik Magdon-Ismail. 2013. Manipulation among the arbiters of collective intelligence: How Wikipedia administrators mold public opinion. In Proceedings of the 22nd ACM Conference of Information and Knowledge Management (CIKM’13). 1097--1106. Google ScholarDigital Library
- Sanmay Das and Malik Magdon-Ismail. 2010. Collective wisdom: Information growth in wikis and blogs. In Proceedings of the ACM Conference on Electronic Commerce. 231--240. Google ScholarDigital Library
- Matthew Gentzkow and Jesse M. Shapiro. 2010. What drives media slant? Evidence from US daily newspapers. Econometrica 78, 1 (2010), 35--71. Google Scholar
- Arpita Ghosh, Satyen Kale, and Preston McAfee. 2011. Who moderates the moderators? Crowdsourcing abuse detection in user-generated content. In Proceedings of the ACM Conference on Electronic Commerce. ACM, New York, NY, 167--176. DOI:http://dx.doi.org/10.1145/1993574.1993599 Google ScholarDigital Library
- Jim Giles. 2005. Internet encyclopaedias go head to head. Nature 438, 7070 (Dec. 2005), 900--901. Google ScholarCross Ref
- Thomas L. Griffiths and Mark Steyvers. 2004. Finding scientific topics. Proceedings of the National Academy of Sciences 101 (Apr 2004), 5228--5235. DOI:http://dx.doi.org/10.1073/pnas.0307752101 Google ScholarCross Ref
- R. Hanson. 2002. Decision markets. Entrepreneurial Economics: Bright Ideas from the Dismal Science (2002), 79.Google Scholar
- M. Hindman, K. Tsioutsiouliklis, and J. A. Johnson. 2003. Googlearchy: How a few heavily-linked sites dominate politics on the web. In Proceedings of the Annual Meeting of the Midwest Political Science Association, Vol. 4. 1--33.Google Scholar
- Gabriela Kalna and Desmond J. Higham. 2007. A clustering coefficient for weighted networks, with application to gene expression data. AI Communications 20, 4 (Dec. 2007), 263--271.Google Scholar
- Aniket Kittur, Bongwon Suh, Bryan A. Pendleton, and Ed H. Chi. 2007. He says, she says: Conflict and coordination in Wikipedia. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Google ScholarDigital Library
- D. Lazer, A. S. Pentland, L. Adamic, S. Aral, A. L. Barabasi, D. Brewer, N. Christakis, N. Contractor, J. Fowler, M. Gutmann, T. Jebara, G. King, M. Macy, D. Roy, and M. Van Alstyne. 2009. Life in the network: The coming age of computational social science. Science 323, 5915 (2009), 721.Google Scholar
- Chenliang Li, Anwitaman Datta, and Aixin Sun. 2012. Mining latent relations in peer-production environments: A case study with Wikipedia article similarity and controversy. Social Network Analysis and Mining 2, 3 (2012), 265--278. Google ScholarCross Ref
- Zhiyuan Liu, Yuzhou Zhang, Edward Y. Chang, and Maosong Sun. 2011. PLDA+: Parallel latent Dirichlet allocation with data placement and pipeline processing. ACM Transactions on Intelligent Systems and Technology (TIST) 2, 3 (2011), 26.Google ScholarDigital Library
- Rui Lopes and Luis Carriço. 2008. On the credibility of wikipedia: An accessibility perspective. In Proceedings of the 2nd ACM Workshop on Information Credibility on the Web (WICOW’08). ACM, New York, NY, 27--34. DOI:http://doi.acm.org/10.1145/1458527.1458536Google ScholarDigital Library
- F. Menczer, S. Fortunato, A. Flammini, and A. Vespignani. 2006. Googlearchy or Googlocracy? IEEE Spectrum Online (2006).Google Scholar
- M. Moyer. 2010. Manipulation of the crowd. Scientific American Magazine 303, 1 (2010), 26--28. Google ScholarCross Ref
- Martin Potthast, Benno Stein, and Robert Gerling. 2008. Automatic vandalism detection in Wikipedia. In Proceedings of the IR Research, 30th European Conference on Advances in Information Retrieval (ECIR’08). Springer, Berlin, 663--668. Google ScholarCross Ref
- P. Resnick and R. Sami. 2007. The influence limiter: Provably manipulation-resistant recommender systems. In Proceedings of the ACM Conference on Recommender Systems. ACM, 25--32. Google ScholarDigital Library
- Alastair Sloan. 2015. Manipulating Wikipedia to promote a bogus business school. Newsweek (March 24 2015).Google Scholar
- Koen Smets, Bart Goethals, and Brigitte Verdonk. 2008. Automatic vandalism detection in Wikipedia: Towards a machine learning approach. In Proceedings of the AAAI Workshop on Wikipedia and Artificial Intelligence.Google Scholar
- Anselm Spoerri. 2007. What is popular on Wikipedia and why? First Monday 12, 4 (April 2007). Google ScholarCross Ref
- Ba-Quy Vuong, Ee-Peng Lim, Aixin Sun, Minh-Tam Le, and Hady Wirawan Lauw. 2008. On ranking controversies in Wikipedia: Models and evaluation. In Proceedings of the International Conference on Web Search and Web Data Mining. 171--182. Google ScholarDigital Library
- Howard T. Welser, Dan Cosley, Gueorgi Kossinets, Austin Lin, Fedor Dokshin, Geri Gay, and Marc Smith. 2011. Finding social roles in Wikipedia. In Proceedings of the 2011 iConference (iConference’11). ACM, New York, NY, 122--129. DOI:http://dx.doi.org/10.1145/1940761.1940778 Google ScholarDigital Library
- Dennis M. Wilkinson and Bernardo A. Huberman. 2007. Assessing the value of coooperation in Wikipedia. First Monday 12, 4 (Feb 2007). Google ScholarCross Ref
- Justin Wolfers and Eric Zitzewitz. 2004. Prediction markets. Journal of Economic Perspectives 18, 2 (2004), 107--126. Google ScholarCross Ref
Index Terms
-
Manipulation among the Arbiters of Collective Intelligence: How Wikipedia Administrators Mold Public Opinion
-
Recommendations
-
Manipulation among the arbiters of collective intelligence: how wikipedia administrators mold public opinion
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge ManagementOur reliance on networked, collectively built information is a vulnerability when the quality or reliability of this information is poor. Wikipedia, one such collectively built information source, is often our first stop for information on all kinds of ...
-
Collective Intelligence: from the Enlightenment to the Crowd Science
ICCSE'17: Proceedings of the 2nd International Conference on Crowd Science and EngineeringFrom1 the 18th century Enlightenment to the New Millennium, the wisdom of crowd, later named the collective intelligence, has attracted many scientists to study its principle and application. This article shows the picture of collective intelligence ...
-
A Sociopsychological Perspective on Collective Intelligence in Metaheuristic Computing
In studies of genetic algorithms, evolutionary computing, and ant colony mechanisms, it is recognized that the higher-order forms of collective intelligence play an important role in metaheuristic computing and computational intelligence. Collective ...
Comments