DOI: 10.1145/3091478.3091488

"(Weitergeleitet von Journalistin)": The Gendered Presentation of Professions on Wikipedia

Published: 25 June 2017


Previous research has shown that search engine results depict professions and occupations with a gender bias. Such an unbalanced presentation might just as likely occur on Wikipedia, one of the most popular knowledge resources on the Web, since past studies have already found the encyclopedia to exhibit such tendencies. Under this premise, our work assesses gender bias in the content of German Wikipedia articles about professions and occupations along three dimensions: the male vs. female titles used (including redirects), the images of persons included, and the names of professionals mentioned in the articles. We further use German labor market data to assess the potential misrepresentation of a gender for each specific profession. Our findings provide evidence for a systematic over-representation of men on all three dimensions. For instance, for professional fields dominated by women, the respective articles on average still feature almost twice as many images of men; and on average, 83% of the mentioned names of professionals were male and only 17% female.
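To make the comparison against labor market data concrete, the following is a minimal sketch of one way a per-profession gap could be computed. It is not the authors' pipeline: the `ProfessionCounts` type, the function names, and all numbers are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProfessionCounts:
    """Hypothetical per-article counts; not data from the paper."""
    male_names: int            # names in the article inferred as male
    female_names: int          # names in the article inferred as female
    female_labor_share: float  # share of women in the profession, in [0, 1]


def female_share(male: int, female: int) -> Optional[float]:
    """Fraction of gendered mentions that are female, or None if there are none."""
    total = male + female
    return female / total if total else None


def representation_gap(counts: ProfessionCounts) -> Optional[float]:
    """Female share in the article minus female share in the labor market.

    A negative value means women appear less often in the article than
    their share of the workforce would suggest.
    """
    share = female_share(counts.male_names, counts.female_names)
    if share is None:
        return None
    return share - counts.female_labor_share


if __name__ == "__main__":
    # Made-up example: 83 male and 17 female names, 50% women in the workforce.
    example = ProfessionCounts(male_names=83, female_names=17, female_labor_share=0.5)
    print(representation_gap(example))
```

A signed difference is only one possible choice; a ratio or a statistical test over many articles would serve the same purpose under different assumptions.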








Published In

WebSci '17: Proceedings of the 2017 ACM on Web Science Conference
June 2017
438 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.



Association for Computing Machinery

New York, NY, United States




Author Tags

  1. gender bias
  2. gender inequality
  3. professions
  4. wikipedia


  • Research-article


WebSci '17
WebSci '17: ACM Web Science Conference
June 25 - 28, 2017
Troy, New York, USA

Acceptance Rates

WebSci '17 Paper Acceptance Rate 30 of 85 submissions, 35%;
Overall Acceptance Rate 245 of 933 submissions, 26%



