skip to main content
10.1145/2631775.2631825acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
short-paper

Inferring nationalities of Twitter users and studying inter-national linking

Authors Info & Claims
Published:01 September 2014Publication History

ABSTRACT

Twitter user profiles contain rich information that allows researchers to infer particular attributes of users' identities. Knowing identity attributes such as gender, age, and/or nationality are a first step in many studies which seek to describe various phenomena related to computational social science. Often, it is through such attributes that studies of social media that focus on, for example, the isolation of foreigners, become possible. However, such characteristics are not often clearly stated by Twitter users, so researchers must turn to other means to ascertain various categories of identity. In this paper, we discuss the challenge of detecting the nationality of Twitter users using rich features from their profiles. In addition, we look at the effectiveness of different features as we go about this task. For the case of a highly diverse country---Qatar---we provide a detailed network analysis with insights into user behaviors and linking preference (or the lack thereof) to other nationalities.

References

  1. E. Badger. Map the iphone users in any city, and you know where the rich live, 2013.Google ScholarGoogle Scholar
  2. P. Bourdieu. Distinction: A social critique of the judgement of taste. Harvard University Press, 1984.Google ScholarGoogle Scholar
  3. J. D. Burger, J. Henderson, G. Kim, and G. Zarrella. Discriminating gender on twitter. In EMNLP, pages 1301--1309, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Z. Cheng, J. Caverlee, and K. Lee. You are where you tweet: a content-based approach to geo-locating twitter users. In CIKM, pages 759--768. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. H. Friedman. Stochastic gradient boosting. Computational Statistics & Data Analysis, 38(4):367--378, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. O. G. Gavilanes, D. Quercia, and A. Jaimes. Cultural dimensions in twitter: Time, individualism and power. In ICWSM, 2013.Google ScholarGoogle Scholar
  7. B. Hecht, L. Hong, B. Suh, and E. H. Chi. Tweets from justin bieber's heart: the dynamics of the location field in user profiles. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pages 237--246. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Mahmud, J. Nichols, and C. Drews. Where is this tweet from? inferring home locations of twitter users. In ICWSM, 2012.Google ScholarGoogle Scholar
  9. A. Mislove, S. Lehmann, Y.-Y. Ahn, J.-P. Onnela, and J. N. Rosenquist. Understanding the demographics of twitter users. In ICWSM, pages 554--557, 2011.Google ScholarGoogle Scholar
  10. L. Mitchell, M. R. Frank, K. D. Harris, P. S. Dodds, and C. M. Danforth. The geography of happiness: Connecting twitter sentiment and expression, demographics, and objective characteristics of place. PLOS One, 8, 2013.Google ScholarGoogle Scholar
  11. C. M. Paschyn. Anatomy of a globalized state. Think. Issue 2, 2012.Google ScholarGoogle Scholar
  12. M. Pennacchiotti and A.-M. Popescu. A machine learning approach to twitter user classification. In ICWSM, pages 281--288, 2011.Google ScholarGoogle Scholar
  13. B. Poblete, R. O. G. Gavilanes, M. Mendoza, and A. Jaimes. Do all birds tweet the same?: characterizing twitter around the world. In CIKM, pages 1025--1030, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Quercia, J. Ellis, L. Capra, and J. Crowcroft. Tracking "gross community happiness" from tweets. In CSCW, pages 965--968, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. Quercia, D. Ó. Séaghdha, and J. Crowcroft. Talk of the city: Our tweets, our community happiness. In ICWSM, 2012.Google ScholarGoogle Scholar
  16. D. Rao, D. Yarowsky, A. Shreevats, and M. Gupta. Classifying latent user attributes in twitter. In SMUC, pages 37--44, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. Santani and D. Gatica-Perez. Speaking swiss: languages and venues in foursquare. In ACM Multimedia, pages 501--504, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. H. A. Schwartz, J. C. Eichstaedt, M. L. Kern, L. Dziurzynski, R. E. Lucas, M. Agrawal, G. J. Park, S. K. Lakshmikanth, S. Jha, M. E. P. Seligman, and L. H. Ungar. Characterizing geographic variation in well-being using tweets. In ICWSM, 2013.Google ScholarGoogle Scholar
  19. N. Shuyo. Language detection library for java, 2010.Google ScholarGoogle Scholar
  20. P. Treeratpituk and C. L. Giles. Name-ethnicity classification and ethnicity-sensitive name matching. In AAAI, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. W. Xie, C. Li, F. Zhu, E.-P. Lim, and X. Gong. When a friend in twitter is a friend in life. In WebSci, pages 344--347, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. I. P. Young and J. A. Fox. Asian, hispanic, and native american job candidates: Prescreened or screened within the selection process. Educational Administration Quarterly, 38(4):530--554, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  23. F. A. Zamal, W. Liu, and D. Ruths. Homophily and latent attribute inference: Inferring latent attributes of twitter users from neighbors. In ICWSM, pages 387--390, 2012.Google ScholarGoogle Scholar

Index Terms

  1. Inferring nationalities of Twitter users and studying inter-national linking

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        HT '14: Proceedings of the 25th ACM conference on Hypertext and social media
        September 2014
        346 pages
        ISBN:9781450329545
        DOI:10.1145/2631775

        Copyright © 2014 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 September 2014

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • short-paper

        Acceptance Rates

        HT '14 Paper Acceptance Rate49of86submissions,57%Overall Acceptance Rate378of1,158submissions,33%

        Upcoming Conference

        HT '24
        35th ACM Conference on Hypertext and Social Media
        September 10 - 13, 2024
        Poznan , Poland

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader