ABSTRACT
Twitter user profiles contain rich information that allows researchers to infer particular attributes of users' identities. Knowing identity attributes such as gender, age, or nationality is a first step in many computational social science studies that seek to describe social phenomena. Often, it is only through such attributes that social media studies of, for example, the isolation of foreigners become possible. However, Twitter users rarely state such characteristics explicitly, so researchers must infer them by other means. In this paper, we discuss the challenge of detecting the nationality of Twitter users using rich features from their profiles, and we examine the effectiveness of different features for this task. For the case of a highly diverse country, Qatar, we provide a detailed network analysis with insights into user behavior and linking preferences (or the lack thereof) toward other nationalities.
Index Terms
- Inferring nationalities of Twitter users and studying inter-national linking