ABSTRACT
Accessibility researchers have difficulty recruiting representative participants with disabilities given their scarcity. The rich information on social media provides accessibility researchers with a new approach to collecting data about these populations. Because social media is used by multiple stakeholders, a major barrier to this approach is differentiating representative users who have disabilities from unrepresentative users who do not. We (1) introduce an empirical study that compares representative users who are amputees with unrepresentative users in terms of linguistic behavior, online interaction, and community characteristics on Reddit and (2) develop a feature extraction method based on statistical analyses and graph mining to classify representative users. Those features allow us to detect amputees using a supervised learning method with an overall accuracy of 88% in amputee-related subreddits. Our findings improve our understanding of anonymous online users with physical disabilities, and can inform better tools for online data collection for accessibility researchers.
- A. Sears and V. L. Hanson, "Representing users in accessibility research," ACM Transactions on Accessible Computing (TACCESS), vol. 4, no. 2, p. 7, 2012. Google ScholarDigital Library
- J. P. Bigham and A. C. Cavender, "Evaluating existing audio captchas and an interface optimized for non-visual use," in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 2009, pp. 1829--1838. Google ScholarDigital Library
- H. Petrie, F. Hamilton, N. King, and P. Pavan, "Remote usability evaluations with disabled people," in Proceedings of the SIGCHI conference on Human Factors in computing systems. ACM, 2006, pp. 1133--1141. Google ScholarDigital Library
- M. De Choudhury, E. Kiciman, M. Dredze, G. Coppersmith, and M. Kumar, "Discovering shifts to suicidal ideation from mental health content in social media," in Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, 2016, pp. 2098--2110. Google ScholarDigital Library
- M. Walker, L. Thornton, M. De Choudhury, J. Teevan, C. M. Bulik, C. A. Levinson, and S. Zerwas, "Facebook use and disordered eating in college-aged women," Journal of Adolescent Health, vol. 57, no. 2, pp. 157--163, 2015. Google ScholarCross Ref
- X. Ma, J. Hancock, and M. Naaman, "Anonymity, intimacy and self-disclosure in social media," in Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, 2016, pp. 3857--3869. Google ScholarDigital Library
- M. Dee and V. L. Hanson, "A large user pool for accessibility research with representative users," in Proceedings of the 16th international ACM SIGACCESS conference on Computers & accessibility. ACM, 2014, pp. 35--42. Google ScholarDigital Library
- M. R. Morris, A. Perkins, C. Yao, S. Bahram, J. P. Bigham, and S. K. Kane, ""with most of it being pictures now, i rarely use it": Understanding twitter's evolving accessibility to blind users," in Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, 2016, pp. 5506--5516. Google ScholarDigital Library
- S. Wu and L. A. Adamic, "Visually impaired users on an online social network," in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 2014, pp. 3133--3142. Google ScholarDigital Library
- V. Voykinska, S. Azenkot, S. Wu, and G. Leshed, "How blind people interact with visual content on social networking services," in Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing. ACM, 2016, pp. 1584--1595. Google ScholarDigital Library
- M. Burke, R. Kraut, and D. Williams, "Social use of computer-mediated communication by adults on the autism spectrum," in Proceedings of the 2010 ACM conference on Computer supported cooperative work. ACM, 2010, pp. 425--434. Google ScholarDigital Library
- B. Wentz and J. Lazar, "Are separate interfaces inherently unequal?: an evaluation with blind users of the usability of two interfaces for a social networking platform," in Proceedings of the 2011 iConference. ACM, 2011, pp. 91--97. Google ScholarDigital Library
- M. J. Paul and M. Dredze, "You are what you tweet: Analyzing twitter for public health." ICWSM, vol. 20, pp. 265--272, 2011.Google Scholar
- C. W. Schmidt, "Using social media to predict and track disease outbreaks," Environmental health perspectives, vol. 120, no. 1, p. A31, 2012. Google ScholarCross Ref
- S. Argamon, M. Koppel, J. W. Pennebaker, and J. Schler, "Mining the blogosphere: Age, gender and the varieties of self-expression," First Monday, vol. 12, no. 9, 2007. Google ScholarCross Ref
- J. D. Burger, J. Henderson, G. Kim, and G. Zarrella, "Discriminating gender on twitter," in Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2011, pp. 1301--1309.Google Scholar
- D. Rao, D. Yarowsky, A. Shreevats, and M. Gupta, "Classifying latent user attributes in twitter," in Proceedings of the 2nd international workshop on Search and mining user-generated contents. ACM, 2010, pp. 37--44. Google ScholarDigital Library
- J. Schler, M. Koppel, S. Argamon, and J. W. Pennebaker, "Effects of age and gender on blogging." in AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs, vol. 6, 2006, pp. 199--205.Google Scholar
- E. L. Brady, Y. Zhong, M. R. Morris, and J. P. Bigham, "Investigating the appropriateness of social network question asking as a resource for blind users," in Proceedings of the 2013 conference on Computer supported cooperative work. ACM, 2013, pp. 1225--1236. Google ScholarDigital Library
- M. Hoffman, F. R. Bach, and D. M. Blei, "Online learning for latent dirichlet allocation," in advances in neural information processing systems, 2010, pp. 856--864.Google Scholar
- P. E. Compeau, P. A. Pevzner, and G. Tesler, "How to apply de bruijn graphs to genome assembly," Nature biotechnology, vol. 29, no. 11, pp. 987--991, 2011. Google ScholarCross Ref
- P. Pons and M. Latapy, "Computing communities in large networks using random walks." J. Graph Algorithms Appl., vol. 10, no. 2, pp. 191--218, 2006. Google ScholarCross Ref
- A. N. Langville and C. D. Meyer, Google's PageRank and beyond: The science of search engine rankings. Princeton University Press, 2011.Google ScholarDigital Library
- M. Sandri and P. Zuccolotto, "A bias correction algorithm for the gini variable importance measure in classification trees," Journal of Computational and Graphical Statistics, 2012.Google Scholar
- C. Strobl, A.-L. Boulesteix, A. Zeileis, and T. Hothorn, "Bias in random forest variable importance measures: Illustrations, sources and a solution," BMC bioinformatics, vol. 8, no. 1, p. 1, 2007. Google ScholarCross Ref
- R. Tibshirani, "Regression shrinkage and selection via the lasso," Journal of the Royal Statistical Society. Series B (Methodological), pp. 267--288, 1996.Google ScholarCross Ref
- L. Breiman, "Random forests," Machine learning, vol. 45, no. 1, pp. 5--32, 2001. Google ScholarDigital Library
- S. Rude, E.-M. Gortner, and J. Pennebaker, "Language use of depressed and depression-vulnerable college students," Cognition & Emotion, vol. 18, no. 8, pp. 1121--1133, 2004. Google ScholarCross Ref
- M. Pasupathi, "Telling and the remembered self: Linguistic differences in memories for previously disclosed and previously undisclosed events," Memory, vol. 15, no. 3, pp. 258--270, 2007. Google ScholarCross Ref
- J. W. Pennebaker and L. A. King, "Linguistic styles: language use as an individual difference." Journal of personality and social psychology, vol. 77, no. 6, p. 1296, 1999. Google ScholarCross Ref
- M. R. Mehl, S. D. Gosling, and J. W. Pennebaker, "Personality in its natural habitat: manifestations and implicit folk theories of personality in daily life." Journal of personality and social psychology, vol. 90, no. 5, p. 862, 2006. Google ScholarCross Ref
- M. L. Calle and V. Urrea, "Letter to the editor: stability of random forest importance measures," Briefings in bioinformatics, vol. 12, no. 1, pp. 86--89, 2011. Google ScholarCross Ref
- Understanding and Classifying Online Amputee Users on Reddit
Recommendations
Visually impaired users on an online social network
CHI '14: Proceedings of the SIGCHI Conference on Human Factors in Computing SystemsIn this paper we present the first large-scale empirical study of how visually impaired people use online social networks, specifically Facebook. We identify a sample of 50K visually impaired users, and study the activities they perform, the content ...
Online communities for older users: what can we learn from local community interactions to create social sites that work for older people
BCS-HCI '14: Proceedings of the 28th International BCS Human Computer Interaction Conference on HCI 2014 - Sand, Sea and Sky - Holiday HCIThis study looked at the significance of family and local community connections in determining online community engagement amongst a sample of older people in the south of England. Four catalysts were identified which motivated engagement with local and ...
Classifying online social network users through the social graph
FPS'12: Proceedings of the 5th international conference on Foundations and Practice of SecurityIn this paper, we address the problem of classifying online social network users using a naively anonymized version of a social graph. We use two main user attributes defined by the graph structure to build an initial classifier, node degree and ...
Comments