DOI: 10.1145/3025453.3025820

Research Article | Public Access

Differences in Crowdsourced vs. Lab-based Mobile and Desktop Input Performance Data

Published: 02 May 2017

ABSTRACT

Research on the viability of using crowdsourcing for HCI performance experiments has concluded that online results are similar to those achieved in the lab, at least for desktop interactions. However, mobile devices, the most popular form of online access today, may be more problematic due to variability in the user's posture and in movement of the device. To assess this possibility, we conducted two experiments with 30 lab-based and 303 crowdsourced participants using basic mouse and touchscreen tasks. Our findings show that: (1) separately analyzing the crowd and lab data yields different study conclusions: touchscreen input was significantly less error prone than mouse input in the lab but more error prone online; (2) age-matched crowdsourced participants were significantly faster and less accurate than their lab-based counterparts, contrasting past work; (3) variability in mobile device movement and orientation increased as experimenter control decreased, a potential factor affecting the touchscreen error differences. This study cautions against assuming that crowdsourced data for performance experiments will directly reflect lab-based data, particularly for mobile devices.
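The abstract does not spell out how speed and accuracy were quantified; for basic mouse and touchscreen pointing tasks, the conventional measures are Fitts-style throughput and selection error rate. The sketch below is purely illustrative under that assumption: the trial values and the helper function are hypothetical and are not taken from the paper.

```python
import math
from statistics import mean

# Hypothetical per-trial records: (target distance D in px, target width W in px,
# movement time MT in seconds, whether the selection missed the target).
# Values are illustrative only; the paper's actual data are not given in the abstract.
trials = [
    (512, 64, 0.74, False),
    (512, 32, 0.91, True),
    (256, 64, 0.58, False),
    (256, 32, 0.77, False),
]

def index_of_difficulty(distance, width):
    """Shannon formulation of Fitts' index of difficulty, in bits."""
    return math.log2(distance / width + 1)

# Per-trial throughput (bits/s) and overall error rate: the speed and accuracy
# measures a comparison like the one described above would typically rely on.
throughputs = [index_of_difficulty(d, w) / mt for d, w, mt, _ in trials]
error_rate = mean(1.0 if missed else 0.0 for _, _, _, missed in trials)

print(f"mean throughput: {mean(throughputs):.2f} bits/s")
print(f"error rate: {error_rate:.0%}")
```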


Published in

CHI '17: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems
May 2017, 7138 pages
ISBN: 9781450346559
DOI: 10.1145/3025453

Copyright © 2017 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher: Association for Computing Machinery, New York, NY, United States


Acceptance Rates

CHI '17 Paper Acceptance Rate: 600 of 2,400 submissions, 25%
Overall Acceptance Rate: 6,199 of 26,314 submissions, 24%
