skip to main content
10.1145/1054972.1055028acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
Article

A method to standardize usability metrics into a single score

Published:02 April 2005Publication History

ABSTRACT

Current methods to represent system or task usability in a single metric do not include all the ANSI and ISO defined usability aspects: effectiveness, efficiency & satisfaction. We propose a method to simplify all the ANSI and ISO aspects of usability into a single, standardized and summated usability metric (SUM). In four data sets, totaling 1860 task observations, we show that these aspects of usability are correlated and equally weighted and present a quantitative model for usability. Using standardization techniques from Six Sigma, we propose a scalable process for standardizing disparate usability metrics and show how Principal Components Analysis can be used to establish appropriate weighting for a summated model. SUM provides one continuous variable for summative usability evaluations that can be used in regression analysis, hypothesis testing and usability reporting.

References

  1. Abran, A., Surya, W., Khelifi, A., Rilling, J., Seffah, A., Robert, F. (2003). Consolidating the ISO Usability Models. Paper presented at 11th annual International Software Quality Management Conference.Google ScholarGoogle Scholar
  2. ANSI (2001). Common industry format for usability test reports (ANSI-NCITS 354-2001). Washington, DC: American National Standards Institute.Google ScholarGoogle Scholar
  3. Babiker, E.M., Fujihara, H., Boyle, Craig. D. B. (1991). A metric for hypertext usability. In Proc. 11th Annual International Conference on Systems documentation, (pp.95--104). ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Breyfogle, F. (1999). Implementing Six Sigma: Smarter Solutions Using Statistical Methods. John Wiley and Sons.Google ScholarGoogle Scholar
  5. Brooke, J. (1996). SUS: A "quick and dirty" usability scale. In P. Jordan, B. Thomas, and B. Weerdmeester (Eds.), Usability Evaluation in Industry (pp.189--194). London: Taylor and Francis. See also http://www.cee.hw.ac.uk/~ph/sus.htmlGoogle ScholarGoogle Scholar
  6. Cattell, R. B. (1966). The scree test for the number of factors. Multivariate Behavioral Research, 1, 245--276.Google ScholarGoogle Scholar
  7. Chin, J. P., Diehl, V. A., and Norman, K. L. (1988). Development of an instrument measuring user satisfaction of the human-computer interface. Proc. CHI '88. (pp. 213--218). Washington, D.C.: ACM Press. See also http://www.lap.umd.edu/QUIS/index.html Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Cordes, R. E (1984). Application of Magnitude Estimation for Evaluating Software Ease of Use. In Gavriel Salvendy (Ed.) First USA-Japan Conference on Human Computer Interaction, Amsterdam: Elsevier Science Publishers.Google ScholarGoogle Scholar
  9. Dumas, J. S. (2003). User-based evaluations. In J. A. Jacko and A. Sears (Eds.), The Human-Computer Interaction Handbook (pp. 1093--1117). Mahwah, NJ: Lawrence Erlbaum. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Dunteman, George H, (1989) Principal Components Analysis. In Sage University Papers Series Quantitative Applications in the Social Sciences; No. 07-069 Newbury Park Sage Publications, Inc.Google ScholarGoogle Scholar
  11. Dumas, J., and Redish, J. C. (1999). A practical guide to usability testing. Portland, OR: Intellect. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Frøkjær, E., Hertzum, M., and Hornbæk, K. (2000) Measuring usability: are effectiveness, efficiency, and satisfaction really correlated? In Proc. CHI 2000, (pp.345--352). Washington, D.C.: ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Gliem, J. and Gliem, R. (2003). Calculating, Interpreting, and Reporting Cronbach's Alpha Reliability Coefficient for Likert-Type Scales. In 2003 Midwest Research to Practice Conference in Adult, Continuing and Community Education. Columbus, OH.Google ScholarGoogle Scholar
  14. Hair, Anderson, Tatham, Black (1998) Multivariate Data Analysis Fifth Edition. NJ: Prentice Hall. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Hagen, E. (Ed.) (1976). Human Reliability Analysis, control and instrumentation. Nuclear Safety. 17(3), 315--326.Google ScholarGoogle Scholar
  16. Hassenzahl, M. Sandweg, N. (2004). From Mental Effort to Perceived Usability: Transforming Experiences into Summary Assessments. In the Extended Abstracts of the 2004 conference on Human Factors and Computing Systems (pp 1283--1286). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Harry, M. J (1987). The Nature of Six Sigma Quality. Technical Report, Government Electronics Group, Motorola Inc. Scottsdale, AZ.Google ScholarGoogle Scholar
  18. ISO. (1998). Ergonomic requirements for office work with visual display terminals (VDTs) - Part 11: Guidance on usability (ISO 9241-11:1998(E)). Geneva, Switzerland: Author.Google ScholarGoogle Scholar
  19. Jolliffe, I. T. (1972). Discarding variables in a principal component analysis 1: Artificial data. Applied Statistics, 21, 160--173.Google ScholarGoogle ScholarCross RefCross Ref
  20. Jolliffe, Ian T.(2002). Principal Component Analysis. Secaucus, NJ, USA: Springer-Verlag.Google ScholarGoogle Scholar
  21. Kaiser, H. F. (1960). The application of electronic computers to factor analysis. Educational Psychology Measurement, 20, 141--151.Google ScholarGoogle ScholarCross RefCross Ref
  22. Kirakowski, J. (1996). The Software Usability Measurement Inventory: Background and usage. In P. Jordan, B. Thomas, and B. Weerdmeester (Eds.), Usability Evaluation in Industry (pp. 169--178). London, UK: Taylor and Francis. (Also, see http://www.ucc.ie/hfrg/questionnaires/sumi/index.html).Google ScholarGoogle Scholar
  23. Kirakowski, J., and Corbett, M. (1993). SUMI: The Software Usability Measurement Inventory. British Journal of Educational Technology, 24, 210--212.Google ScholarGoogle ScholarCross RefCross Ref
  24. Lewis, J. R (1982) "Testing Small System Customer Setup" in Proceedings of the Human Factors Society 26th Annual Meeting p. 718--720.Google ScholarGoogle Scholar
  25. Lewis, J (1991) A Rank-Based Method for the Usability Comparison of Competing Products. In Proceedings of the Human Factors and Ergonomics Society 35th Annual Meeting San Francisco California (pp1312--1316).Google ScholarGoogle Scholar
  26. Lewis, J. R. (1991). Psychometric evaluation of an after-scenario questionnaire for computer usability studies: The ASQ. SIGCHI Bulletin, 23, 78--81. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Lewis, J. R. (1992). Psychometric evaluation of the Post-Study System Usability Questionnaire: The PSSUQ. In Proceedings of the Human Factors Society 36th Annual Meeting (pp. 1259--1263). Atlanta, GA: Human Factors Society.Google ScholarGoogle ScholarCross RefCross Ref
  28. Lewis, J. R. (1993). IBM computer usability satisfaction questionnaires: Psychometric evaluation and instructions for use (Tech. Report 54.786). Boca Raton, FL: IBM Corp. http://drjim.0catch.com/usabqtr.pdfGoogle ScholarGoogle Scholar
  29. Lewis, J. R. (1995). IBM computer usability satisfaction questionnaires: Psychometric evaluation and instructions for use. International Journal of Human-Computer Interaction, 7, 57--78. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Lewis, J.R. (In Press). Handbook of Human Factors and Ergonomics 3rd Edition. Gavriel Salvendy Editor. John Wiley & Sons.Google ScholarGoogle Scholar
  31. Martin,P and Bateson, P (1993) Measuring Behaviour. (2nd Edition) Cambridge {England}; New York, NY, USA: Cambridge University Press.Google ScholarGoogle Scholar
  32. McGee, M. (2003). Usability magnitude estimation. Proc. HFES, 47th Annual Meeting, (691--695).Google ScholarGoogle ScholarCross RefCross Ref
  33. McGee, M (2004). Master usability scaling:magnitude estimation and master scaling applied to usability measurement. In Proc. CHI 2004, (pp 335--342). Washington, D.C.: ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. McIver, J. P., & Carmines, E. G. (1981). Unidimensional scaling. Thousand Oaks, CA: Sage.Google ScholarGoogle Scholar
  35. Molich, R., Ede, M., Kaasgaard, K., and Karyukin, B (2004). Comparative Usability Evaluation. Behaviour & Information Technology, 23(1), 65--74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Morrison, D. F. (1976). Multivariate statistical methods (2nd ed.). New York: McGraw-Hill.Google ScholarGoogle Scholar
  37. Nielsen, J (1994) Usability Engineering. San Francisco: Morgan Kaufman. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Nielsen, J. and Levy, J. (1994) Measuring Usability: Preference vs. Performance. Communications of the ACM, 37, p. 66--76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Nielsen, J. (1997). Usability testing. In G. Salvendy (Ed.), The Handbook of Human Factors and Ergonomics, (2rd Edition). John Wiley & Sons.Google ScholarGoogle Scholar
  40. Nunnally, J. C. (1978). Psychometric Theory. New York, NY: McGraw-Hill.Google ScholarGoogle Scholar
  41. Park, Kyung S. (1997). Human Error. In Gavriel Salvendy (Ed.), The Handbook of Human Factors and Ergonomics, (2rd Edition). John Wiley & Sons.Google ScholarGoogle Scholar
  42. Sauro, J. (2004) How long should a task take? Identifying Spec Limits for Task Times in Usability Tests. Retrieved September 13, 2004, from Measuring Usability Web site : http://measuringusability.com/time_specs.htmGoogle ScholarGoogle Scholar
  43. Sauro, J. (2004) How Do You Calculate a Z-Score? Retrieved September 13, 2004, from Measuring Usability Web site: http://measuringusability.com/z_calc.htmGoogle ScholarGoogle Scholar
  44. Sauro, J & Kindlund E. (In Press) Making Sense of Usability Metrics: Usability and Six Sigma, in Proceedings of the 14th Annual Conference of the Usability Professionals Association, Montreal, Canada.Google ScholarGoogle Scholar
  45. Stevens, S.S. (1975). Psychophysics: Introduction to its Perceptual, Neural, and Social Prospects. New York: John Wiley.Google ScholarGoogle Scholar
  46. Whiteside, J., Bennett, J. and Holtzblatt, K. (1988) "Usability Engineering: Our Experience and Evolution" in The Handbook of Human Computer Interaction Elsevier Science Publishers, Amsterdam, pp 791--817.Google ScholarGoogle Scholar

Index Terms

  1. A method to standardize usability metrics into a single score

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CHI '05: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
      April 2005
      928 pages
      ISBN:1581139985
      DOI:10.1145/1054972

      Copyright © 2005 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 2 April 2005

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      CHI '05 Paper Acceptance Rate93of372submissions,25%Overall Acceptance Rate6,199of26,314submissions,24%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader