skip to main content
10.1145/2487575.2487628acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
poster

Massively parallel expectation maximization using graphics processing units

Published:11 August 2013Publication History

ABSTRACT

Composed of several hundreds of processors, the Graphics Processing Unit (GPU) has become a very interesting platform for computationally demanding tasks on massive data. A special hierarchy of processors and fast memory units allow very powerful and efficient parallelization but also demands novel parallel algorithms. Expectation Maximization (EM) is a widely used technique for maximum likelihood estimation. In this paper, we propose an innovative EM clustering algorithm particularly suited for the GPU platform on NVIDIA's Fermi architecture. The central idea of our algorithm is to allow the parallel threads exchanging their local information in an asynchronous way and thus updating their cluster representatives on demand by a technique called Asynchronous Model Updates (Async-EM). Async-EM enables our algorithm not only to accelerate convergence but also to reduce the overhead induced by memory bandwidth limitations and synchronization requirements. We demonstrate (1) how to reformulate the EM algorithm to be able to exchange information using Async-EM and (2) how to exploit the special memory and processor architecture of a modern GPU in order to share this information among threads in an optimal way. As a perspective Async-EM is not limited to EM but can be applied to a variety of algorithms.

References

  1. X. Wu, V. Kumar, J. R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, G. J. McLachlan, A. Ng, B. Liu, P. S. Yu, and etal., "Top 10 algorithms in data mining," Knowledge and Information Systems, vol. 14, no. 1, pp. 1--37, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. P. Liang and D. Klein, "Online em for unsupervised models," in Proceedings of Human Language Technologies, ser. NAACL Stroudsburg, PA, USA: Association for Computational Linguistics, 2009, pp. 611--619. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. W. Kowalczyk and N. A. Vlassis, "Newscast em," in NIPS, 2004.Google ScholarGoogle Scholar
  4. A. Nikseresht and M. Gelgon, "Gossip-based computation of a gaussian mixture model for distributed multimedia indexing," IEEE Transactions on Multimedia, vol. 10, no. 3, pp. 385--392, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. T. Mensink, W. Zajdel, and B. Kröse, "Distributed em learning for multi-camera tracking," in IEEE/ACM International Conference on Distributed Smart Cameras, 2007.Google ScholarGoogle Scholar
  6. R. D. Nowak, "Distributed em algorithms for density estimation and clustering in sensor networks," IEEE Transactions on Signal Processing, vol. 51, no. 8, pp. 2245--2253, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. Gu, "Distributed em algorithm for gaussian mixtures in sensor networks," IEEE Transactions on Neural Networks, vol. 19, no. 7, pp. 1154--1166, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. D. Pangborn, "Scalable data clustering using gpus. master thesis, rochester institute of technology, 2010," http://apangborn.com/ms-thesis/#Download.Google ScholarGoogle Scholar
  9. N. S. L. P. Kumar, S. Satoor, and I. Buck, "Fast parallel expectation maximization for gaussian mixture models on gpus using cuda," in HPCC, 2009, pp. 103--109. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Harp, "Em of gmms with gpu acceleration with cuda," http://andrewharp.com/gmmcuda, lastly updated on 21/05/2009.Google ScholarGoogle Scholar
  11. J. L. Hennessy and D. A. Patterson, Computer Architecture A Quantitative Approach, 5th Edition. Morgan Kaufmann, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Plant and C. Böhm, "Inconco: interpretable clustering of numerical and categorical objects," in KDD, 2011, pp. 1127--1135. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Frank and A. Asuncion, "UCI machine learning repository," 2010. {Online}. Available: http://archive.ics.uci.edu/mlGoogle ScholarGoogle Scholar
  14. R. Neal and G. E. Hinton, "A view of the em algorithm that justifies incremental, sparse, and other variants," in Learning in Graphical Models. Kluwer Academic Publishers, 1998, pp. 355--368. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Massively parallel expectation maximization using graphics processing units

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
        August 2013
        1534 pages
        ISBN:9781450321747
        DOI:10.1145/2487575

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 11 August 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • poster

        Acceptance Rates

        KDD '13 Paper Acceptance Rate125of726submissions,17%Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader