skip to main content
10.1145/1180995.1181050acmconferencesArticle/Chapter ViewAbstractPublication Pagesicmi-mlmiConference Proceedingsconference-collections
Article

Tracking head pose and focus of attention with multiple far-field cameras

Authors Info & Claims
Published:02 November 2006Publication History

ABSTRACT

In this work we present our recent approach on estimating head orientations and foci of attention of multiple people in a smart room, which is equipped with several cameras to monitor the room. In our approach, we estimate each person's head orientation with respect to the room coordinate system by using all camera views. We implemented a Neural Network to estimate head pose on every single camera view, a Bayes filter is then applied to integrate every estimate into one final, joint hypothesis. Using this scheme, we can track peoples' horizontal head orientations in a full 360° range at almost all positions within the room. The tracked head orientations are then used to determine who is looking at whom, i.e. people's focus of attention. We report experimental results on one meeting video, that was recorded in the smart room.

References

  1. S. O. Ba and J.-M. Obodez. A probabilistic framework for joint head tracking and pose estimation. In Proceedings of the 17th International Conference on Pattern Recognition, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. H. Gee and R. Cipolla. Non-intrusive gaze tracking for human-computer interaction. In Proceedings of Mechatronics and Machine Vision in Practise, pages 112--117, 1994.Google ScholarGoogle Scholar
  3. T. Horprasert, Y. Yacoob, and L. S. Davis. Computing 3-d head orientation from a monocular image sequence. In Proceedings of the 2nd International Conference on Automatic Face and Gesture Recognition, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Katzenmaier, R. Stiefelhagen, T. Schultz, I. Rogina, and A. Waibel. Identifying the addressee in human-human-robot interactions based on head pose and speech. In International Conference on Multimodal Interfaces ICMI, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. R. Langton. The mutual influence of gaze and head orientation in the analysis of social attention direction. In The Quaterly Journal of Experimental Psychology, 53A(3):825--845, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  6. M. C. Michael Argyle. Gaze and Mutual Gaze. Cambridge University Press, 1976.Google ScholarGoogle Scholar
  7. K. Otsuka, Y. Takemae, J. Yamamoto, and H. Murase. A probabilistic inference of multiparty-conversation structure based on markov-switching models of gaze patterns, head directions and utterances. In Proceedings of the International Conference on Multimodal Interfaces - ICMI, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Pappu and P. Beardsley. A qualitative approach to classifying gaze direction. In Proceedings of FG98, pages 160--165, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. V. B. Stephen R. H. Langton, Roger J. Watt. Do the eyes have it? cues to the direction of social attention. In Trends in Cognitive Neuroscience, 4(2):50--58, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  10. R. Stiefelhagen. Tracking focus of attention in meetings. In IEEE International Conference on Multimodal Interfaces, pages 273--280, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Stiefelhagen, J. Yang, and A. Waibel. Simultaneous tracking of head poses in a panoramic view. In International Conference on Pattern Recognition, volume 3, pages 726--729, September 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. R. Stiefelhagen, J. Yang, and A. Waibel. Modeling focus of attention for meeting indexing based on multiple cues. IEEE Transactions on Neural Networks, 13(4), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Stiefelhagen and J. Zhu. Head orientation and gaze direction in meetings. In Conference on Human Factors in Computing Systems (CHI2002), Minneapolis, April 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y.-L. Tian, L. Brown, J. Connell, S. Pankanti, A. Hampapur, A. Senior, and R. Bolle. Absolute head pose estimation from overhead wide-angle cameras. In IEEE International Workshop on Analysis and Modeling of Faces and Gestures, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. K. van Turhnout, J. Terken, I. Bakx, and B. Eggen. Identifying the intended addressee in mixed human-human and human-computer interaction from non-verbal features. In Proceedings of the International Conference on Multimodal Interfaces - ICMI, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Vertegaal. Attentive user interfaces. Communications of the ACM, 46(3), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Voit, K. Nickel, and R. Stiefelhagen. Multi-view head pose estimation using neural networks. In Second Workshop on Face Processing in Video (FPiV'05), in Proceedings of Second Canadian Conference on Computer and Robot Vision. (CRV'05), 9-11 May 2005, Victoria, BC, Canada, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. L. Z. Zhang, G. Potamianos and T. Huang. Robust multi-view multi-camera face detection inside smart rooms using spatio-temporal dynamic programming. In Proceedings of Automatic Face and Gesture Recognition (FG), Southampton, United Kingdom, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Tracking head pose and focus of attention with multiple far-field cameras

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ICMI '06: Proceedings of the 8th international conference on Multimodal interfaces
        November 2006
        404 pages
        ISBN:159593541X
        DOI:10.1145/1180995

        Copyright © 2006 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 2 November 2006

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate453of1,080submissions,42%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader