DOI: 10.1145/1117309.1117352
Article

Perceptual attention focus prediction for multiple viewers in case of multimedia perceptual compression with feedback delay

Published: 27 March 2006

Abstract

Human eyes have limited perceptual capabilities: only 2 degrees of our 180-degree visual field provide the highest quality of perception. This fact gave rise to the idea of perceptual attention focus, in which visual content is altered so that only the part of the visual field where a viewer's attention is directed is encoded with high quality. Image quality in the periphery can be reduced without the viewer noticing. This compression approach allows a significant decrease in the bit-rate of a video stream and, in the case of 3D stream rendering, reduces the computational burden. Previous researchers have investigated real-time perceptual attention focus, but only for a single viewer. In this paper we investigate a dynamically changing multi-viewer scenario, in which a number of people watch the same visual content at the same time, each using eye-tracking equipment. The visual content (video, 3D stream) is sent through a network with a large transmission delay. The area of perceptual attention focus is predicted for the viewers to compensate for the delay and to identify the area of the image that requires the highest-quality coding.
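
As a rough illustration of the multi-viewer idea described in the abstract, the Python sketch below marks a pixel for high-quality coding when it lies within a delay-dependent radius of any viewer's last reported gaze position. It is not the predictor used in the paper: the names `Gaze`, `containment_radius`, `needs_high_quality`, and `high_quality_fraction`, the constant-speed bound on eye movement, and all numeric values are assumptions made only for illustration.

```python
from dataclasses import dataclass
from math import hypot


@dataclass
class Gaze:
    """Last gaze position reported by one viewer's eye tracker, in pixels."""
    x: float
    y: float


def containment_radius(delay_s: float, speed_px_per_s: float, fovea_px: float) -> float:
    """Radius assumed to still contain the gaze after `delay_s` seconds.

    Assumption: the gaze moves no faster than `speed_px_per_s`, so the
    high-quality area grows linearly with the feedback delay.
    """
    return fovea_px + speed_px_per_s * delay_s


def needs_high_quality(px: float, py: float, gazes: list, radius: float) -> bool:
    """True if pixel (px, py) lies inside any viewer's assumed focus area."""
    return any(hypot(px - g.x, py - g.y) <= radius for g in gazes)


def high_quality_fraction(width: int, height: int, gazes: list,
                          radius: float, step: int = 8) -> float:
    """Fraction of grid points (sampled every `step` pixels) needing high quality."""
    total = 0
    hits = 0
    for py in range(0, height, step):
        for px in range(0, width, step):
            total += 1
            hits += needs_high_quality(px, py, gazes, radius)
    return hits / total


if __name__ == "__main__":
    # Hypothetical values: two viewers, 0.5 s delay, 640x480 frame.
    viewers = [Gaze(100, 100), Gaze(310, 100)]
    r = containment_radius(delay_s=0.5, speed_px_per_s=200.0, fovea_px=40.0)
    print(f"radius: {r} px")
    print(f"high-quality share: {high_quality_fraction(640, 480, viewers, r):.2%}")
```

In this sketch, a longer delay or more viewers enlarges the union of focus areas, so the share of the frame that can be coded at reduced quality shrinks; a better gaze predictor would aim to keep that union small.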

      Published In

      ETRA '06: Proceedings of the 2006 symposium on Eye tracking research & applications
      March 2006
      175 pages
      ISBN:1595933050
      DOI:10.1145/1117309

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. compression
      2. media adaptation
      3. perceptual attention prediction

      Qualifiers

      • Article

      Conference

      ETRA06: Eye Tracking Research and Applications
      March 27 - 29, 2006
      San Diego, California

      Acceptance Rates

      Overall Acceptance Rate 69 of 137 submissions, 50%

      Cited By

      • (2022) Event Construal in Text and Image. Gaze Behavior and Discourse Interpretations. RUDN Journal of Language Studies, Semiotics and Semantics 13:2 (396-415). DOI: 10.22363/2313-2299-2022-13-2-396-415. Online publication date: 14-Jul-2022
      • (2020) A Survey of Digital Eye Strain in Gaze-Based Interactive Systems. ACM Symposium on Eye Tracking Research and Applications (1-12). DOI: 10.1145/3379155.3391313. Online publication date: 2-Jun-2020
      • (2013) Predicting Primary Gaze Behavior Using Social Saliency Fields. 2013 IEEE International Conference on Computer Vision (3503-3510). DOI: 10.1109/ICCV.2013.435. Online publication date: Dec-2013
      • (2013) Visual Attention and Gaze Behavior in Games: An Object-Based Approach. Game Analytics (543-583). DOI: 10.1007/978-1-4471-4769-5_25. Online publication date: 6-Mar-2013
      • (2012) Evaluating Eye Tracking Systems for Computer Input. Gaze Interaction and Applications of Eye Tracking (205-225). DOI: 10.4018/978-1-61350-098-9.ch015. Online publication date: 2012
      • (2010) An empirical pipeline to derive gaze prediction heuristics for 3D action games. ACM Transactions on Applied Perception 8:1 (1-30). DOI: 10.1145/1857893.1857897. Online publication date: 10-Nov-2010
      • (2009) Gaze-contingent video compression with targeted gaze containment performance. Journal of Electronic Imaging 18:3 (033001). DOI: 10.1117/1.3158609. Online publication date: 1-Jul-2009
      • (2008) Predictive real-time perceptual compression based on eye-gaze-position analysis. ACM Transactions on Multimedia Computing, Communications, and Applications 4:3 (1-16). DOI: 10.1145/1386109.1386116. Online publication date: 18-Sep-2008
