skip to main content
10.1145/1101149.1101289acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

An object-based video coding framework for video sequences obtained from static cameras

Published:06 November 2005Publication History

ABSTRACT

This paper presents a novel object-based video coding framework for videos obtained from a static camera. As opposed to most existing methods, the proposed method does not require explicit 2D or 3D models of objects and hence is general enough to cater for varying types of objects in the scene. The proposed system detects and tracks objects in the scene and learns the appearance model of each object online using incremental principal component analysis (IPCA). Each object is then coded using the coefficients of the most significant principal components of its learned appearance space. Due to smooth transitions between limited number of poses of an object, usually a limited number of significant principal components contribute to most of the variance in the object's appearance space and therefore only a small number of coefficients are required to code the object. The rigid component of the object's motion is coded in terms of its affine parameters. The framework is applied to compressing videos in surveillance and video phone domains. The proposed method is evaluated on videos containing a variety of scenarios such as multiple objects undergoing occlusion, splitting, merging, entering and exiting, as well as a changing background. Results on standard MPEG-7 videos are also presented. For all the videos, the proposed method displays higher Peak Signal to Noise Ratio (PSNR) compared to MPEG-2 and MPEG-4 methods, and provides comparable or better compression.

References

  1. Y. Altunbasak and A. M. Tekalp, "Occlusion-adaptive content-based 2-D mesh design and tracking for object-based coding," In IEEE Transactions on Image Processing, vol. 6, no. 9, pp.1270--1280, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. K. Aizawa and T. Huang, "Model Based Image Coding: Advanced Video Coding Techniques for Very Low Bit-Rate Applications," In Proceedings of the IEEE, vol. 83, no. 2, pp.259--271, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  3. J. Bergen, P. Anandan, K. Hanna, and R. Hingorani. "Hierarchical model-based motion estimation". In European Conference on Computer Vision, pp.237--252, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Buck, "Segmentation of moving head-and-shoulder shapes," In Picture Coding Symposium, Boston, 1990.Google ScholarGoogle Scholar
  5. C. S. Choi and T. Tekebe, "Analysis and synthesis of facial image sequences in model-based image coding," In IEEE Transactions on Video Technology, pp.257--275, 1994.Google ScholarGoogle Scholar
  6. D. DeCarlo, D. Metaxas, and M.Stone, "An Anthropometric Face Model using Variational Techniques," In Proc. SIGGRAPH, pp.67--74, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. O. Javed, K. Shafique, and M. Shah. "A hierarchical approach to robust background subtraction using color and gradient Information". In Workshop on Motion and Video Computing, pp.22--27, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. O. Javed, Z. Rasheed, O. Alatas, and M. Shah, "KNIGHTM:A Real Time Surveillance System for Multiple Overlapping and Non-Overlapping Cameras". In The Fourth International Conference on Multimedia and Expo, Baltimore, Maryland, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. O. Javed, and M. Shah. "Tracking and Object Classification for Automated Surveillance". In European Conference on Computer Vision, pp.343--357, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. I. T. Jolliffe. "Principal component analysis". New York: Springer-Verlag, 1986.Google ScholarGoogle Scholar
  11. R. Koch, "Dynamic 3D scene analysis through synthesis feedback control", In IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15(6), June, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. I. Martins and L. Corte-Real, "A Video-coder using 3D model-based background for video surveillance applications,". In IEEE Internation Conference on Image Processing, pp.919--923, 1998.Google ScholarGoogle Scholar
  13. G. Menegaz and J. Thiran, "Lossy to lossless object-based coding of 3D MRI data". In IEEE Transactions on Image Processing, Vol.11(9), pp.1053--1061, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y. Nakaya, K. Aizawa, and H. Harashima, "Texture updating methods in model-based coding of facial images,". In Picture Coding Symposium, Boston, 1990.Google ScholarGoogle Scholar
  15. C. Staffer, and W. E. L. Grimson. "Learning patterns of actvity using real-time tracking." In IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.22(8), pp.747--757, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. G. Strang. "Linear algebra and its applications". New York: Academic, 1980.Google ScholarGoogle Scholar
  17. M. G. Strinzis, "Object-based coding of stereospic and 3D image sequences,". In IEEE Signal Processing Magazine, pp.14--28, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  18. M. J. Turk, and A. Pentland. "Eigenfaces for recognition". In Workshop on Human Computer Interaction, Vol.3, pp.71--86, 1991.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Vetro, T Haga, K. Sumi, and H. Sun, "Object based coding for long term archive of surveillance video". In Technical Report, TR-2003-98, MERL, July 2003.Google ScholarGoogle Scholar
  20. W. J. Welsh "Model-based coding of videophone images," In Electronic and Communication Engineering Journal, Vol.3(1), pp.29--36, 1991.Google ScholarGoogle ScholarCross RefCross Ref
  21. J. Weng, Y. Zhang, and W. Hwang. "Candid covariance-free incremental principal component analysis". In IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.25(8), pp.1034--1040, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Y. Zhang, and J. Weng. "Convergence analysis of complementary candid incremental principal component analysis". Technical Report MSU-CSE-01-23, Michigan State University, East Lansing, 2001.Google ScholarGoogle Scholar
  23. A. Yilmaz, X. Li, and M. Shah. "Contour-based object tracking with occlusion handling in video acquired using mobile cameras". In IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.26(11), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. L. Yuille, P. W. Hallina, and D. S. Cohen, "Feature extraction from faces using deformable templates," In International Journal of Computer Vision, Vol.8(2), pp.99--111, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. "CAVIAR: Context Aware Vision using Image-based Active Recognition", Downloadable from http://homepages.inf.ed.ac.uk/rbf/CAVIAR/.Google ScholarGoogle Scholar
  26. "PETS: Performance Evaluation of Tracking and Surveillance", Downloadable from http://pets2002.visualsurveillance.org/.Google ScholarGoogle Scholar

Index Terms

  1. An object-based video coding framework for video sequences obtained from static cameras

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          MULTIMEDIA '05: Proceedings of the 13th annual ACM international conference on Multimedia
          November 2005
          1110 pages
          ISBN:1595930442
          DOI:10.1145/1101149

          Copyright © 2005 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 6 November 2005

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          MULTIMEDIA '05 Paper Acceptance Rate49of312submissions,16%Overall Acceptance Rate995of4,171submissions,24%

          Upcoming Conference

          MM '24
          MM '24: The 32nd ACM International Conference on Multimedia
          October 28 - November 1, 2024
          Melbourne , VIC , Australia

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader