skip to main content
10.1145/1873951.1873992acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Learning to photograph

Published:25 October 2010Publication History

ABSTRACT

In this paper, we propose an intelligent photography system, which automatically and professionally generates/recommends user-favorite photo(s) from a wide view or a continuous view sequence. This task is quite challenging given that the evaluation of photo quality is under-determined and usually subjective. Motivated by the recent prevalence of online media, we present a solution y mining the underlying knowledge and experience of the photographers from massively crawled professional photos (about 100,000 images, which are highly ranked by users) of those popular photo sharing websites, e.g. Flickr.com. Generally far contexts are critical in characterizing the composition rules for professional photos, and thus we present a method called omni-range context modeling to learn the patch/object spatial correlation distribution for the concurrent patch/object pair of arbitrary distance. The learned photo omni-range context priors then serve as rules to guide the composition of professional photos. When a wide view is fed into the system, these priors are utilized together with other cues (e.g., placements of faces at different poses, patch number, etc) to form a posterior probability formulation for professional sub-view finding. Moreover, this system can function as intelligent professionalview guider based on real-time view quality assessment and the embedded compass (for recording capture direction). Beyond the salient areas targeted by most existing view recommendation algorithms, the proposed system targets at professional photo composition. Qualitative experiments as well as comprehensive user studies well demonstrate the validity and efficiency of the proposed omnirange context learning method as well as the automatic view finding framework.

References

  1. http://www.portraitprofessional.com/.Google ScholarGoogle Scholar
  2. S. Avidan and A. Shamir. Seam carving for content-aware image resizing. ACM Transactions on Graphics, 26(3):1--9, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Banerjee and B. L. Evans. In-camera automation of photographic composition rules. IEEE Transactions on Image Processing, 16:1807--1820, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Birchfield and S. Rangarajan. Spatiograms versus histograms for region-based tracking. In CVPR, volume 2, pages 1158--1163, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, pages 886--893, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Dance, J. Willamowski, L. Fan, C. Bray, and G. Csurka. Visual categorization with bags of keypoints. In ECCV International Workshop on Statistical Learning in Computer Vision, 2004.Google ScholarGoogle Scholar
  7. A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, Series B, 39(1):1--38, 1977.Google ScholarGoogle Scholar
  8. P. Felzenszwalb and D. Huttenlocher. Efficient graph-based image segmentation. International Journal of Computer Vision, 59(2):167--181, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. W. Gilks, S. Richardson, and D. Spiegelhalter. Markov Chain Monte Carlo in Practice: Interdisciplinary Statistics. Chapman and Hall/CRC, 1996.Google ScholarGoogle Scholar
  10. Y. Guo, F. Liu, J. Shi, Z.-H. Zhou, and M. Gleicher. Image retargeting using mesh parametrization. IEEE Transactions on Multimedia, 11(5):856--867, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Haralick, K. Shanmugam, and I. Dinstein. Textural features for image classification. IEEE Transactions on Systems, Man, and Cybernetics, (6):610--621, 1973.Google ScholarGoogle ScholarCross RefCross Ref
  12. X. Hou and L. Zhang. Saliency detection: A spectral residual approach. In CVPR, pages 1--8, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  13. S. C. Johnson. Hierarchical clustering schemes. Psychometrika, 32(3):2410--254, 1967.Google ScholarGoogle ScholarCross RefCross Ref
  14. Y. Ke, X. Tang, and F. Jing. The design of high-level features for photo quality assessment. In CVPR, pages 419--426, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Li, W. Wu, T. Wang, and Y. Zhang. One step beyond histogram: Image representation using markov stationary features. In CVPR, pages 1--8.Google ScholarGoogle Scholar
  16. Y. Luo and X. Tang. Photo and video quality evaluation: Focusing on the subject. In ECCV, pages 386--399, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T. Mei, X.-S. Hua, H.-Q. Zhou, and S. Li. Modeling and mining of userscapture intention for home videos. IEEE Transactions on Multimedia, 9(1):66--77, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. B. Ni, S. Yan, and A. Kassim. Contextualizing histogram. In CVPR, pages 1682--1689, 2009.Google ScholarGoogle Scholar
  19. Y. Rui and T. Huang. A novel relevance feedback technique in image retrieval. In ACM MM, pages 67--70, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. K. Sande, T. Gevers, and C. Snoek. Evaluating color descriptors for object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (in press), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. V. Setlur, S. Takagi, R. Raskar, M. Gleicher, and B. Gooch. Automatic image retargeting. In ACM International Conference on Mobile and Ubiquitous Multimedia, pages 59--68, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Sheikh, A. Bovik, and G. Veciana. An information fidelity criterion for image quality assessment using natural scene statistics. IEEE Transactions on Image Processing, 14:2117--2128, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. Stricker and M. Orengo. Similarity of color images. In Storage and Retrieval of Image and Video Databases III, volume 2, pages 381--392, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  24. X. Sun, H. Yao, R. Ji, and S. Liu. Photo assessment based on computational visual attention model. In ACM MM, pages 541--544, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. P. Viola and M. Jones. Robust real-time object detection. International Journal of Computer Vision, (2):137--154, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Y. Zheng, M. Zhao, S.-Y. Neo, T.-S. Chua, and Q. Tian. Visual synset: towards a higher-level visual representation. In CVPR, 2008.Google ScholarGoogle Scholar

Index Terms

  1. Learning to photograph

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '10: Proceedings of the 18th ACM international conference on Multimedia
      October 2010
      1836 pages
      ISBN:9781605589336
      DOI:10.1145/1873951

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 October 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader