research-article

Learning to photograph

Authors:
Bin Cheng

NUS, Singapore, Singapore

NUS, Singapore, Singapore
View Profile

,
Bingbing Ni

NUS, Singapore, Singapore

NUS, Singapore, Singapore
View Profile

,
Shuicheng Yan

NUS, Singapore, Singapore

NUS, Singapore, Singapore
View Profile

,
Qi Tian

UTSA, San Antonio, TX, USA

UTSA, San Antonio, TX, USA
View Profile

MM '10: Proceedings of the 18th ACM international conference on MultimediaOctober 2010Pages 291–300https://doi.org/10.1145/1873951.1873992

Published:25 October 2010Publication History

MM '10: Proceedings of the 18th ACM international conference on Multimedia

Pages 291–300

ABSTRACT

In this paper, we propose an intelligent photography system, which automatically and professionally generates/recommends user-favorite photo(s) from a wide view or a continuous view sequence. This task is quite challenging given that the evaluation of photo quality is under-determined and usually subjective. Motivated by the recent prevalence of online media, we present a solution y mining the underlying knowledge and experience of the photographers from massively crawled professional photos (about 100,000 images, which are highly ranked by users) of those popular photo sharing websites, e.g. Flickr.com. Generally far contexts are critical in characterizing the composition rules for professional photos, and thus we present a method called omni-range context modeling to learn the patch/object spatial correlation distribution for the concurrent patch/object pair of arbitrary distance. The learned photo omni-range context priors then serve as rules to guide the composition of professional photos. When a wide view is fed into the system, these priors are utilized together with other cues (e.g., placements of faces at different poses, patch number, etc) to form a posterior probability formulation for professional sub-view finding. Moreover, this system can function as intelligent professionalview guider based on real-time view quality assessment and the embedded compass (for recording capture direction). Beyond the salient areas targeted by most existing view recommendation algorithms, the proposed system targets at professional photo composition. Qualitative experiments as well as comprehensive user studies well demonstrate the validity and efficiency of the proposed omnirange context learning method as well as the automatic view finding framework.

References

http://www.portraitprofessional.com/.Google Scholar
S. Avidan and A. Shamir. Seam carving for content-aware image resizing. ACM Transactions on Graphics, 26(3):1--9, 2007. Google ScholarDigital Library
S. Banerjee and B. L. Evans. In-camera automation of photographic composition rules. IEEE Transactions on Image Processing, 16:1807--1820, 2007. Google ScholarDigital Library
S. Birchfield and S. Rangarajan. Spatiograms versus histograms for region-based tracking. In CVPR, volume 2, pages 1158--1163, 2005. Google ScholarDigital Library
N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, pages 886--893, 2005. Google ScholarDigital Library
C. Dance, J. Willamowski, L. Fan, C. Bray, and G. Csurka. Visual categorization with bags of keypoints. In ECCV International Workshop on Statistical Learning in Computer Vision, 2004.Google Scholar
A. Dempster, N. Laird, and D. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, Series B, 39(1):1--38, 1977.Google Scholar
P. Felzenszwalb and D. Huttenlocher. Efficient graph-based image segmentation. International Journal of Computer Vision, 59(2):167--181, 2004. Google ScholarDigital Library
W. Gilks, S. Richardson, and D. Spiegelhalter. Markov Chain Monte Carlo in Practice: Interdisciplinary Statistics. Chapman and Hall/CRC, 1996.Google Scholar
Y. Guo, F. Liu, J. Shi, Z.-H. Zhou, and M. Gleicher. Image retargeting using mesh parametrization. IEEE Transactions on Multimedia, 11(5):856--867, 2009. Google ScholarDigital Library
R. Haralick, K. Shanmugam, and I. Dinstein. Textural features for image classification. IEEE Transactions on Systems, Man, and Cybernetics, (6):610--621, 1973.Google ScholarCross Ref
X. Hou and L. Zhang. Saliency detection: A spectral residual approach. In CVPR, pages 1--8, 2007.Google ScholarCross Ref
S. C. Johnson. Hierarchical clustering schemes. Psychometrika, 32(3):2410--254, 1967.Google ScholarCross Ref
Y. Ke, X. Tang, and F. Jing. The design of high-level features for photo quality assessment. In CVPR, pages 419--426, 2006. Google ScholarDigital Library
J. Li, W. Wu, T. Wang, and Y. Zhang. One step beyond histogram: Image representation using markov stationary features. In CVPR, pages 1--8.Google Scholar
Y. Luo and X. Tang. Photo and video quality evaluation: Focusing on the subject. In ECCV, pages 386--399, 2008. Google ScholarDigital Library
T. Mei, X.-S. Hua, H.-Q. Zhou, and S. Li. Modeling and mining of userscapture intention for home videos. IEEE Transactions on Multimedia, 9(1):66--77, 2007. Google ScholarDigital Library
B. Ni, S. Yan, and A. Kassim. Contextualizing histogram. In CVPR, pages 1682--1689, 2009.Google Scholar
Y. Rui and T. Huang. A novel relevance feedback technique in image retrieval. In ACM MM, pages 67--70, 1999. Google ScholarDigital Library
K. Sande, T. Gevers, and C. Snoek. Evaluating color descriptors for object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (in press), 2009. Google ScholarDigital Library
V. Setlur, S. Takagi, R. Raskar, M. Gleicher, and B. Gooch. Automatic image retargeting. In ACM International Conference on Mobile and Ubiquitous Multimedia, pages 59--68, 2005. Google ScholarDigital Library
H. Sheikh, A. Bovik, and G. Veciana. An information fidelity criterion for image quality assessment using natural scene statistics. IEEE Transactions on Image Processing, 14:2117--2128, 2005. Google ScholarDigital Library
M. Stricker and M. Orengo. Similarity of color images. In Storage and Retrieval of Image and Video Databases III, volume 2, pages 381--392, 1995.Google ScholarCross Ref
X. Sun, H. Yao, R. Ji, and S. Liu. Photo assessment based on computational visual attention model. In ACM MM, pages 541--544, 2009. Google ScholarDigital Library
P. Viola and M. Jones. Robust real-time object detection. International Journal of Computer Vision, (2):137--154, 2004. Google ScholarDigital Library
Y. Zheng, M. Zhao, S.-Y. Neo, T.-S. Chua, and Q. Tian. Visual synset: towards a higher-level visual representation. In CVPR, 2008.Google Scholar

Index Terms

Learning to photograph
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks

Recommendations

StoryTags: once upon a time, there was a photo
CHI EA '09: CHI '09 Extended Abstracts on Human Factors in Computing Systems

With the growing volume of digital information users must deal with, management and retrieval tasks have become increasingly problematic. A popular way to help users organize their information is tagging, as is the case in web sites such as flickr, ...
Read More
Image space: an empirical study of geotagged mobile media content capture and sharing
MindTrek '12: Proceeding of the 16th International Academic MindTrek Conference

In the past few years, there has been a rapid increase in the everyday usage of cameraphones and image sharing services. The existing services offer means to store, tag and share photos, but they offer only limited means to geotag and offer meaningful ...
Read More
AutoAlbum: Clustering Digital Photographs using Probabilistic Model Merging
CBAIVL '00: Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'00)

Consumers need help finding digital photographs in their personal collections. AutoAlbum helps users find their photos by automatically clustering photos into albums. The albums are presented in an easy-to-use browsing user interface. AutoAlbum uses the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '10: Proceedings of the 18th ACM international conference on Multimedia
October 2010
1836 pages
ISBN:9781605589336
DOI:10.1145/1873951
General Chairs:
Alberto del Bimbo
University of Florence, Italy
,
Shih-Fu Chang
Columbia University, USA
,
Program Chair:
Arnold Smeulders
University of Amsterdam, NL
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 25 October 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
automatic view finding
digital photography
omni-range context
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate995of4,171submissions,24%
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 99
  Total Citations
  View Citations
- 880
  Total Downloads
- Downloads (Last 12 months)25
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Learning to photograph

MM '10: Proceedings of the 18th ACM international conference on Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

StoryTags: once upon a time, there was a photo

Image space: an empirical study of geotagged mobile media content capture and sharing

AutoAlbum: Clustering Digital Photographs using Probabilistic Model Merging