skip to main content
10.1145/1124772.1124886acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
Article

Gaze-based interaction for semi-automatic photo cropping

Published: 22 April 2006 Publication History

Abstract

We present an interactive method for cropping photographs given minimal information about important content location, provided by eye tracking. Cropping is formulated in a general optimization framework that facilitates adding new composition rules, and adapting the system to particular applications. Our system uses fixation data</ to identify important image content and compute the best crop for any given aspect ratio or size, enabling applications such as automatic snapshot recomposition, adaptive documents, and thumbnailing. We validate our approach with studies in which users compare our crops to ones produced by hand and by a completely automatic approach. Experiments show that viewers prefer our gaze-based crops to uncropped images and fully automatic crops.

References

[1]
Arnheim, R. Art and Visual Perception. University of California Press, 1974.
[2]
Arnheim, R. The Power of the Center. University of California Press, 1988.
[3]
Banerjee, S. Composition Guided Image Aquisition. PhD thesis, University of Texas at Austin, 2004.
[4]
Byers, Z., Dixon, M., Smart, W. D., and Grimm, C. M. Say cheese!: Experiences with a robot photographer. Proceedings of the Fifteenth Innovative Applications of Artificial Intelligence Conference (IAAI-03), Acapulco, Mexico.
[5]
Chen, L., Xie, X., Fan, X., Ma, W., Shang, H., and Zhou, H. A visual attention mode for adapting images on small displays. MSR-TR-2002-125 Microsoft Research, Redmond, WA (2002).
[6]
Christoudias, C., Georgescu, B., and Meer, P. Synergism in low level vision. Proceedings ICPR 2002.
[7]
Crow, F. Summed-area tables for texture mapping. Siggraph '84. 207--2 12.
[8]
David, H. A. The method of paired comparisons. Charles Griffin and Company, London, 1969.
[9]
DeCarlo, D., and Santella, A. Stylization and abstraction of photographs. Proceedings of ACM SIGGRAPH 2002. 769--776.
[10]
Duchowski, A. Acuity-matching resolution degradation through wavelet coefficient scaling. IEEE Trans. on Image Processing 9, 8 (2000), 1437--1440.
[11]
Foley, J., van Dam, A., Feiner, S., and Hughes, J. Computer Graphics: Principles and Practice, 2nd edition. Addison Wesley, 1997.
[12]
Gooch, B., Reinhard, E., Moulding, C., and Shirley, P. Artistic composition for image creation. Proceedings of the 12th Eurographics workshop on Rendering Technique. 83--88.
[13]
Graham, D. Composing Pictures. Van Nostrand Reinhold, 1970.
[14]
Grill, T., and Scanlon, M. Photographic Composition Guidelines for Total image control through effective design. AMPHOTO, 1988.
[15]
Itti, L., and Koch, C. A saliency-based search mechanism for overt and covert shifts of visual attention. Vision Research 40 (2000), 1489--1506.
[16]
Itti, L., Koch, C., and Niebur, E. A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 20 (1998), 1254--1259.
[17]
Jacob, R. J. Eye-movement-based human-computer interaction techniques: Toward non-command interfaces. 151--190.
[18]
Jacobs, C., Li, W., Schrier, E., Bargeron, D., and Salesin, D. Adaptive grid-based document layout. ACM Trans. Graph. 22, 3 (2003), 838--847.
[19]
Kendall, M. G. On the method of paired comparisons. Biometrika 31 (1940), 324 --345.
[20]
Li, S. Z., Zhu, L., Zhang, Z. Q., Blake, A., Zhang, H. J., and Shum, H. Statistical learning of multi-view face detection. Proceedings 7th European Conference on Computer Vision(ECCV 2002) 4 (2002), 67 --81.
[21]
Li, Y., Sun, J., Tang, C.-K., and Shum, H.-Y. Lazy snapping. ACM Trans. Graph. 23, 3 (2004), 303--308.
[22]
Locher, P. J. The contribution of eye-movement research to an understanding of the nature of pictorial balance perception: a review of the literature. Empirical Studies of the Arts 14, 2 (1996), 146--163.
[23]
Locher, P. J., Stappers, P. J., and Overbeeke, K. The role of balance as an organizing design principle underlying adults' compositional strategies for creating visual displays. Acta Psychologica 99 (1998), 141--161.
[24]
Locher, P. J., Stappers, P. J., and Overbeeke, K. An empirical evaluation of the visual rightness theory of pictorial composition. Acta Psychologica 103 (1999), 261--280.
[25]
Lok, S., Feiner, S., and Ngai, G. Evaluation of visual balance for automated layout. Proceedings of the 9th international conference on Intelligent user interface. 101--106.
[26]
McManus, I., Edmondson, D., and Rodgers, J. Balance in pictures. British Journal of Psychology 76 (1985), 73--94.
[27]
Peterson, B. Learning to see Creatively: How to compose great photographs. AMPHOTO, 1988.
[28]
Setlur, V., Takagi, S., Raskar, R., Gleicher, M., and Gooch, B. Automatic image retargeting. ACM SIGGRAPH Technical Sketch.
[29]
Suh, B., Ling, H., Bederson, B. B., and Jacobs, D. W. Automatic thumbnail cropping and it's effectivness. ACM Conference on User Interface and Software Technolgy (UIST 2003) (2003), 95--104.
[30]
Vertegaal, R. The gaze groupware system: Mediating joint attention in mutiparty communication and collaboration. Proceedings CHI '99. 294--301.
[31]
Vertegaal, R. Designing attentive interfaces. Proceedings of the Eye Tracking Research and Applications (ETRA) Symposium 2002. 23--30.

Cited By

View all
  • (2024)Gaze-Guided Graph Neural Network for Action Anticipation Conditioned on IntentionProceedings of the 2024 Symposium on Eye Tracking Research and Applications10.1145/3649902.3653340(1-9)Online publication date: 4-Jun-2024
  • (2024)Rectify ViT Shortcut Learning by Visual SaliencyIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.331053135:12(18013-18025)Online publication date: Dec-2024
  • (2024)Pseudo Label Fusion With Uncertainty Estimation for Semi-Supervised Cropping Box RegressionIEEE Transactions on Multimedia10.1109/TMM.2024.337712526(8157-8171)Online publication date: 13-Mar-2024
  • Show More Cited By

Index Terms

  1. Gaze-based interaction for semi-automatic photo cropping

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CHI '06: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
    April 2006
    1353 pages
    ISBN:1595933727
    DOI:10.1145/1124772
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 April 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. composition
    2. cropping
    3. evaluation
    4. eye tracking
    5. photography
    6. visual perception

    Qualifiers

    • Article

    Conference

    CHI06
    Sponsor:
    CHI06: CHI 2006 Conference on Human Factors in Computing Systems
    April 22 - 27, 2006
    Québec, Montréal, Canada

    Acceptance Rates

    Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

    Upcoming Conference

    CHI 2025
    ACM CHI Conference on Human Factors in Computing Systems
    April 26 - May 1, 2025
    Yokohama , Japan

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)34
    • Downloads (Last 6 weeks)9
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Gaze-Guided Graph Neural Network for Action Anticipation Conditioned on IntentionProceedings of the 2024 Symposium on Eye Tracking Research and Applications10.1145/3649902.3653340(1-9)Online publication date: 4-Jun-2024
    • (2024)Rectify ViT Shortcut Learning by Visual SaliencyIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.331053135:12(18013-18025)Online publication date: Dec-2024
    • (2024)Pseudo Label Fusion With Uncertainty Estimation for Semi-Supervised Cropping Box RegressionIEEE Transactions on Multimedia10.1109/TMM.2024.337712526(8157-8171)Online publication date: 13-Mar-2024
    • (2024)Supervised Deep Learning for Ideal Identification of Image Retargeting TechniquesIEEE Access10.1109/ACCESS.2024.3510675(1-1)Online publication date: 2024
    • (2024)Supervised deep learning for content-aware image retargeting with Fourier ConvolutionsMultimedia Tools and Applications10.1007/s11042-024-18876-883:36(83611-83627)Online publication date: 19-Mar-2024
    • (2024)Deep learning-based importance map for content-aware media retargetingMultimedia Tools and Applications10.1007/s11042-024-18389-483:30(74301-74322)Online publication date: 15-Feb-2024
    • (2024)A new content-aware image resizing based on Rényi entropy and deep learningNeural Computing and Applications10.1007/s00521-024-09517-036:15(8885-8899)Online publication date: 30-Mar-2024
    • (2024)Tunable and real‐time automatic interventional x‐ray collimation from semi‐supervised deep feature extractionMedical Physics10.1002/mp.17522Online publication date: 6-Dec-2024
    • (2023)Seam Carving for Content-Aware Image ResizingSeminal Graphics Papers: Pushing the Boundaries, Volume 210.1145/3596711.3596776(609-617)Online publication date: 2-Aug-2023
    • (2023)Image Cropping under Design ConstraintsProceedings of the 5th ACM International Conference on Multimedia in Asia10.1145/3595916.3626412(1-7)Online publication date: 6-Dec-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media