Article

Image annotation: which approach for realistic databases?

Authors: Nicolas Hervé, Nozha Boujemaa

CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval

Pages 170 - 177

https://doi.org/10.1145/1282280.1282310

Published: 09 July 2007

Abstract

This paper describes an efficient approach to image annotation that ranked first in the recent scene categorization track of the ImagEVAL benchmark. We show that homogeneous global image descriptors combined with a pool of Support Vector Machines achieve very good results. We also apply this approach to several well-known object recognition databases to emphasize two main aspects of this research domain: the importance of contextual information in object recognition and the unsuitability of many standard databases for this task.
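The abstract does not spell out the descriptors or how the SVM pool is organized, so the following is only a minimal sketch of the general idea under stated assumptions. It treats a normalized intensity histogram as a stand-in global descriptor, reads "pool" as one binary classifier per label (one-vs-rest), and fits each linear SVM with plain subgradient descent rather than a dedicated SVM library. All function names, parameters, and the toy data are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: descriptor, trainer, names, and data are assumptions.

def global_histogram(pixels, bins=3):
    """Global descriptor: normalized intensity histogram of a whole image.

    `pixels` is a flat list of intensities in [0, 1].
    """
    hist = [0.0] * bins
    for p in pixels:
        hist[min(int(p * bins), bins - 1)] += 1.0
    return [h / len(pixels) for h in hist]


def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Linear SVM (hinge loss + L2 penalty) fit by subgradient descent.

    Labels in `y` must be +1 or -1. Returns (weights, bias).
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi)) + b
            # The L2 penalty shrinks the weights at every step.
            w = [wj * (1.0 - lr * lam) for wj in w]
            if yi * score < 1.0:  # inside the margin: hinge loss is active
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b


def train_pool(X, labels):
    """Train one binary SVM per label (one-vs-rest)."""
    pool = {}
    for label in sorted(set(labels)):
        y = [1 if l == label else -1 for l in labels]
        pool[label] = train_linear_svm(X, y)
    return pool


def annotate(pool, x):
    """Return the label whose SVM gives the highest decision score."""
    def score(model):
        w, b = model
        return sum(wj * xj for wj, xj in zip(w, x)) + b
    return max(pool, key=lambda label: score(pool[label]))


# Toy "images": flat lists of intensities, grouped by a hypothetical scene label.
train_images = {
    "night": [[0.05, 0.10, 0.15, 0.20], [0.02, 0.12, 0.25, 0.30]],
    "dusk": [[0.40, 0.45, 0.50, 0.60], [0.35, 0.42, 0.55, 0.65]],
    "day": [[0.80, 0.85, 0.90, 0.95], [0.70, 0.75, 0.88, 0.99]],
}
X, labels = [], []
for label, images in train_images.items():
    for image in images:
        X.append(global_histogram(image))
        labels.append(label)

pool = train_pool(X, labels)
print(annotate(pool, global_histogram([0.9, 0.9, 0.8, 0.1])))
```

Because the descriptor summarizes the whole image rather than a segmented object, a classifier of this kind can only exploit global appearance, which is one way to see why context and database composition matter when such methods are evaluated on object recognition data.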

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations

Published In

CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval

July 2007, 655 pages

ISBN: 9781595937339

DOI: 10.1145/1282280

General Chairs: Nicu Sebe (Univ. of Amsterdam, The Netherlands), Marcel Worring (Univ. of Amsterdam, The Netherlands)

Copyright © 2007 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

In-Cooperation

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States


Conference

CIVR07: International Conference on Image and Video Retrieval 2007

July 9 - 11, 2007

Amsterdam, The Netherlands

Sponsor: SIGMM
