skip to main content
10.1145/1459359.1459421acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Annotating personal albums via web mining

Published: 26 October 2008 Publication History

Abstract

Nowadays personal albums are becoming more and more popular due to the explosive growth of digital image capturing devices. An effective automatic annotation system for personal albums is desired for both efficient browsing and search. Existing research on image annotation evolves through two stages: learning-based methods and web-based methods. Learning-based methods attempt to learn classifiers or joint probabilities between images and concepts, which are difficult to handle large-scale concept sets due to the lack of training data. Web-based methods leverage web image data to learn relevant annotations, which greatly expand the scale of concepts. However, they still suffer two problems: the query image lacks prior knowledge and the annotations are often noisy and incoherent. To address the above issues, we propose a web-based annotation approach to annotate a collection of photos simultaneously, instead of annotating them independently, by leveraging the abundant correlations among the photos. A multi-graph similarity propagation based semi-supervised learning (MGSP-SSL) algorithm is proposed to suppress the noises in the initial annotations from the Web. Experiments on real personal albums show that the proposed approach outperforms existing annotation methods.

References

[1]
J, Sivic., A, Zisserman. Video Google: A text retrieval approach to object matching in video. In Proceedings of International Conference on Computer Vision (ICCV), 2003.
[2]
J. Li and J. Z. Wang, Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach. IEEE Trans. On Pattern Analysis and Machine Intelligence, 2003. 25(19): p.1075--1088.
[3]
C. Cusano, G. Ciocca, and R. Schettini. Image Annotation Using SVM. Proceedings of Internet Imaging, Vol. SPIE 5304. 2004.
[4]
S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple bernoulli relevance models for image and video annotation. In Proc. of CVPR, Washington, DC, June, 2004.
[5]
D. M. Blei and M. I. Jordan. Modeling annotated data. In Proc. SIGIR, Toronto, July. 2003.
[6]
C. Wang, F. Jing, L. Zhang and H.J. Zhang. Scalable search-based image annotation of personal images. In Proceedings of the 8th ACM international workshop on Multimedia information retrieval. ACM Press New York, NY, USA, 2006, 269--278
[7]
X. Wang, L. Zhang, F. Jing and W. Y. Ma. AnnoSearch: Image Auto-Annotation by Search. International Conference on Computer Vision and Pattern Recognition. Washington, DC, 2006, 1483--1490.
[8]
X. Rui, M. Li, Z. Li, W. Y. Ma, Yu, N. Bipartite Graph Reinforcement Model for web image annotation. In Proceedings of the 15th Annual ACM International Conference on Multimedia, 2007
[9]
J. Cui, F. Wen, R. Xiao, Y. Tian, X. Tang. EasyAlbum: An Interactive Photo Annotation System Based on Face Clustering and Re-ranking. In Proceedings of SIGCHI, 2007.
[10]
L. Zhang, L. Chen, M. Li, H. Zhang. Automated annotation of human faces in family albums. In Proceedings of ACM Multimedia, 2003.
[11]
D. Zhou, J. Huang, and B. Schölkopf. Learning with Local and global consistency. 18th Annual Conference on Neural Information Processing System, 2003.
[12]
J. Jeon, V. Lavrenko and R. Manmatha. Automatic Image Annotation and Retrieval Using Cross-media Relevance Models. In Proc. of SIGIR, Toronto, July 2003.
[13]
J Philbin, O Chum, M Isard, J Sivic, A Zisserman. Object retrieval with large vocabularies and fast spatial matching. CVPR, 2007.
[14]
M. Wang, X. Hua, X. Yuan, Y. Song and L. Dai. Optimizing multi-graph learning: towards a unified video annotation scheme. Proceedings of the 15th international conference on Multimedia, 2007.
[15]
J. Tang, X. Hua, T. Mei, G. Qi, S. Li and X. Wu. Temporally Consistent Gaussian Random Field for Video Semantic Analysis. IEEE International Conference on Image Processing, 2007.
[16]
V. Lavrenko, R. Manmatha and J. Jeon. A Model for Learning the Semantics of Pictures. In Proc. NIPS, 2003.
[17]
X. He, W. Y. Ma and H. J. Zhang. Learning an image manifold for retrieval. In Proc. of ACM international Conference On Multimedia, 2005.
[18]
R. L. Cilibrasi, P. M. B. Vitányi. The google similarity distance. IEEE Trans. on Knowledge and data engineering, 2007
[19]
J. Liu, M. Li, W. Y. Ma, Q. Liu and H. Lu An Adaptive Graph Model for Automatic Image Annotation. ACM Workshop on Multimedia Information Retrieval (MIR), 2006.
[20]
P. Duygulu and K. Barnard. Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Proc. of ECCV, 2002.
[21]
J. Kandola, J. Shawe-Taylor, N. Cristianini. Learning Semantic Similarity. Annual Conference on Neural Information Processing System, 2003.
[22]
X. Wang, W. Y. Ma, G. Xue, X. Li. Multi-Model Similarity Propagation and its Application for Web Image Retrieval. In Proceedings of ACM International Conference on Multimedia, 2004
[23]
D. Lowe. Local feature view clustering for 3D object recognition. In Proc. CVPR, 2001.
[24]
H. Lejsek, F. Ásmundsson, B. Jónsson, L. Amsaleg. Scalability of local image descriptors: a comparative study. ACM Multimedia 2006: 589--598.
[25]
S. Boll, P. Sandhaus, A. Scherp, U. Westermann. Semantics, content, and structure of many for the creation of personal photo albums. ACM Multimedia 2007: 641--650
[26]
X. Lian, L. Chen, J. X. Yu, G. Wang, G. Yu. Similarity Match Over High Speed Time-Series Streams. ICDE 2007: 1086--1095
[27]
F. Golshani. EIC's Message: Multimedia is Correlated Media. IEEE MultiMedia 11(1): (2004)
[28]
A. Zunjarwad, H. Sundaram, L. Xie. Contextual wisdom: social relations and correlations for multimedia event annotation. ACM Multimedia 2007: 615--624
[29]
Y. Lin, H. Sundaram, Y. Chi, J. Tatemura, B. Tseng Detecting splogs via temporal dynamics using self-similarity analysis. TWEB 2(1): (2008)q
[30]
M. Cooper, J. Foote, A. Girgensohn, L. Wilcox. Temporal event clustering for digital photo collections. TOMCCAP 1(3): 269--288 (2005)
[31]
R. Zhang, Z. Zhang, M. Li, W. Y. Ma, H. Zhang. A Probabilistic Semantic Model for Image Annotation and Multi-Modal Image Retrieval. ICCV 2005: 846--851
[32]
W. Klas, R. King. Context-Aware Multimedia. Encyclopedia of Multimedia 2006
[33]
L. Hardman, J. Ossenbruggen. Creating meaningful multimedia presentations. ISCAS 2006
[34]
J. Liu, B. Wang, M. Li, Z. Li, W. Y. Ma, H. Lu and S. Ma. Dual Cross-Media Relevance Model for Image Annotation. In Proceedings of the 15th Annual ACM International Conference on Multimedia 2007.
[35]
E. Chang, et al. CBSA: Content-Based Soft Annotation for Multimodal Image Retrieval Using Bayes Point Machines. CirSysVideo, 2003. 13(1): p. 26--38.

Cited By

View all
  • (2020)Exploring nonnegative and low-rank correlation for noise-resistant spectral clusteringWorld Wide Web10.1007/s11280-020-00802-1Online publication date: 12-Mar-2020
  • (2015)Multitask Spectral Clustering by Exploring Intertask CorrelationIEEE Transactions on Cybernetics10.1109/TCYB.2014.234401545:5(1083-1094)Online publication date: May-2015
  • (2013)Discriminative Nonnegative Spectral Clustering with Out-of-Sample ExtensionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2012.11825:8(1760-1771)Online publication date: 1-Aug-2013
  • Show More Cited By

Index Terms

  1. Annotating personal albums via web mining

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '08: Proceedings of the 16th ACM international conference on Multimedia
    October 2008
    1206 pages
    ISBN:9781605583037
    DOI:10.1145/1459359
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 October 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. image annotation
    2. multi-graph
    3. personal albums
    4. semi-supervised learning
    5. similarity propagation

    Qualifiers

    • Research-article

    Conference

    MM08
    Sponsor:
    MM08: ACM Multimedia Conference 2008
    October 26 - 31, 2008
    British Columbia, Vancouver, Canada

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 16 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Exploring nonnegative and low-rank correlation for noise-resistant spectral clusteringWorld Wide Web10.1007/s11280-020-00802-1Online publication date: 12-Mar-2020
    • (2015)Multitask Spectral Clustering by Exploring Intertask CorrelationIEEE Transactions on Cybernetics10.1109/TCYB.2014.234401545:5(1083-1094)Online publication date: May-2015
    • (2013)Discriminative Nonnegative Spectral Clustering with Out-of-Sample ExtensionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2012.11825:8(1760-1771)Online publication date: 1-Aug-2013
    • (2013)Person re-identification by manifold ranking2013 IEEE International Conference on Image Processing10.1109/ICIP.2013.6738736(3567-3571)Online publication date: Sep-2013
    • (2013)Locally discriminative spectral clustering with composite manifoldNeurocomputing10.1016/j.neucom.2013.03.034119(243-252)Online publication date: Nov-2013
    • (2012)Semantic Based Image Retrieval using multi-agent model by searching and filtering replicated web images2012 World Congress on Information and Communication Technologies10.1109/WICT.2012.6409187(817-821)Online publication date: Oct-2012
    • (2012)An effective preprocessing method for subspace face recognition using genetic‐based clustering in ideal and noisy conditionsIEEJ Transactions on Electrical and Electronic Engineering10.1002/tee.217767:6(583-591)Online publication date: 27-Sep-2012
    • (2011)Automated content labeling using context in emailProceedings of the 17th International Conference on Management of Data10.5555/2591338.2591348(1-12)Online publication date: 19-Dec-2011
    • (2011)Retrieving and ranking unannotated images through collaboratively mining online search resultsProceedings of the 20th ACM international conference on Information and knowledge management10.1145/2063576.2063650(485-494)Online publication date: 24-Oct-2011
    • (2011)Collection-based sparse label propagation and its application on social group suggestion from photosACM Transactions on Intelligent Systems and Technology10.1145/1899412.18994162:2(1-21)Online publication date: 24-Feb-2011
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media