skip to main content
10.1145/1386352.1386408acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
research-article

Web-based information content and its application to concept-based video retrieval

Published: 07 July 2008 Publication History

Abstract

Semantic similarity between words or phrases is frequently used to find matching correlations between search queries and documents when straightforward matching of terms fails. This is particularly important for searching in visual databases, where pictures or video clips have been automatically tagged with a small set of semantic concepts based on analysis and classification of the visual content. Here, the textual description of documents is very limited, and semantic similarity based on WordNet's cognitive synonym structure, along with information content derived from term frequencies, can help to bridge the gap between an arbitrary textual query and a limited vocabulary of visual concepts. This approach, termed concept-based retrieval, has received significant attention over the last few years, and its success is highly dependent on the quality of the similarity measure used to map textual query terms to visual concepts.
In this paper, we consider some issues of semantic similarity measures based on Information Content (IC), and propose a way to improve them. In particular, we note that most IC-based similarity measures are derived from a small and relatively outdated corpus (the Brown corpus), which does not adequately capture the usage pattern of many contemporary terms: for example, out of more than 150,000 WordNet terms, only about 36,000 are represented. This shortcoming reflects very negatively on the coverage of typical search query terms. We therefore suggest using alternative IC corpora that are larger and better aligned with the usage of modern vocabulary. We experimentally derive two such corpora using the WWW Google search engine, and show that they provide better coverage of vocabulary, while showing comparable frequencies for Brown corpus terms. Finally, we evaluate the two proposed IC corpora in the context of a concept-based video retrieval application using the TRECVID 2005, 2006, and 2007 datasets, and we show that they increase average precision results by up to 200%.

References

[1]
Fellbaum, C. WordNet: An Electronic Lexical Database. 1998. MIT Press, Cambridge, MA.
[2]
Zhai, Y., Liu, J., Shah, M. Automatic Query Expansion for News Video Retrieval. In Proceedings of the International Conference on Multimedia and Expo (Toronto, Canada, July 9-12, 2006). ICME '06. IEEE Press, New York, NY, 965--968.
[3]
Snoek, C.G.M., Huurnink, B., Hollink, L., de Rijke, M., Schreiber, G., Worring, M. Adding Semantics to Detectors for Video Retrieval, IEEE Transactions on Multimedia, Vol. 9, Issue 5 (August 2007). IEEE Press, New York, NY, 975--986.
[4]
Wu. Z., Palmer, M. Verb semantics and lexical selection. In Proceedings of Annual Meeting of the Association for Computational Linguistics (Las Cruces, NM, June 27-30, 1994). Morgan Kaufmann, San Francisco, CA, 133--138.
[5]
Resnik, P. Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In Proceedings of the International Joint Conference on Artificial Intelligence (Montréal, Canada, August 20-25, 1995). IJCAI '95. Morgan Kaufmann, San Francsico, CA, 448--453.
[6]
Jiang, J.J., Conrath, D.W. Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. In Proceedings of the International Conference Research on Computational Linguistics (Taipei, Taiwan, August 22-24, 1997). ROCLING X. 1997.
[7]
Lin, D. An information-theoretic definition of similarity. In Proceedings of the International Conference on Machine Learning (Madison, WI, 1998). ICML '98. Morgan Kaufmann, San Francisco, CA, 296--304.
[8]
Lesk, M.E. Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In Proceedings of the Special Interest Group Design of Communication Conference (Toronto, Canada, June 8-11). SIGDOC '86. ACM Press, New York, NY, 24--26.
[9]
Leacock, C., Chodorow, M., Miller, G.A. Using corpus statistics and WordNet relations for sense identification. In Computational Linguistics, Vol. 24, Number 1 (March 1998). MIT Press, Cambridge, MA, 147--165.
[10]
Over, P. Ianeva, T., Kraaij, W., Smeaton, A.F. TRECVID 2005 An Overview. In Proceedings of the NIST TRECVID 2005 Workshop (Gaithersburg, MD, November 14-15, 2005). TRECVID '05.
[11]
Over P., Ianeva, T., Kraaij, W., Smeaton, A.F. TRECVID 2006 Overview. In Proceedings of the NIST TRECVID 2006 Workshop (Gaithersburg, MD, November 13-14, 2006). TRECVID '06.
[12]
Over, P. Awad, G. Kraaij, W., Smeaton, A.F. TRECVID 2007 - An Introduction. In Proceedings of the NIST TRECVID 2007 Workshop (Gaithersburg, MD, November 5-6, 2007). TRECVID '07.
[13]
Pedersen, T., Patwardhan, Michelizzi, J. Wordnet::similarity - measuring the relatedness of concepts. In Proceedings of the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (Boston, MA, May 3-5, 2004). NAACL '04. Association for Computational Linguistics, Morristown, NJ, 38--41.
[14]
Patwardhan, S., Banerjee, S., Pedersen, T. Using Measures of Semantic Relatedness for Word Sense Disambiguation. In Proceedings of the International Conference on Intelligent Text Processing and Computational Linguistics (Mexico City, Mexico, February 16-22, 2003). CICLing '03. Springer Verlag, Berlin, Heidelberg, 241--257.
[15]
Seco, N., Veale, T., Hayes, J. An Intrinsic Information Content Metric for Semantic Similarity in WordNet. In Proceedings of the European Conference on Artificial Intelligence (Valencia, Spain, August 22-27, 2004). ECAI '04. IOS Press, Amsterdam, The Netherlands, 1089--1090.
[16]
Budanitsky, A., Hirst, G. Semantic distance in WordNet: An experimental, application--oriented evaluation of five measures. In Proceedings of the North American Chapter of the Association for Computational Linguistics Workshop (Pittsburgh, PA, June 2-7, 2001). NAACL '01. Association for Computational Linguistics, Morristown, NJ, 29--34.
[17]
Pucher, M. Performance Evaluation of WordNet-based Semantic Relatedness Measures for Word Prediction in Conversational Speech. In Proceedings of the International Workshop on Computational Semantics (Tilburg, Netherlands, January 12-14, 2005). IWCS 6.
[18]
Pedersen, T., Pakhomov, S. Developing Measures of Semantic Relatedness for the Biomedical Domain. Digital Technology Initiatives Forum (Minneapolis, MN, Feb 28, 2005). Digital Technology Center, University of Minnesota.
[19]
Naphade, M., Smith, J.R., Souvannavong, F. On the Detection of Semantic Concepts at TRECVID. In Proceedings of the ACM Internation Multimedia Conference (New York, NY, October 10-16, 2004). ACM Press, New York, NY, 660--667.
[20]
Natsev, A., Haubold, A., Tesic, J., Xie, L., Yan, R. Semantic concept-based query expansion and re-ranking for multimedia retrieval. In Proceedings of the ACM International Conference on Multimedia (Augsburg, Germany, September 24-29, 2007). MM '07. ACM Press, New York, NY, 991--1000.
[21]
Neo, S.-Y., Zhao, J., Kan, M.-Y., Chua, T.-S. Video retrieval using high level features: Exploiting query matching and confidence-based weighting. In Proceedings of the ACM International Conference on Image and Video Retrieval (Tempe, AZ, July 13-15, 2006). CIVR '06. Spring Verlag, Berlin, Heidelberg, 143--152.
[22]
Chang, S.-F., Hsu, W., Kennedy, L., Xie, L., Yanagawa, A., Zavesky, E., Zhang, D. Columbia University, TRECVID-2005 Video Search and High-Level Feature Extraction. In Proceedings of the NIST TRECVID 2005 Workshop (Gaithersburg, MD, November 14-15, 2005). TRECVID '05.
[23]
Chua, T.-S., Neo, S.-Y., Zheng, Y., Goh, H.-K., Xiao, Y., Zhao, M., Tang, S., Gao, S., Zhu, X., Chaisorn, L., Sun, Q. TRECVID-2006 by NUS-I2R. In Proceedings of the NIST TRECVID 2006 Workshop (Gaithersburg, MD, November 13-14, 2006). TRECVID '06.
[24]
Snoek, C. G. M., van Gemert, J. C., Geusebroek, J. M., Huurnink, B., Koelma, D. C., Nguyen, G. P., Rooij, O. D., Seinstra, F. J., Smeulders, A. W. M., Veenman, C. J., Worring, M. The MediaMill TRECVID 2005 Semantic Video Search Engine. In Proceedings of the NIST TRECVID 2005 Workshop (Gaithersburg, MD, November 14-15, 2005). TRECVID '05.
[25]
Haubold, A., Natsev, A., Naphade, M. Semantic multimedia retrieval using lexical query expansion and model-based reranking. In Proceedings of the International Conference on Multimedia and Expo (Toronto, Canada, July 9-12, 2006). ICME '06. IEEE Press, New York, NY, 1761--1764.
[26]
Varelas, G., Voutsakis, E., Raftopoulou, P., Petrakis, E.G.M., Milios, E.E. Semantic Similarity Methods in WordNet and their Application to Information Retrieval on the Web. In Proceedings of the ACM Workshop on Web Information and Data Management (Bremen, Germany, November 5, 2005). WIDM '05. ACM Press, New York, NY, 10--16.

Cited By

View all
  • (2018)Learning a Multi-Concept Video Retrieval Model with Multiple Latent VariablesACM Transactions on Multimedia Computing, Communications, and Applications10.1145/317664714:2(1-21)Online publication date: 25-Apr-2018
  • (2014)ThumbReelsProceedings of the SIGCHI Conference on Human Factors in Computing Systems10.1145/2556288.2557249(1217-1220)Online publication date: 26-Apr-2014
  • (2013)Concept-based video retrieval model based on the combination of semantic similarity measures2013 13th International Conference on Intellient Systems Design and Applications10.1109/ISDA.2013.6920709(64-68)Online publication date: Dec-2013
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIVR '08: Proceedings of the 2008 international conference on Content-based image and video retrieval
July 2008
674 pages
ISBN:9781605580708
DOI:10.1145/1386352
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 July 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. LSCOM
  2. TRECVid
  3. WordNet
  4. brown corpus
  5. information content
  6. semantic similarity

Qualifiers

  • Research-article

Conference

CIVR08

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2018)Learning a Multi-Concept Video Retrieval Model with Multiple Latent VariablesACM Transactions on Multimedia Computing, Communications, and Applications10.1145/317664714:2(1-21)Online publication date: 25-Apr-2018
  • (2014)ThumbReelsProceedings of the SIGCHI Conference on Human Factors in Computing Systems10.1145/2556288.2557249(1217-1220)Online publication date: 26-Apr-2014
  • (2013)Concept-based video retrieval model based on the combination of semantic similarity measures2013 13th International Conference on Intellient Systems Design and Applications10.1109/ISDA.2013.6920709(64-68)Online publication date: Dec-2013
  • (2013)An integrated semantic-based approach in concept based video retrievalMultimedia Tools and Applications10.1007/s11042-011-0848-464:1(77-95)Online publication date: 1-May-2013
  • (2012)Video Event Detection Using Temporal Pyramids of Visual Semantics with Kernel Optimization and Model Subspace BoostingProceedings of the 2012 IEEE International Conference on Multimedia and Expo10.1109/ICME.2012.190(747-752)Online publication date: 9-Jul-2012
  • (2012)High Level Semantic Concept Retrieval Using a Hybrid Similarity MethodKnowledge Technology10.1007/978-3-642-32826-8_27(262-271)Online publication date: 2012
  • (2011)Semantic Video Retrieval by Integrating Concept- and Content-Aware MiningProceedings of the 2011 International Conference on Technologies and Applications of Artificial Intelligence10.1109/TAAI.2011.14(32-37)Online publication date: 11-Nov-2011
  • (2010)Topic-based awareness computing model for video-sharing service2010 2nd International Symposium on Aware Computing10.1109/ISAC.2010.5670453(44-50)Online publication date: Nov-2010
  • (2009)Concept-Based Video RetrievalFoundations and Trends in Information Retrieval10.1561/15000000142:4(215-322)Online publication date: 1-Apr-2009
  • (2009)Web news categorization using a cross-media document graphProceedings of the ACM International Conference on Image and Video Retrieval10.1145/1646396.1646431(1-8)Online publication date: 8-Jul-2009
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media