ABSTRACT
The semantics of rich multimedia presentations on the web, in formats such as SMIL, SVG, and Flash, cannot be understood by today's search engines, or only to a very limited extent. This hampers the retrieval of such presentations and makes their archival and management difficult. Existing metadata models and metadata standards are either conceptually too narrow, focus on a single media type, cannot be combined with one another, or are not practically applicable to the semantic description of rich multimedia presentations.
In this paper, we propose the Multimedia Metadata Ontology (M3O) for annotating rich, structured multimedia presentations. The M3O provides a generic modeling framework for representing sophisticated multimedia metadata. It allows the features of existing metadata models and metadata standards to be integrated. Our approach is based on Semantic Web technologies and can be easily integrated with multimedia formats such as the W3C standards SMIL and SVG. With the M3O, we unlock the semantics of rich multimedia presentations on the web by making those semantics machine-readable and machine-understandable. The M3O is used with our SemanticMM4U framework for the multi-channel generation of semantically rich multimedia presentations.
Index Terms
- Unlocking the semantics of multimedia presentations in the web with the multimedia metadata ontology