ACM Home Page
Please provide us with feedback. Feedback
Real-time background music monitoring based on content-based retrieval
Full text PdfPdf (3.16 MB)
Source International Multimedia Conference archive
Proceedings of the 12th annual ACM international conference on Multimedia table of contents
New York, NY, USA
SESSION: Technical session 3: audio processing table of contents
Pages: 120 - 127  
Year of Publication: 2004
ISBN:1-58113-893-8
Authors
Yoshiharu Suga  NTT Cyber Solution Laboratories, NTT Corporation, Kanagawa, Japan
Naoko Kosugi  NTT Cyber Solution Laboratories, NTT Corporation, Kanagawa, Japan
Masashi Morimoto  NTT Cyber Solution Laboratories, NTT Corporation, Kanagawa, Japan
Sponsors
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 52,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1027527.1027550
What is a DOI?

ABSTRACT

In this paper, we describe music monitoring in TV broadcasting based on content-based retrieval. A part of audio signals is sequentially extracted from TV broadcasting as a retrieval key, and a music DB that stores a great number of musical pieces is retrieved by this key based on content-based retrieval, and a musical piece is identified sequentially. In this way, we are able to carry out music monitoring. There are three necessary requirements important for realization of the music monitoring. They are robustness against non-stationary noise, real-time processing of large-scale music DB retrieval, and high granularity of the retrieval key. As a method of realizing robustness against non-stationary noise, we propose a partially similar retrieval method which improves retrieval accuracy by using the moment in which no superfluous noise is produced during the existence of non-stationary noise. In order to realize real-time processing of large-scale music DB retrieval, we adopt a coarse-to-fine strategy, and propose a spectral peaks hashing method which performs high-speed refining by using hashing. To calculate a hash value in this hashing, frequency channel numbers of the spectral peaks are used. In order to realize high granularity of the retrieval key, it is necessary to solve the problem of retrieval accuracy degradation associated with heightening the granularity. To improve this accuracy, we propose a detection-by-continuity method which uses music continuity. Moreover, by using music continuity to correct the starting point and the terminal point of a musical piece in TV broadcasting, the retrieval accuracy is improved further. In order to evaluate the effectiveness of the proposed methods, we performed experiments using a music DB which stores over 28,000 musical pieces (over 1800 hours) and TV broadcasting audio signals containing music and background music (BGM). The granularity of the retrieval key was set at about 0.5 seconds. Through these experiments, We verified that music monitoring was possible for over 90% of the total time of music and BGM used in TV broadcasting, and that real-time processing was possible.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
L. Boney, A. H. Tewfik, and K. N. Harmdy. digital watermarks for audio signals. In Internatioal conference on multimedia computing and systems, pages 473--480, 1996.
 
2
P. Cano, E. Battle, T. Kallker, and J. Haitsma. A review of algorithms for audio fingerprinting. In internatioanl Workshop on Multimedia Signal Processing, 2002.
 
3
P. Cano, E. Battle, H. Mayer, and H. Neuschmied. Robust sound modeling for song detection in broadcast audio. In Proc. AES 112th Int. Conv., 2002.
 
4
 
5
J. T. Foote. Content-based retrieval of music and audio. In Multimedia Storage and Archiving System II, Proc. of SPIE, volume 3229, pages 138--147, 1997.
 
6
E. Gómez, P. Cano, L. de~C. T. Gomes, and E. B. M. Bonnet. Mixed watermarking-fingeprinting approach for integrity verification of audio recordings. In Proc. of IEEE international telecommunications symposiom, 2002.
 
7
J. Haitsma and T. Kalker. A highly robust audio fingerprinting system. In Proc. of IMSIR 2002, October 2002.
 
8
K. Kashino, G. Smith, and H. Murase. Time-series active search for quick retrieval of audio and video. In Proc. of 1999 International Conference of Acoustics, Speech and Signal Processing, volume VI, pages 2993--2996, 1999.
 
9
D. Kirovski and H. Malvar. Robust spread-spectrum audio watermarking. In Proc. 2001 IEEE International Conference on Acoustics, Speech and Signal Processing, volume~3, pages 1345--1348, May 2001.
10
 
11
B. Logan. Mel frequency cepstral coefficients for music modeling. In the first international symposium on music information retrieval, 2002.
 
12
H. Nagano, K. Kashino, and H. Murase. A fast search algorithm for background music signals based on search for numerous small signal components. In Proc. of 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, volume V, pages 796--799, April 2003.
 
13
F. A. P. Petitcolas, R. J. Anderson, and M. G. Kuhn. Information hiding - a survey. In Proc. of IEEE special issue on Protection of multimedia content, pages 1062--1078, 1999.
 
14
 
15
A. Wang. An industrial strength audio search algorithm. In Online Proceedings of ISMIR2003. http://ismir2003.ismir.net/presentations/Wang.PDF.
 
16

Collaborative Colleagues:
Yoshiharu Suga: colleagues
Naoko Kosugi: colleagues
Masashi Morimoto: colleagues