ACM Home Page
Please provide us with feedback. Feedback
Accurate repeat finding and object skipping using fingerprints
Full text PdfPdf (192 KB)
Source International Multimedia Conference archive
Proceedings of the 13th annual ACM international conference on Multimedia table of contents
Hilton, Singapore
SESSION: Content 3: audio and security table of contents
Pages: 656 - 665  
Year of Publication: 2005
ISBN:1-59593-044-2
Author
Cormac Herley  Microsoft Research, Redmond, WA
Sponsors
ACM: Association for Computing Machinery
SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
SIGMULTIMEDIA: ACM Special Interest Group on Multimedia
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 48,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1101149.1101295
What is a DOI?

ABSTRACT

This paper introduces a novel and very accurate segmentation algorithm. It is very efficient and consumes less than 10% of CPU on a simple desktop PC to segment a stream in real-time. It operates on an audio stream, or on the audio portion of a audio-visual stream. It is very accurate: it accurately detects the positions and durations of objects on an over-the-air broadcast television signal, and songs on both FM and internet radio stations (as checked against labeled ground truth streams). The algorithm does not require any prior information or training. We detail the system design and present results of segmenting broadcast streams.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
 
3
 
4
A. Del Bimbo, P. Pala, and L. Tanganelli. Retrieval by content of commercials based on dynamics of color flows. Proc. ICME, pages 479--482, 2000.
5
 
6
C. J. C. Burges, J. C. Platt and S. Jana. Distortion descriminant analysis for audio fingerprinting. IEEE Trans. on Speech and Audio Processing, 11:165--174, 2003.
 
7
P. Cano, E. Batlle, T. Kalker, and J. Haitsma. A review of algorithms for audio fingerprinting. IEEE Workshop on Multimedia Signal Processing, 2002.
 
8
M. Cooper and J. Foote. Summarizing video using non-negative similarity matrix factorization. Proc. IEEE Multimedia Signal Processing Workshop, 2002.
 
9
 
10
J. Haitsma, T. Kalker, and J. Oostveen. An efficient database search strategy for audio fingerprinting.
 
11
A. Hampapur and R. Bolle. Feature based indexing for media tracking. Proc. ICME, 2000.
 
12
C. Herley. ARGOS: Automatically Extracting Repeating Objects from Multimedia Streams. IEEE Trans. Multimedia.
 
13
C. Herley. Extracting repeats from streams. Proc. ICASSP, 2004.
 
14
J.-L. Hsu, C.-C. Liu, and A. L. P. Chen. Discovering nontrivial repeating patterns in music data. IEEE Trans. on Multimedia, 3(3):311--325, 2001.
 
15
J. Haitsma and T. Kalker. A highly robust audio fingerprinting system. Proc. Intl Conf on Music Information Retrieval, 2002.
 
16
H. Jiang, T. Lin, and H.-J. Zhang. Video segmentation with the assistance of audio content analysis. ICME, 2000.
 
17
S. E. Johnson and P. C. Woodland. A method for direct audio search with applications to indexing and retrieval. ICASSP, 2000.
 
18
K. Kashino, T. Kurozumi, and H. Murase. A quick search method for audio and video signals based on histogram pruning. IEEE Trans. on Multimedia, 5(4):348--357, June 2003.
 
19
 
20
T. Muramoto and M. Sugiyama. Visual and audio segmentation for video streams. Proc. ICME, pages 1547--1550, 2000.
21
22
23
24
 
25
H. Sundaram and S.-F. Chang. Video scene segmentation using video and audio features. ICME, 2000.
 
26
 
27
M. Yeung, B.-L. Yeo, and B. Liu. Extracting story units from long programs for video browsing and navigation. Proc. IEEE Conf. on Multimedia Computing and Systems, pages 296--305, June 1996.