ABSTRACT
In this paper, we present an efficient query-by-singing based musical retrieval system. We first combine multiple Support Vector Machines by classifier committee learning to segment the sentences from a song automatically. Many new methods in manipulating Mel-Frequency Cepstral Coefficient (MFCC) matrix are studied and compared for optimal feature selection. Experiments show that the 3rd coefficient is the most relevant to music comparison out of 13 coefficients and the proposed simplified MFCC feature is able to achieve a reasonable trade-off between accuracy and efficiency. To improve system efficiency, we re-organize the database by a new two-stage clustering scheme in both time space and feature space. We combine K-means algorithm and dynamic time wrapping similarity measurement for feature space clustering. We also propose a new method for model-selection of K-means algorithm. Experiments show that the proposed approach can achieve more than 30 percent increase in accuracy while speed up more than 16 times in average query time.
- A. Ghias, J. Logan, D. Chamberlin and B. C. Smith, "Query by Humming - Musical Information Retrieval in an Audio Database," ACM Multimedia 1995. Google ScholarDigital Library
- J.S. Roger Jang, H. R. Lee and M. Y. Kao, "Content-based music retrieval using linear scaling and branch-and bound tree search," IEEE ICME 2001.Google Scholar
- L Lu, H. You and H. J. Zhang, "A New Approach To Query By Humming in Music Retrieval," IEEE ICME 2001.Google Scholar
- H. H. Shih, S. S. Narayanan and C.-C. Jay Kuo, "Multidimensional Humming Transcription Using a Statistical Approach for Query by Humming Systems," IEEE ICASSP 2003.Google Scholar
- L. Lu, H. J. Zhang and S. Z. Li, "Content-based audio classification and segmentation by using support vector machines," ACM Multimedia System Journal 8(6), pp. 482--492, 2003.Google ScholarCross Ref
- T. Joachims, "Making large-scale SVM learning practical," Advances in Kernel Methods - Support Vector Learning, MIT Press, 1999. Google ScholarDigital Library
- G. Guo and S. Z. Li, "Content-based Audio Classification and Retrieval by Support Vector Machines," IEEE Trans. on NN, Vol. 14, No. 1, 2003. Google ScholarDigital Library
- J. Kittler, M. Hatef, P. W. Duin and J. Matas, "On Combining Classifiers," IEEE Trans. On PAMI. Vol. 20, no. 3, pp. 226--239, Mar. 1998. Google ScholarDigital Library
- L. Rabiner and B. H. Juang, "Fundamentals of Speech Recognition", Prentice Hall Signal Processing Series. Prentice Hall, Englewood Cliffs, NJ, 1993. Google ScholarDigital Library
Index Terms
- K-BOX: a query-by-singing based music retrieval system
Recommendations
Music segmentation and summarization based on self-similarity matrix
ICUIMC '13: Proceedings of the 7th International Conference on Ubiquitous Information Management and CommunicationIn this paper, we propose a new method for segmenting and summarizing music based on its structure analysis. To do that, we first extract timbre feature from acoustic music signal and construct a self-similarity matrix that shows similarities among the ...
AAM: a dataset of Artificial Audio Multitracks for diverse music information retrieval tasks
AbstractWe present a new dataset of 3000 artificial music tracks with rich annotations based on real instrument samples and generated by algorithmic composition with respect to music theory. Our collection provides ground truth onset information and has ...
Towards Timbre-Invariant Audio Features for Harmony-Based Music
Chroma-based audio features are a well-established tool for analyzing and comparing harmony-based Western music that is based on the equal-tempered scale. By identifying spectral components that differ by a musical octave, chroma features possess a ...
Comments