skip to main content
10.1145/1027527.1027639acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

K-BOX: a query-by-singing based music retrieval system

Authors Info & Claims
Published:10 October 2004Publication History

ABSTRACT

In this paper, we present an efficient query-by-singing based musical retrieval system. We first combine multiple Support Vector Machines by classifier committee learning to segment the sentences from a song automatically. Many new methods in manipulating Mel-Frequency Cepstral Coefficient (MFCC) matrix are studied and compared for optimal feature selection. Experiments show that the 3rd coefficient is the most relevant to music comparison out of 13 coefficients and the proposed simplified MFCC feature is able to achieve a reasonable trade-off between accuracy and efficiency. To improve system efficiency, we re-organize the database by a new two-stage clustering scheme in both time space and feature space. We combine K-means algorithm and dynamic time wrapping similarity measurement for feature space clustering. We also propose a new method for model-selection of K-means algorithm. Experiments show that the proposed approach can achieve more than 30 percent increase in accuracy while speed up more than 16 times in average query time.

References

  1. A. Ghias, J. Logan, D. Chamberlin and B. C. Smith, "Query by Humming - Musical Information Retrieval in an Audio Database," ACM Multimedia 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J.S. Roger Jang, H. R. Lee and M. Y. Kao, "Content-based music retrieval using linear scaling and branch-and bound tree search," IEEE ICME 2001.Google ScholarGoogle Scholar
  3. L Lu, H. You and H. J. Zhang, "A New Approach To Query By Humming in Music Retrieval," IEEE ICME 2001.Google ScholarGoogle Scholar
  4. H. H. Shih, S. S. Narayanan and C.-C. Jay Kuo, "Multidimensional Humming Transcription Using a Statistical Approach for Query by Humming Systems," IEEE ICASSP 2003.Google ScholarGoogle Scholar
  5. L. Lu, H. J. Zhang and S. Z. Li, "Content-based audio classification and segmentation by using support vector machines," ACM Multimedia System Journal 8(6), pp. 482--492, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  6. T. Joachims, "Making large-scale SVM learning practical," Advances in Kernel Methods - Support Vector Learning, MIT Press, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. G. Guo and S. Z. Li, "Content-based Audio Classification and Retrieval by Support Vector Machines," IEEE Trans. on NN, Vol. 14, No. 1, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Kittler, M. Hatef, P. W. Duin and J. Matas, "On Combining Classifiers," IEEE Trans. On PAMI. Vol. 20, no. 3, pp. 226--239, Mar. 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. Rabiner and B. H. Juang, "Fundamentals of Speech Recognition", Prentice Hall Signal Processing Series. Prentice Hall, Englewood Cliffs, NJ, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. K-BOX: a query-by-singing based music retrieval system

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MULTIMEDIA '04: Proceedings of the 12th annual ACM international conference on Multimedia
        October 2004
        1028 pages
        ISBN:1581138938
        DOI:10.1145/1027527

        Copyright © 2004 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 10 October 2004

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate995of4,171submissions,24%

        Upcoming Conference

        MM '24
        MM '24: The 32nd ACM International Conference on Multimedia
        October 28 - November 1, 2024
        Melbourne , VIC , Australia

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader