skip to main content
10.1145/2350716.2350732acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article

Robust speech recognition based on binaural speech enhancement system as a preprocessing step

Authors Info & Claims
Published:23 August 2012Publication History

ABSTRACT

In this paper, we present a robust speech recognition based on binaural speech enhancement system as a preprocessing step. This system uses an existing dereverberation technique followed by a spatial masking-based noise removal algorithm where only signals coming from the desired directions are retained by using a threshold angle. While state-of-the art approaches fix the threshold angle heuristically over all time frames, in this paper, we propose to consider an adaptive computation where this threshold angle is first learned in several noise-only frames and then updated frame by frame. Speech recognition results in real environment show the effectiveness of the proposed speech enhancement approach.

References

  1. M. P. Cooke, P. Green, L. Josifovski, and A. Vizinho. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication, 34: 267--285, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. N. Q. K. Duong, E. Vincent, and R. Gribonval. Under-determined reverberant audio source separation using a full-rank spatial covariance model. IEEE Trans. on Audio, Speech and Language Processing, 18(7): 1830--1840, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. T. T. et al. Speech enhancement using combination of dereverberation and noise reduction for robust speech recognition. In Proceedings of the Second Symposium on Information and Communication Technology, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C. Kim, K. Kumar, and R. M. Stern. Binaural sound source separation motivated by auditory processing. In Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4574--4577, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  5. C. Kim and R. M. Stern. Nonlinear enhancement of onset for robust speech recognition. In Proc. Int. Conf. on Spoken Language Processing (INTERSPEECH), pages 2058--2061, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  6. G. Kim and P. Loizou. Improving speech intelligibility in noise using environment-optimized algorithms. IEEE Trans. Audio, Speech, Language Processing, 18(8): 2080--2090, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. T. F. Kleinschmidt. Robust speech recognition using speech enhancement. PhD thesis, Queensland University of Technology, March 2010.Google ScholarGoogle Scholar
  8. S. Makino, T.-W. Lee, and H. Sawada. Blind Speech Separation. Springer, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  9. H. Park and R. M. Stern. Spatial separation of speech signals using amplitude estimation based on interaural comparisons of zero crossings. Speech Communication, 51(1): 15--25, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. Raj and R. M. Stern. Missing-feature methods for robust automatic speech recognition. IEEE Signal Processing Magazine, 22(5): 101--116, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  11. F. I. Shoji Kajita, Kazuya Takeda. A binaural speech processing methos using subband-crosscerrelation analysis for noise robust recognition. In IEEE. Conference Acoustic, Speech, and Signal Processing, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Slaney. Auditory toolbox. Technical report, Interval Research Corporation, 1998.Google ScholarGoogle Scholar
  13. S. Srinivasan and D. L. Wang. Robust speech recognition by integrating speech separation and hypothesis testing. Speech Communication, 52: 72--81, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. E. Vincent, S. Araki, F. Theis, G. Nolte, P. Bofill, H. Sawada, A. Ozerov, V. Gowreesunker, D. Lutter, and N. Q. K. Duong. The Signal Separation Campaign (2007--2010): Achievements and remaining challenges. Signal Processing, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Robust speech recognition based on binaural speech enhancement system as a preprocessing step

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Other conferences
                SoICT '12: Proceedings of the 3rd Symposium on Information and Communication Technology
                August 2012
                290 pages
                ISBN:9781450312325
                DOI:10.1145/2350716

                Copyright © 2012 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 23 August 2012

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • research-article

                Acceptance Rates

                Overall Acceptance Rate147of318submissions,46%
              • Article Metrics

                • Downloads (Last 12 months)0
                • Downloads (Last 6 weeks)0

                Other Metrics

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader