DOI: 10.1145/2187980.2188130
poster

CloudSpeller: query spelling correction by using a unified hidden markov model with web-scale resources

Published: 16 April 2012

ABSTRACT

Query spelling correction is an important component of modern search engines: it helps users express an information need more accurately and thus improves search quality. In this work we propose and implement an end-to-end query spelling correction system, CloudSpeller. CloudSpeller uses a Hidden Markov Model to model the major types of spelling errors in a unified framework that integrates a large-scale lexicon constructed from Wikipedia, an error model trained on high-confidence correction pairs, and the Microsoft Web N-gram service. On two search query spelling correction datasets, the system reaches F1 scores of 0.960 on the TREC dataset and 0.937 on the MSN dataset.
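To make the HMM formulation concrete, the following is a minimal sketch, not the CloudSpeller implementation. It treats each observed query token as an emission and each candidate correction as a hidden state, then decodes the most likely correction sequence with Viterbi. The tiny `LEXICON`, the `BIGRAM_LOGP` table, the per-edit penalty, and all function names are hypothetical stand-ins for the Wikipedia lexicon, the Web N-gram service, and the trained error model. The sketch handles only single-token misspellings; the other error types the abstract alludes to are out of scope here.

```python
# Toy resources: every value below is an illustrative assumption,
# not data from the paper or from any real service.
LEXICON = {"new", "now", "york", "yolk", "times", "time"}
BIGRAM_LOGP = {
    ("<s>", "new"): -1.0,
    ("new", "york"): -0.5,
    ("york", "times"): -0.7,
}
DEFAULT_LOGP = -6.0  # back-off log-probability for unseen bigrams


def edit_distance(a, b):
    """Levenshtein distance computed with a rolling row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def candidates(token, max_dist=1):
    """Hidden states for one token: nearby lexicon words plus the token itself."""
    cands = {w for w in LEXICON if edit_distance(token, w) <= max_dist}
    cands.add(token)
    return sorted(cands)


def emission_logp(observed, intended, per_edit_logp=-2.0):
    """Stand-in error model: a fixed log-penalty per edit operation."""
    return per_edit_logp * edit_distance(observed, intended)


def transition_logp(prev_word, word):
    """Stand-in language model: bigram lookup with a flat back-off."""
    return BIGRAM_LOGP.get((prev_word, word), DEFAULT_LOGP)


def viterbi_correct(query):
    """Return the highest-scoring sequence of candidate corrections."""
    tokens = query.split()
    if not tokens:
        return ""
    best = {"<s>": (0.0, [])}  # state -> (best log-score, best path so far)
    for tok in tokens:
        nxt = {}
        for cand in candidates(tok):
            emit = emission_logp(tok, cand)
            nxt[cand] = max(
                ((score + transition_logp(prev, cand) + emit, path + [cand])
                 for prev, (score, path) in best.items()),
                key=lambda item: item[0],
            )
        best = nxt
    return " ".join(max(best.values(), key=lambda item: item[0])[1])


print(viterbi_correct("new yrk tmes"))
```

Under these toy scores, the language-model gain from the known bigrams outweighs the edit penalties, so the misspelled tokens are replaced while a correctly spelled query is left unchanged.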


Published in

WWW '12 Companion: Proceedings of the 21st International Conference on World Wide Web
April 2012
1250 pages
ISBN: 9781450312301
DOI: 10.1145/2187980

Copyright © 2012 Authors

Publisher

Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall acceptance rate: 1,899 of 8,196 submissions (23%)
