poster

An information retrieval approach to spelling suggestion

Published: 26 April 2010

Abstract

In this paper, we present a two-step, language-independent spelling suggestion system. In the first step, candidate suggestions are generated using an Information Retrieval (IR) approach. In the second step, the candidates are re-ranked using a new string similarity measure based on the lengths of the longest common substrings occurring at the beginning and at the end of the words. Re-ranking with the new similarity measure yields a first-suggestion accuracy of 92.3%, 90.0%, and 83.5% on the Dutch, Danish, and Bulgarian datasets, respectively.
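The two steps can be illustrated with a minimal sketch. The abstract specifies neither the retrieval model nor the exact similarity formula, so everything below is an assumption made for illustration: candidates are retrieved by counting shared character bigrams through an inverted index, and the hypothetical `edge_similarity` function scores a pair of words as the length of their common prefix plus the length of their common suffix, divided by the length of the longer word. It is not the authors' implementation.

```python
from collections import Counter, defaultdict


def char_ngrams(word, n=2):
    """Character n-grams of a word padded with boundary markers (assumed scheme)."""
    padded = f"^{word}$"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def build_index(lexicon, n=2):
    """Inverted index mapping each character n-gram to the words containing it."""
    index = defaultdict(set)
    for word in lexicon:
        for gram in set(char_ngrams(word, n)):
            index[gram].add(word)
    return index


def retrieve_candidates(query, index, n=2, top_k=10):
    """Step 1 (assumed): rank lexicon words by the number of n-grams shared with the query."""
    scores = Counter()
    for gram in set(char_ngrams(query, n)):
        for word in index.get(gram, ()):
            scores[word] += 1
    return [word for word, _ in scores.most_common(top_k)]


def common_prefix_len(a, b):
    """Length of the longest common prefix of two strings."""
    count = 0
    for x, y in zip(a, b):
        if x != y:
            break
        count += 1
    return count


def edge_similarity(a, b):
    """Step 2 (assumed formula): (common prefix + common suffix) / longer length."""
    if not a or not b:
        return 0.0
    prefix = common_prefix_len(a, b)
    # Drop the matched prefix first so no character is counted twice.
    suffix = common_prefix_len(a[prefix:][::-1], b[prefix:][::-1])
    return (prefix + suffix) / max(len(a), len(b))


def suggest(query, index, top_k=10):
    """Retrieve candidates, then re-rank them by edge similarity (ties broken alphabetically)."""
    candidates = retrieve_candidates(query, index, top_k=top_k)
    return sorted(candidates, key=lambda w: (-edge_similarity(query, w), w))


if __name__ == "__main__":
    lexicon = ["spelling", "spilling", "selling", "retrieval", "suggestion"]
    index = build_index(lexicon)
    print(suggest("speling", index))
```

In this sketch, the misspelling `speling` shares the prefix `spel` and the suffix `ing` with `spelling`, so that candidate is ranked first. Nothing in the code depends on a particular alphabet or language, which matches the language-independent framing of the abstract.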

Published In

WWW '10: Proceedings of the 19th international conference on World wide web
April 2010
1407 pages
ISBN: 9781605587998
DOI: 10.1145/1772690

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. information retrieval
  2. language independent
  3. spelling suggestion

Qualifiers

  • Poster

Conference

WWW '10
WWW '10: The 19th International World Wide Web Conference
April 26 - 30, 2010
Raleigh, North Carolina, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
