short-paper

Interactive and context-aware tag spell check and correction

Authors:
Francesco Bonchi (Yahoo! Research, Barcelona, Spain)
Ophir Frieder (Georgetown University, Washington, DC, USA)
Franco Maria Nardini (ISTI-CNR, Pisa, Italy)
Fabrizio Silvestri (ISTI-CNR, Pisa, Italy)
Hossein Vahabi (ISTI-CNR, Pisa, Italy)

CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management, October 2012, Pages 1869–1873, https://doi.org/10.1145/2396761.2398534

Published: 29 October 2012

ABSTRACT

Collaborative content creation and annotation create vast repositories of all sorts of media, and user-defined tags play a central role because they are a simple yet powerful tool for organizing, searching, and exploring the available resources. We observe that when a user annotates a resource with a set of tags, those tags are introduced one at a time. Therefore, when the fourth tag is introduced, the knowledge represented by the previous three tags, i.e., the context in which the fourth tag is produced, is available and can be exploited to generate potential corrections of the current tag. This context, together with the "wisdom of the crowd" represented by the co-occurrences of tags across all the resources of the repository, can be exploited to provide interactive tag spell checking and correction. We develop this idea in a framework based on a weighted tag co-occurrence graph and on node relatedness measures defined on weighted neighborhoods. We test our proposal on a dataset from YouTube. The results show that our framework is effective, as it outperforms two important baselines. We also show that it is efficient, thus enabling its use in modern tagging services.
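
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the relatedness measure or how candidates are generated, so this sketch assumes that candidates are known tags within a small Levenshtein distance of the typed tag, and that relatedness is the total co-occurrence weight between a candidate and the context tags. All function names, the `max_distance` parameter, and the toy data are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_graph(resources):
    """Weighted graph: graph[a][b] = number of resources tagged with both a and b."""
    graph = defaultdict(lambda: defaultdict(int))
    for tags in resources:
        for a, b in combinations(sorted(set(tags)), 2):
            graph[a][b] += 1
            graph[b][a] += 1
    # Freeze into plain dicts so later lookups cannot create new nodes.
    return {tag: dict(neighbors) for tag, neighbors in graph.items()}


def edit_distance(s, t):
    """Plain Levenshtein distance (insert, delete, substitute)."""
    prev = list(range(len(t) + 1))
    for i, cs in enumerate(s, start=1):
        curr = [i]
        for j, ct in enumerate(t, start=1):
            cost = 0 if cs == ct else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def relatedness(graph, candidate, context):
    """Assumed measure: sum of edge weights between candidate and context tags."""
    neighbors = graph.get(candidate, {})
    return sum(neighbors.get(tag, 0) for tag in context)


def suggest_correction(graph, context, typed, max_distance=2):
    """Return the known tag closest to `typed` that best fits the context.

    Falls back to `typed` itself when no candidate co-occurs with the context.
    """
    scored = []
    for tag in graph:
        distance = edit_distance(typed, tag)
        if distance <= max_distance:
            score = relatedness(graph, tag, context)
            if score > 0:
                # Prefer higher relatedness, then smaller edit distance.
                scored.append((score, -distance, tag))
    return max(scored)[2] if scored else typed


# Toy repository: each inner list is the tag set of one resource.
resources = [
    ["apple", "iphone", "review"],
    ["apple", "iphone", "unboxing"],
    ["apple", "pie", "recipe"],
]
graph = build_cooccurrence_graph(resources)

# The user has already entered "apple" and "review" and now types "iphnoe".
print(suggest_correction(graph, ["apple", "review"], "iphnoe"))  # iphone
```

Here the context tags "apple" and "review" both co-occur with "iphone" in the toy repository, so "iphone" is the only nearby tag with a positive score. A tag with no nearby, context-related candidate is returned unchanged.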

Index Terms

Interactive and context-aware tag spell check and correction
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information retrieval
    1. Document representation
      1. Content analysis and feature selection

Published in
CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
October 2012
2840 pages
ISBN: 9781450311564
DOI: 10.1145/2396761
General Chair: Xuewen Chen (Wayne State University, USA)
Program Chairs: Guy Lebanon (Georgia Institute of Technology), Haixun Wang (Microsoft Research Asia), Mohammed J. Zaki (Rensselaer Polytechnic Institute)
Copyright © 2012 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
tag co-occurrence graph
tag spell checking and correction
Qualifiers
- short-paper
Acceptance Rates
Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%