skip to main content
10.1145/1772690.1772885acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
poster

Efficient web pages identification for entity resolution

Published: 26 April 2010 Publication History

Abstract

Entity resolution (ER) is a problem that arises in many areas. In most of cases, it represents a task that multiple entities from different sources require to be identified if they refer to the same or different objects because there are not unique identifiers associated with them. In this paper, we propose a model using web pages identification to identify entities and merge those entities refer to one object together. We use a classical name disambiguation problem as case study and examine our model on a subset of digital library records as the first stage of our work. The favorable results indicated that our proposed approach is highly effective.

References

[1]
M. I. Lam and Z. Gong. Web information extraction. Information Acquisition, IEEE International Conference, vol. 27, 2005.
[2]
J. Zhu, G. P. C. Fung and X. F. Zhou. A Term-based Driven Clustering Approach for Name Disambiguation. Proc. Joint. APWeb/WAIM, vol. 6, 2009.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '10: Proceedings of the 19th international conference on World wide web
April 2010
1407 pages
ISBN:9781605587998
DOI:10.1145/1772690

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 April 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. entity resolution
  2. name disambiguation
  3. web pages identification

Qualifiers

  • Poster

Conference

WWW '10
WWW '10: The 19th International World Wide Web Conference
April 26 - 30, 2010
North Carolina, Raleigh, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 20 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2018)A novel multiple layers name disambiguation framework for digital libraries using dynamic clusteringScientometrics10.1007/s11192-017-2611-8114:3(781-794)Online publication date: 1-Mar-2018
  • (2014)Robust hybrid name disambiguation framework for large databasesScientometrics10.1007/s11192-013-1151-098:3(2255-2274)Online publication date: 1-Mar-2014
  • (2011)Efficient name disambiguation in digital librariesProceedings of the 12th international conference on Web-age information management10.5555/2035562.2035612(430-441)Online publication date: 14-Sep-2011
  • (2011)Efficient Name Disambiguation in Digital LibrariesWeb-Age Information Management10.1007/978-3-642-23535-1_37(430-441)Online publication date: 2011

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

EPUB

View this article in ePub.

ePub

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media