research-article

Predicting website correctness from consensus analysis

Authors:
Steven O'Hara

University of Texas San Antonio, One UTSA Circle, San Antonio, TX

University of Texas San Antonio, One UTSA Circle, San Antonio, TX
View Profile

,
Tom Bylander

University of Texas San Antonio, One UTSA Circle, San Antonio, TX

University of Texas San Antonio, One UTSA Circle, San Antonio, TX
View Profile

RACS '12: Proceedings of the 2012 ACM Research in Applied Computation SymposiumOctober 2012Pages 49–54https://doi.org/10.1145/2401603.2401613

Published:23 October 2012Publication History

RACS '12: Proceedings of the 2012 ACM Research in Applied Computation Symposium

Pages 49–54

ABSTRACT

Websites vary in terms of reliability. One could assume that NASA's website will be very accurate for Astronomy questions. Wikipedia is less accurate but is still more accurate than a generic Google search. In this research we ask a large number of "factoid" questions to several different search engines. We collect those responses and determine the correctness of each candidate answer. The answers are grouped by website source, and are compared to other websites to infer website correctness.

References

X. Yin, W. Tan and C. Liu, "FACTO: A Fact Lookup Engine Based on Web Tables," in World Wide Web Conference (WWW), Hyderabad, India, 2011. Google ScholarDigital Library
S. Brin and L. Page, "The Anatomy of a Large-Scale Hypertextual Web Search Engine," Computer Networks and ISDN Systems 30, pp. 107--117, 1988. Google ScholarDigital Library
X. Yin, J. Han and P. S. Yu, "Truth Discovery with Multiple Conflicting Information Providers on the Web," Knowledge Discovery and Data Mining (KDD), 2007. Google ScholarDigital Library
X. L. Dong, L. Berti-Equille and D. Srivastava, "Integrating Conflicting Data: The Role of Source Dependence," Very Large Databases (VLDB), 2009. Google ScholarDigital Library
A. Galland, A. Marian, S. Abiteboul and P. Senellart, "Corroborating Information from Disagreeing Views," Web Search and Data Mining (WSDM), 2010. Google ScholarDigital Library
S. O'Hara and T. Bylander, "Numeric Query Answering on the Web," International Journal on Semantic Web and Information Systems, pp. 1--17, January-March 2011. Google ScholarDigital Library
B. Katz, S. Felshin, D. Yuret, A. Ibrahim, J. Lin, G. Marton, A. J. McFarland and B. Temelkuran, "Omnibase: Uniform Access to Heterogeneous Data for Question Answering," in Proceedings of the 7th International Workshop on Applications of Natural Language to Information Systems, Stockholm, Sweden, 2002. Google ScholarDigital Library
V. I. Levenshtein, "Binary Codes Capable of Correcting Deletions, Insertions and Reversals," Cybernetics and Control Theory, pp. 845--848, 1965.Google Scholar
X. Li and D. Roth, "Learning Question Classifiers: The Role of Semantic Information," in International Conference on Computational Linguistics, Taipei, 2002. Google ScholarDigital Library
J. Ko, L. Si and E. Nyberg, "A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering," in Proceedings of SIGIR, Amsterdam, 2007. Google ScholarDigital Library
C. Kwok, O. Etzioni and D. S. Weld, "Scaling question answering to the web," ACM Transactions on Information Systems, vol. 19, no. 3, pp. 242--262, July 2001. Google ScholarDigital Library
M. Barcala, J. Vilares, M. A. Alonso, J. Grana and m. Vilares, "Tokenization and Proper Noun Recognition for Information Retrieval," Departamento de Computacion, Universidade da Coruna, La Coruna, Spain.Google Scholar
D. Roussinov, W. Fan and J. Robles-Flores, "Beyond keywords: Automated question answering on the web," Communications of the ACM, vol. 51, no. 9, pp. 60--65, September 2008. Google ScholarDigital Library

Index Terms

Predicting website correctness from consensus analysis
1. Information systems
  1. Information retrieval
  2. Information storage systems

Recommendations

Web Searching with Multiple Correct Answers
WIMS '14: Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics (WIMS14)

Most web search engines today are geared towards providing a list of relevant websites, along with snippets of text from each website that are relevant to the user's search text. Some of them may also provide specific answers to the user's question. ...
Read More
Predicting Website Audience Demographics forWeb Advertising Targeting Using Multi-Website Clickstream Data
Intelligent Data Analysis in Granular Computing

Several recent studies have explored the virtues of behavioral targeting and personalization for online advertising. In this paper, we add to this literature by proposing a cost-effective methodology for the prediction of demographic website visitor ...
Read More
Website Valuation: How to calculate the worth of a website?
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
RACS '12: Proceedings of the 2012 ACM Research in Applied Computation Symposium
October 2012
488 pages
ISBN:9781450314923
DOI:10.1145/2401603
General Chairs:
Yookun Cho
Seoul National University, Korea
,
Rex E. Gantenbein
University of Wyoming, USA
,
Tei-Wei Kuo
National Taiwan University, Taiwan
,
Program Chair:
Vahid Tarokh
Harvard University
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 October 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
answer consolidation
intelligent search
question answering
text retrieval
website reliability
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate393of1,581submissions,25%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 66
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Predicting website correctness from consensus analysis

RACS '12: Proceedings of the 2012 ACM Research in Applied Computation Symposium

ABSTRACT

References

Cited By

Index Terms

Recommendations

Web Searching with Multiple Correct Answers

Predicting Website Audience Demographics forWeb Advertising Targeting Using Multi-Website Clickstream Data

Website Valuation: How to calculate the worth of a website?

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Predicting website correctness from consensus analysis

RACS '12: Proceedings of the 2012 ACM Research in Applied Computation Symposium

ABSTRACT

References

Cited By

Index Terms

Recommendations

Web Searching with Multiple Correct Answers

Predicting Website Audience Demographics forWeb Advertising Targeting Using Multi-Website Clickstream Data

Website Valuation: How to calculate the worth of a website?

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media