Retrieving Information from Multiple Sources

Authors:
Anurag Roy, IIEST Shibpur, Kolkata, India
Kripabandhu Ghosh, IIT Kanpur, Kanpur, India
Moumita Basu, UEM Kolkata & IIEST Shibpur, Kolkata, India
Parth Gupta, Amazon, Bengaluru, India
Saptarshi Ghosh, IIT Kharagpur & IIEST Shibpur, Kharagpur, India

WWW '18: Companion Proceedings of the The Web Conference 2018, April 2018, Pages 43–44. https://doi.org/10.1145/3184558.3186920

Published: 23 April 2018

ABSTRACT

An ongoing event is typically discussed on several information sources across the Web. To get a complete picture of the event, it is important to retrieve information from multiple sources. We propose a novel neural-network-based model that integrates the embeddings from multiple sources and thus retrieves information from all the sources jointly, as opposed to combining multiple retrieval results. A key advantage of the proposed model is that it needs no document-aligned comparable data. Experiments on posts related to a particular event from three different sources (Facebook, Twitter and WhatsApp) demonstrate the efficacy of the proposed model.
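
The abstract does not describe the model's architecture, so the following is only a minimal sketch of what retrieving "jointly" can look like once posts from every source are represented in one embedding space. It is not the authors' model: the shared vectors in `SHARED_VECTORS` are hand-made toy values (in the paper such a space would be learned by the neural model), and the names `embed`, `cosine` and `joint_retrieve` are hypothetical helpers introduced for illustration. The point is only that a single ranked list is produced over all sources, rather than one list per source that is merged afterwards.

```python
import math

# Hypothetical toy word vectors, ASSUMED to already live in one shared space.
# The paper learns the integration with a neural model; these are hand-made.
SHARED_VECTORS = {
    "flood": [0.9, 0.1, 0.0],
    "rain": [0.8, 0.2, 0.1],
    "rescue": [0.1, 0.9, 0.1],
    "help": [0.2, 0.8, 0.0],
    "cricket": [0.0, 0.1, 0.9],
}


def embed(text):
    """Average the vectors of known words; return None if no word is known."""
    vecs = [SHARED_VECTORS[w] for w in text.lower().split() if w in SHARED_VECTORS]
    if not vecs:
        return None
    return [sum(col) / len(vecs) for col in zip(*vecs)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def joint_retrieve(query, posts_by_source, top_k=3):
    """Rank posts from every source in ONE list by similarity to the query."""
    q = embed(query)
    if q is None:
        return []
    scored = []
    for source, posts in posts_by_source.items():
        for post in posts:
            v = embed(post)
            if v is not None:
                scored.append((cosine(q, v), source, post))
    scored.sort(key=lambda t: t[0], reverse=True)
    return scored[:top_k]


# Made-up example posts, one small collection per source.
posts = {
    "facebook": ["flood rain in the city", "cricket match today"],
    "twitter": ["need rescue help"],
    "whatsapp": ["rain flood warning"],
}
results = joint_retrieve("flood", posts, top_k=2)
for score, source, post in results:
    print(f"{score:.3f}  {source:9s} {post}")
```

In this toy setup the top results can come from different sources in the same ranked list, which is the behaviour that score-level fusion of separate per-source rankings would only approximate.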

Index Terms

Retrieving Information from Multiple Sources
1. Information systems
  1. Information retrieval

Published in
WWW '18: Companion Proceedings of the The Web Conference 2018
April 2018
2023 pages
ISBN: 9781450356404
General Chairs: Pierre-Antoine Champin (Université Claude Bernard Lyon 1, France), Fabien Gandon (Inria, Université Côte d'Azur, CNRS, I3S, France), Lionel Médini (Université Claude Bernard Lyon 1, CNRS, LIRIS, France)
Program Chairs: Mounia Lalmas (Spotify, UK), Panagiotis G. Ipeirotis (New York University, USA)
Copyright © 2018 ACM
Publisher
International World Wide Web Conferences Steering Committee
Republic and Canton of Geneva, Switzerland
Author Tags
deep learning
multi-view retrieval
word embedding
Qualifiers
- poster
Acceptance Rates
Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%