skip to main content
research-article

Improving non-English web searching (iNEWS07)

Published: 01 December 2007 Publication History

Abstract

This workshop attempted to promote the discussion and the research on non-English Web searching. Most search engines were first built for English. They do not take full account of inflectional semantics nor, for example, diacritics or the use of capitals. Our main aim was to discuss the additional problems faced in non-English Web queries and to suggest techniques to improve the response of searching systems. Papers related to Arabic, Basque, Farsi (Persian), Greek, Spanish, Swedish, Hindi, Bengali and other south asian languages were accepted. Conclusions were that search engines would be more effective if they took more account of the properties of individual languages, and that there is a need for more studies of real user behaviour in practical situations.

References

[1]
Efthimiadis E., N. Malevris, A. Kousaridas, A. Lepeniotou and N. Loutas (2007), How do Search Engines handle Greek Queries? In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 9--13.
[2]
Hammarström H. (2007), A Fine-Grained Model for Language Identification In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 14--20.
[3]
Macdonald C., C. Lioma and I. Ounis (2007), Terrier takes on the non-English Web, In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 21--28.
[4]
Tzekou P., S. Stamou, N. Zotos, E. Kozanidis (2007), Querying the Greek Web in Greeklish, In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 29--38.
[5]
Ahmed F., A. Nürnberger (2007), N-Grams Conflation Approach for Arabic Text, In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 39--46.
[6]
Leturia I., A. Gurrutxaga, N. Areta, I. Alegria, A. Ezeiza (2007), EusBila, a search service designed for the agglutinative nature of Basque, In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 47--54.
[7]
De Luca E. W., M. Eul, A. Nürnberger (2007), Multilingual Query-Reformulation using RDF-OWL EuroWordNet, In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 55--61.
[8]
Qasemizadeh B. (2007), Farsi e-Orthography: an Example of e-Orthography Concept, In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 62--64.
[9]
Ribadas F., E. Lloves-Calvino, V. M. Darriba (2007), Thesaurus topic assignment using hierarchical text categorization In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 65--70.
[10]
Singh A. K., H. Surana, K. Gali (2007), More Accurate Fuzzy Text Search for Languages Using Abugida Scripts, In: F. Lazarinis, J. Vilares, J. Tait (eds) Improving Non-English Web Searching (iNEWS07) SIGIR07 Workshop, pp. 71--78.

Cited By

View all
  • (2021)Using Linguistically Non-local Punjabi Queries to Search the Global WebSoft Computing: Theories and Applications10.1007/978-981-16-1696-9_26(279-287)Online publication date: 27-Jun-2021
  • (2012)Attentes versus réalitéExpectations versus Reality. Search Engine Features needed for Web Research in 2008Questions de communication10.4000/questionsdecommunication.719(49-74)Online publication date: 21-Mar-2012
  • (2012)Morphological query expansion and language-filtering words for improving Basque web retrievalLanguage Resources and Evaluation10.1007/s10579-012-9208-x47:2(425-448)Online publication date: 4-Dec-2012
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGIR Forum
ACM SIGIR Forum  Volume 41, Issue 2
December 2007
120 pages
ISSN:0163-5840
DOI:10.1145/1328964
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 December 2007
Published in SIGIR Volume 41, Issue 2

Check for updates

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Using Linguistically Non-local Punjabi Queries to Search the Global WebSoft Computing: Theories and Applications10.1007/978-981-16-1696-9_26(279-287)Online publication date: 27-Jun-2021
  • (2012)Attentes versus réalitéExpectations versus Reality. Search Engine Features needed for Web Research in 2008Questions de communication10.4000/questionsdecommunication.719(49-74)Online publication date: 21-Mar-2012
  • (2012)Morphological query expansion and language-filtering words for improving Basque web retrievalLanguage Resources and Evaluation10.1007/s10579-012-9208-x47:2(425-448)Online publication date: 4-Dec-2012
  • (2010)BibliographyAn Introduction to Search Engines and Web Navigation10.1002/9780470874233.biblio(424-461)Online publication date: 4-Aug-2010
  • (2009)Current research issues and trends in non-English Web searchingInformation Retrieval10.1007/s10791-009-9093-012:3(230-250)Online publication date: 1-Jun-2009
  • (2009)Mixed monolingual homepage finding in 34 languages: the role of language script and search domainInformation Retrieval10.1007/s10791-008-9082-812:3(324-351)Online publication date: 1-Jun-2009

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media