skip to main content
research-article

The FIRE 2008 Evaluation Exercise

Published:01 September 2010Publication History
Skip Abstract Section

Abstract

The aim of the Forum for Information Retrieval Evaluation (FIRE) is to create an evaluation framework in the spirit of TREC (Text REtrieval Conference), CLEF (Cross-Language Evaluation Forum), and NTCIR (NII Test Collection for IR Systems), for Indian language Information Retrieval. The first evaluation exercise conducted by FIRE was completed in 2008. This article describes the test collections used at FIRE 2008, summarizes the approaches adopted by various participants, discusses the limitations of the datasets, and outlines the tasks planned for the next iteration of FIRE.

References

  1. }}Amati, G. and Rijsbergen, C. V. 2002. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans. Inform. Syst. 20, 4, 357--389. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. }}Braschler, M. and Peters, C. 2004. Cross-language evaluation forum: Objectives, results, achievements. Inform. Retriev. 7, 1/2, 7--31. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. }}Dolamic, L. and Savoy, J. 2008. UniNE at FIRE 2008: Hindi, Bengali, and Marathi IR. In Working Notes from FIRE 2008 (FIRE’08).Google ScholarGoogle Scholar
  4. }}Harman, D. 1995. Overview of the second text retrieval conference (TREC-2). Inform. Process. Manage. 31, 3, 271--289. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. }}Hiemstra, D. 2001. Using language models for information retrieval. Ph.D. thesis, University of Twente.Google ScholarGoogle Scholar
  6. }}Kando, N., Mitamura, T., and Sakai, T. 2008. Introduction to the NTCIR-6 Special Issue. ACM Trans. Asian Lang. Inform. Process. 7, 2, 1--3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. }}Majumder, P., Mitra, M., Parui, S., Kole, G., Mitra, P., and Datta, K. 2007. YASS: Yet another suffix stripper. ACM Trans. Inform. Syst. 25, 4, 18. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. }}McNamee, P. 2008. N-gram Tokenization for Indian Language Text Retrieval. In Working Notes from FIRE 2008 (FIRE’08).Google ScholarGoogle Scholar
  9. }}Mitra, M. 2008. Overview of FIRE 2008. In Working Notes from FIRE 2008 (FIRE’08).Google ScholarGoogle Scholar
  10. }}Nakagawa, H., Mori, T., and Kando, N., Eds. 2005. ACM Trans. Asian Lang. Inform. Process. 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. }}Oard, D. W. 2003. The surprise language exercises. ACM Trans. Asian Lang. Inform. Process. 2, 2, 79--84. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. }}Ounis, I., Amati, G., Plachouras, V., He, B., Macdonald, C., and Lioma, C. 2006. Terrier: A high performance and scalable information retrieval platform. In Proceedings of the ACM Workshop on Open Source Information Retrieval (OSIR’06).Google ScholarGoogle Scholar
  13. }}Padariya, N., Chinnakotla, M., Nagesh, A., and Damani, O. P. 2008. Evaluation of Hindi to English, Marathi to English, and English to Hindi CLIR at FIRE 2008. In Working Notes from FIRE 2008 (FIRE’08).Google ScholarGoogle Scholar
  14. }}Paik, J. H. and Parui, S. K. 2008. A simple stemmer for inflectional languages. In Working Notes from FIRE 2008 (FIRE’08).Google ScholarGoogle Scholar
  15. }}Pal, D., Majumder, P., Mitra, M., Mitra, S., and Sen, A. 2008. Issues in searching for Indian language Web content. In Proceedings of the 2nd ACM Workshop on Improving Non English Web Searching (iNEWS’08). 93--96. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. }}Peters, C. 2010. Personal communication.Google ScholarGoogle Scholar
  17. }}Peters, C., Jijkoun, V., Mandl, T., Müller, H., Oard, D., Peñas , A., Petras, V., and Santos, D., Eds. 2008. In Advances in Multilingual and Multimodal Information Retrieval, the 8th Workshop of the Cross-Language Evaluation Forum (CLEF’07). Lecture Notes in Computer Science. Springer-Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. }}Pingali, P., Jagarlamudi, J., and Varma, V. 2006. Webkhoj: Indian language IR from multiple character encodings. In Proceedings of the International World Wide Web Conference (WWW’06). Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. }}Ponte, J. and Croft, W. 1998. A language modeling approach to information retrieval. In Proceedings of the 19th Annual International ACM Conference on Research and Development in Information Retrieval (SIGIR’98). Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. }}Ramanathan, A. and Rao, D. 2003. A lightweight stemmer for Hindi. In Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL’03).Google ScholarGoogle Scholar
  21. }}Rao, P. R. and Sobha, L. 2008. AU-KBC FIRE2008 submission - Cross lingual information retrieval track: Tamil-English. In Working Notes from FIRE 2008 (FIRE’08).Google ScholarGoogle Scholar
  22. }}Sakai, T., Kando, N., Lin, C.-J., Mitamura, T., Shima, H., Ji, D., Chen, K.-H., and Nyberg, E. 2008. Overview of the NTCIR-7 ACLIA IR4QA Task. In Proceedings of the NII Test Collection for Information Retrieval Workshop (NTCIR’08). 77--114.Google ScholarGoogle Scholar
  23. }}Savoy, J. 2004. Data fusion for effective European monolingual information retrieval. In Proceedings of the Cross-Language Information Retrieval and Evaluation Workshop of Cross-Language Evaluation Forum (CLEF’00). 233--244. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. }}Sethuramalingam, S. and Varma, V. 2008. IIIT Hyderabad’s CLIR experiments for FIRE-2008. In Working Notes from FIRE 2008 (FIRE’08).Google ScholarGoogle Scholar
  25. }}Sparck Jones, K. and van Rijsbergen, C. 1976. Information retrieval test collections. J. Doc. 32, 59--75.Google ScholarGoogle ScholarCross RefCross Ref
  26. }}Sparck Jones, K., Walker, S., and Robertson, S. 2000. A probabilistic model of information retrieval: Development and comparative experiments. Inform. Process. Manage. 36, 6, 779--808. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. }}Surve, M., Singh, S., and Bhattacharyya, P. 2004. Agro-Explorer: A meaning based multilingual search engine. In Proceedings of the International Conference on Digital Libraries (ICDL’04).Google ScholarGoogle Scholar
  28. }}Udupa, R., Jagarlamudi, J., and Saravanan, K. 2008. Hindi-English cross-language information retrieval. In Working Notes from FIRE 2008 (FIRE’08).Google ScholarGoogle Scholar
  29. }}Udupa, R., Saravanan, K., Bakalov, A., and Bhole, A. 2009. “They are out there, if you know where to look”: Mining transliterations of OOV query terms for cross-language information retrieval. In Proceedings of the European Conference on Information Retrieval (ECIR09). 437--448. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. }}Voorhees, E. and Harman, D. 1997. Overview of the 5th Text Retrieval Conference. In Proceedings of the 5th Text Retrieval Conference (TREC5). 1--28.Google ScholarGoogle ScholarCross RefCross Ref
  31. }}Voorhees, E. M. and Harman, D. K., Eds. 2005. TREC Experiment and Evaluation in Information Retrieval. MIT Press, Cambridge, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. }}Zobel, J. 1998. How reliable are the results of large-scale information retrieval experiments? In Proceedings of the 19th Annual International ACM Conference on Research and Development in Information Retrieval (SIGIR’98). Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. The FIRE 2008 Evaluation Exercise

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Asian Language Information Processing
        ACM Transactions on Asian Language Information Processing  Volume 9, Issue 3
        September 2010
        82 pages
        ISSN:1530-0226
        EISSN:1558-3430
        DOI:10.1145/1838745
        Issue’s Table of Contents

        Copyright © 2010 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 September 2010
        • Revised: 1 March 2010
        • Accepted: 1 March 2010
        • Received: 1 September 2009
        Published in talip Volume 9, Issue 3

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader