skip to main content
10.1145/2645791.2645822acmotherconferencesArticle/Chapter ViewAbstractPublication PagespciConference Proceedingsconference-collections
research-article

A Preliminary Investigation into the Automatic EuroVoc Indexing of Greek Documents

Authors Info & Claims
Published:02 October 2014Publication History

ABSTRACT

In this paper, we present an automatic indexing experiment of greek documents. In particular, we describe an attempt to use JEX, the JRC-developed indexing tool, in order to assign EuroVoc descriptors to a collection of Greek open data. We discuss the results and limitations of this approach and we propose solutions which take into account the particularities of the Greek language.

References

  1. EuroVoc 2012. Multilingual thesaurus of the European Union. http://eurovoc.europa.eu/Google ScholarGoogle Scholar
  2. Fellbaum C. (ed.) 1998. WordNet: An Electronic Lexical Database. MIT Press.Google ScholarGoogle Scholar
  3. Geodata.gov.gr 2012. Web service for Greek open geospatial data http://www.geodata.gov.gr/geodataGoogle ScholarGoogle Scholar
  4. JEX-JRC EuroVoc Indexer 2014. http://langtech.jrc.ec.europa.eu/Eurovoc.htmlGoogle ScholarGoogle Scholar
  5. Karanikolas, N. and Skourlas, C. 2006. Text Classification: Forming Candidate Key-Phrases from Existing Shorter Ones. FACTA UNIVERSITATIS Series: Electronics and Energetics, ISSN 0353-3670, 19, 3.Google ScholarGoogle Scholar
  6. Lancaster, F.W. 1998. Indexing and abstracting in theory and practice. Library Association Publishing, London.Google ScholarGoogle Scholar
  7. Pouliquen, B., Steinberger, R. and Degeurnel, O. 2008. Story tracking: Linking similar news over time and across languages. In Proceedings of the 2nd workshop "Multi-source Multilingual Information Extraction and Summarization (MMIES'2008)" held at CoLing'2008 (Manchester, Aug.23, 2008). Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Pouliquen, B., Steinberger, R. and Ignat, C. 2003. Automatic annotation of multilingual text collections with a conceptual thesaurus. In Proceedings of the workshop "Ontologies and Information Extraction" - at the summer school "The Semantic Web and Language Technology -- Its Potential and Practicalities (EUROLAN 2003)" (Bucharest, July 28 -- Aug. 8, 2003).Google ScholarGoogle Scholar
  9. Stamou S., Oflazer K., Pala K., Christoudoulakis D., Cristea D., Tufiş D., Koeva S., Totkov G., Dutoit D., Grigoriadou M. 2002. Balkanet: A Multilingual Semantic Network for the Balkan Languages. In Proceedings of the International Wordnet Conference, January 21-25, Mysore, India, 12--14.Google ScholarGoogle Scholar
  10. Steinberger, R., Ebrahim, M. and Turchi, M. 2012. JRC EuroVoc Indexer JEX -- A freely available multi-label categorisation tool. In Proceedings of the 8th Int. Conference LREC'2012, Istanbul, 798--805.Google ScholarGoogle Scholar
  11. Steinberger, R., Ehrmann, M., Pajzs, J., Ebrahim, M., Steinberger, J. and Turchi, M. 2013. Multilingual media monitoring and text analysis -- Challenges for highly inflected languages. In Proceedings of the 16th Int. Conference TSD 2013, Pilsen, Springer -- Verlag, 22--33.Google ScholarGoogle Scholar
  12. Tsoumakas, G. and Katakis, I. 2007. Multi-label classification: An overview, Int. J. Data Warehousing and Mining, 3, 1--13.Google ScholarGoogle ScholarCross RefCross Ref
  13. Vossen P. (ed.) 1998. EuroWordNet: A Multilingual Database with Lexical Semantic Networks. Kluwer Academic Publishers Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A Preliminary Investigation into the Automatic EuroVoc Indexing of Greek Documents

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      PCI '14: Proceedings of the 18th Panhellenic Conference on Informatics
      October 2014
      355 pages
      ISBN:9781450328975
      DOI:10.1145/2645791
      • General Chairs:
      • Katsikas Sokratis,
      • Hatzopoulos Michael,
      • Apostolopoulos Theodoros,
      • Anagnostopoulos Dimosthenis,
      • Program Chairs:
      • Carayiannis Elias,
      • Varvarigou Theodora,
      • Nikolaidou Mara

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 2 October 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      PCI '14 Paper Acceptance Rate51of102submissions,50%Overall Acceptance Rate190of390submissions,49%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader