skip to main content
10.1145/2645791.2645848acmotherconferencesArticle/Chapter ViewAbstractPublication PagespciConference Proceedingsconference-collections
research-article

Toward Improving Classification of Real World Biomedical Articles

Authors Info & Claims
Published:02 October 2014Publication History

ABSTRACT

In this work, we propose a method to improve performance in biomedical article classification. We use Naïve Bayes and Maximum Entropy classifiers to classify real world biomedical articles. We describe a technique based on chi-square measure to discard irrelevant information from the data and to identify the most relevant keywords to the classification task. To improve classification performance, we used two merging operators, Max and Harmonic Mean proposed by Jongwoo et al (2010) to combine results of the two classifiers. The results show that the Maximum Entropy classifier shows the better performance at 500 top relevant keywords. It is also shown that combining the results of the two classifiers we can improve classification performance of real world biomedical data.

References

  1. Fuhr, N., Hartmanna, S., Lustig, G., Schwantner, M., and Tzeras, K. 1991. Air/X -- A rule-based multi-stage indexing system for lage subject fields. In Proceedings of RIAO'91, 606--623.Google ScholarGoogle Scholar
  2. Galathiya, A. S., Ganatra, A. P., and Bhensdadia, K. C. 2012. An Improved decision tree induction algorithm, with feature selection, cross validation, model complexity & reduced error pruning, IJSCIT march 2012.Google ScholarGoogle Scholar
  3. Feldman, R., Sanger, J. 2006. The Text Mining Handbook: advanced approaches in analyzing unstructured data. Cambridge University Press. Google ScholarGoogle Scholar
  4. Krallinger, M., et al. 2009 The BioCreative II. 5 challenge overview. In: Proc. The BioCreative II. 5 Workshop 2009 on Digital Annotations, pp. 7--9.Google ScholarGoogle Scholar
  5. Fragos, K., Maistros, I. 2006. A Goodness of Fit Test Approach in Information Retrieval. In journal of "Information Retrieval", Springer Netherlands, Volume 9, Number 3, p 331--342. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. McCallum A. and Nigam, K. 1998. A comparison of event models for naive Bayes text classification. In AAAI/ICML-98 Workshop on Learning for Text Categorization.Google ScholarGoogle Scholar
  7. Fragos, K., Maistros, I., Skourlas, C. 2005. A X2-Weighted Maximum Entropy Model for Text Classification. In Proceedings of 2nd International Conference On N.L.U.C.S, Miami, Florida.Google ScholarGoogle Scholar

Index Terms

  1. Toward Improving Classification of Real World Biomedical Articles

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      PCI '14: Proceedings of the 18th Panhellenic Conference on Informatics
      October 2014
      355 pages
      ISBN:9781450328975
      DOI:10.1145/2645791
      • General Chairs:
      • Katsikas Sokratis,
      • Hatzopoulos Michael,
      • Apostolopoulos Theodoros,
      • Anagnostopoulos Dimosthenis,
      • Program Chairs:
      • Carayiannis Elias,
      • Varvarigou Theodora,
      • Nikolaidou Mara

      Copyright © 2014 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 2 October 2014

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      PCI '14 Paper Acceptance Rate51of102submissions,50%Overall Acceptance Rate190of390submissions,49%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader