DOI: 10.1145/1284420.1284469
Article

Exclusion-inclusion based text categorization of biomedical articles

Published: 28 August 2007

Abstract

In this paper, we propose a new approach to categorizing biomedical articles that rests on two original principles. First, we combine linguistic, structural and metric descriptors to build patterns using data mining techniques. Second, we take into account the importance of absent patterns to the categorization task by using an exclusion-inclusion method. To avoid a crisp, all-or-nothing distinction between the absence and the presence of a pattern, the exclusion-inclusion method uses two regret measures to quantify the interest of a weak pattern relative to the other classes and relative to the other patterns of the same class. The global decision generalizes the local patterns, first by using patterns that exclude classes, then according to the regret ratios. Experiments show the effectiveness of the approach.
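
The two-stage decision described in the abstract (exclude classes first, then rank what remains) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract does not define the patterns or the two regret measures, so the sketch makes loud assumptions: each "pattern" is a single descriptor, a class is excluded when the document contains a descriptor that is frequent in some class but never seen in that one, and the `score` function is a hypothetical stand-in for the regret ratios. The names `train`, `classify`, and `min_support` are likewise invented for illustration.

```python
from collections import defaultdict


def train(docs):
    """Estimate each descriptor's relative frequency (support) per class.

    docs: iterable of (set_of_descriptors, class_label) pairs.
    """
    class_sizes = defaultdict(int)
    counts = defaultdict(lambda: defaultdict(int))
    for descriptors, label in docs:
        class_sizes[label] += 1
        for d in descriptors:
            counts[d][label] += 1
    support = {
        d: {c: counts[d][c] / n for c, n in class_sizes.items()}
        for d in counts
    }
    return support, sorted(class_sizes)


def classify(descriptors, support, classes, min_support=0.5):
    # Ignore descriptors never seen in training; sort for determinism.
    known = sorted(d for d in descriptors if d in support)

    # Stage 1 (exclusion, assumed rule): class c is excluded when the
    # document contains a descriptor that is frequent in some class
    # but never occurs in c.
    excluded = {
        c for c in classes
        if any(
            support[d][c] == 0.0 and max(support[d].values()) >= min_support
            for d in known
        )
    }
    # Assumption: if every class would be excluded, keep them all.
    candidates = [c for c in classes if c not in excluded] or list(classes)

    # Stage 2 (inclusion): hypothetical graded score standing in for the
    # paper's regret ratios. It rewards descriptors that are better
    # supported in c than in the best competing class.
    def score(c):
        return sum(
            support[d][c]
            - max((support[d][o] for o in classes if o != c), default=0.0)
            for d in known
        )

    return max(candidates, key=score)


# Toy training set (invented for illustration only).
docs = [
    ({"gene", "protein"}, "bio"),
    ({"gene", "cell"}, "bio"),
    ({"trial", "patient"}, "clinical"),
    ({"patient", "dose"}, "clinical"),
]
support, classes = train(docs)
label = classify({"gene", "protein"}, support, classes)
```

On this toy data, a document containing `gene` and `protein` keeps only the `bio` class after the exclusion stage, so the graded score is used only to break ties among surviving classes. The graded score matters more when exclusion is inconclusive, which is the situation the abstract's regret measures are meant to handle.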

Cited By

  • (2023) BIOMEDICAL TEXT DOCUMENT CLASSIFICATION. international journal of engineering technology and management sciences, 7:3 (788-792). DOI: 10.46647/ijetms.2023.v07i03.121. Online publication date: 2023
  • (2008) Towards Compromising Structural and Bag of Words Approaches for Clustering Heterogeneous XML Documents. Proceedings of the 2008 The Second International Conference on Advanced Engineering Computing and Applications in Sciences (69-72). DOI: 10.1109/ADVCOMP.2008.28. Online publication date: 29-Sep-2008

    Published In

    DocEng '07: Proceedings of the 2007 ACM symposium on Document engineering
    August 2007
    236 pages
    ISBN:9781595937766
    DOI:10.1145/1284420
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. categorization
    2. characterisation
    3. text mining

    Qualifiers

    • Article

    Conference

    DocEng07
    DocEng07: ACM Symposium on Document Engineering
    August 28 - 31, 2007
    Winnipeg, Manitoba, Canada

    Acceptance Rates

    Overall Acceptance Rate 194 of 564 submissions, 34%

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (last 12 months): 2
    • Downloads (last 6 weeks): 0
    Reflects downloads up to 14 Feb 2025
