skip to main content
10.1145/2739480.2754650acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Simpler is Better: a Novel Genetic Algorithm to Induce Compact Multi-label Chain Classifiers

Published:11 July 2015Publication History

ABSTRACT

Multi-label classification (MLC) is the task of assigning multiple class labels to an object based on the features that describe the object. One of the most effective MLC methods is known as Classifier Chains (CC). This approach consists in training q binary classifiers linked in a chain, y1 → y2 → ... → yq, with each responsible for classifying a specific label in {l1, l2, ..., lq}. The chaining mechanism allows each individual classifier to incorporate the predictions of the previous ones as additional information at classification time. Thus, possible correlations among labels can be automatically exploited. Nevertheless, CC suffers from two important drawbacks: (i) the label ordering is decided at random, although it usually has a strong effect on predictive accuracy; (ii) all labels are inserted into the chain, although some of them might carry irrelevant information to discriminate the others. In this paper we tackle both problems at once, by proposing a novel genetic algorithm capable of searching for a single optimized label ordering, while at the same time taking into consideration the utilization of partial chains. Experiments on benchmark datasets demonstrate that our approach is able to produce models that are both simpler and more accurate.

References

  1. M. P. Basgalupp, R. C. Barros, A. C. P. L. F. de Carvalho, A. A. Freitas, and D. D. Ruiz. Legal-tree: A lexicographic multi-objective genetic algorithm for decision tree induction. In Proc. of the 2009 ACM Symposium on Applied Computing, SAC'09, pages 1085--1090, March 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Bull, E. Bernad o-Mansilla, and J. Holmes. Learning classier systems in data mining: An introduction. In Learning Classier Systems in Data Mining, volume 125 of Studies in Computational Intelligence, pages 1--15. Springer, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  3. P. N. da Silva, E. C. Gon calves, A. Plastino, and A. A. Freitas. Distinct chains for different instances: An effective strategy for multi-label classier chains. In Proc. of the 2014 ECML/PKDD Conf., LCNS 8725, pages 453--468, September 2014.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. Domingos. The role of occam's razor in knowledge discovery. Data Mining and Knowledge Discovery, 3(4):409--425, December 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. E. Eiben and J. E. Smith. Introduction to Evolutionary Computing. SpringerVerlag, 2003. Google ScholarGoogle ScholarCross RefCross Ref
  6. A. A. Freitas. Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. A. Freitas. A critical review of multi-objective optimization in data mining: A position paper. SIGKDD Explor. Newsl., 6(2):77--86, December 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. A. Freitas. A review of evolutionary algorithms for data mining. In Data Mining and Knowledge Discovery Handbook, pages 371--400. Springer US, July 2010.Google ScholarGoogle Scholar
  9. A. A. Freitas. Comprehensible classication models: A position paper. SIGKDD Explor. Newsl., 15(1):1--10,June 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. E. C. Gon calves. A human-centered approach for mining hybrid-dimensional association rules. In Proc. of the 2014 Conf. on Information Fusion, FUSION'14, pages 1--8, July 2014.Google ScholarGoogle Scholar
  11. E. C. Gon calves, A. Plastino, and A. A. Freitas. A genetic algorithm for optimizing the label ordering in multi-label classier chains. In Proc. of the 25th Int'l Conf. on Tools with Articial Intelligence, ICTAI'13, pages 469--476, November 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. I. Gon calves and S. Silva. Balancing learning and overtting in genetic programming with interleaved sampling of training data. In Proc. of EuroGP 2013, pages 73--84, April 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y. Guo and S. Gu. Multi-label classication using conditional dependency networks. In Proc. of the 22nd Int'l Joint Conf. on Articial Intelligence, IJCAI'11, pages 1300--1305, July 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. Witten. The weka data mining software: an update. ACM SIGKDD Explor.,11(1):10--18, November 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. N. Japkowicz and M. Shah. Evaluating Learning Algorithms: A Classication Perspective. Cambridge University Press, New York, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. Kumar, S. Vembu, A. K. Menon, and C. Elkan. Beam search algorithms for multilabel learning. Machine Learning, 92(1):65--89, July 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. P. Larra~naga, C. M. H. Kuijpers, R. H. Murga, I. Inza, and S. Dizdarevic. Genetic algorithms for the travelling salesman problem: A review of representations and operators. Articial Intelligence Review, 13(2):129--170, April 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. E. Levy, O. E. David, and N. S. Netanyahu. Genetic algorithms and deep learning for automatic painter classication. In Proc. of the 2014 Genetic and Evolutionary Computation Conference, GECCO'14, pages 1143--1150, July 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. Machado, J. Tavares, F. B. Pereira, and E. Costa. Vehicle routing problem: Doing it the evolutionary way. In Proc. of the 2002 Genetic and Evolutionary Computation Conference, GECCO'02, pages 690--696, July 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. G. Madjarov, D. Kocev, D. Gjorgjevikj, and S. D zeroski. An extensive experimental comparison of methods for multi-label learning. Pattern Recognition, 45(9):3084--3104, September 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Read, L. Martino, and D. Luengo. Efficient monte carlo optimization for multi-label classier chains. In Proc. of the 38th Int'l Conf. on Acoustics, Speech, and Signal Processing, ICASSP'13, May 2013.Google ScholarGoogle Scholar
  22. J. Read, B. Pfahringer, G. Holmes, and E. Frank. Classier chains for multi-label classication. Machine Learning, 85(3):333--359, December 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. A. E. Romero and L. M. de Campos. A probabilistic methodology for multilabel classication. Intelligent Data Analysis, 18(5):911--926, September 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. L. E. Sucar, C. Bielza, E. F. Morales, P. H. Leal, J. H. Zaragoza, and P. Larra~naga. Multi-label classication with bayesian network-based chain classiers. Pattern Recogn. Lett., 41:14--22, May 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. G. Tsoumakas, I. Katakis, and I. Vlahavas. Mining multi-label data. In Data Mining and Knowledge Discovery Handbook, pages 667--685. Springer, 2010.Google ScholarGoogle Scholar
  26. G. Tsoumakas, E. S. Xious, J. Vilcek, and I. P. Vlahavas. Mulan: A java library for multi-label learning. Journal of Machine Learning Research, 12:2411--2414, July 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. J. Zaki and W. Meira Jr. Data Mining and Analysis: Fundamental Concepts and Algorithms. Cambridge University Press, New York, 2014. Google ScholarGoogle ScholarCross RefCross Ref
  28. M.-L. Zhang and K. Zhang. Multi-label learning by exploiting label dependency. In Proc. of the 16th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining, KDD'10, pages 999--1008, July 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. M.-L. Zhang and Z.-H. Zhoug. A review on multi-label learning algorithms. IEEE Trans. on Knowledge and Data Engineering, 26(8):1819--1837, August 2014.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Simpler is Better: a Novel Genetic Algorithm to Induce Compact Multi-label Chain Classifiers

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '15: Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation
      July 2015
      1496 pages
      ISBN:9781450334723
      DOI:10.1145/2739480

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 11 July 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      GECCO '15 Paper Acceptance Rate182of505submissions,36%Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader