DOI: 10.1145/1389095.1389229

Pareto analysis for the selection of classifier ensembles

Published: 12 July 2008

Abstract

The overproduce-and-choose strategy generates a large initial pool of candidate classifiers and then tests different candidate ensembles in order to select the best-performing solution. The ensemble's error rate, the ensemble size and diversity measures are the search criteria most frequently employed to guide this selection. By applying the error rate, we address the main objective in Pattern Recognition and Machine Learning, which is to find high-performance predictors. In terms of ensemble size, the hope is to increase the recognition rate while minimizing the number of classifiers, thereby meeting both the performance and the low-ensemble-size requirements. Finally, ensembles can be more accurate than individual classifiers only when the member classifiers are diverse. In this paper we apply two Pareto front spread quality measures to analyze the relationship between the three main search criteria used in the overproduce-and-choose strategy. Experimental results demonstrate that combining ensemble size with diversity does not produce a conflicting multi-objective optimization problem, and that the generalization error rate cannot be decreased by combining this pair of search criteria. However, when the error rate is combined with either diversity or ensemble size, these measures are conflicting objective functions and the resulting solutions perform considerably better.
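To make the three search criteria concrete, the following is a minimal Python sketch of the overproduce-and-choose loop described above. It is not the paper's experimental setup: the bagged decision-tree pool, the majority-vote combiner, the pairwise-disagreement diversity measure and the exhaustive enumeration of small candidate ensembles (in place of a multi-objective evolutionary search) are all illustrative assumptions, and the helper names are hypothetical.

# Minimal sketch of overproduce-and-choose with three search criteria
# (error rate, ensemble size, diversity); illustrative assumptions only.
from itertools import combinations

import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split

# Overproduction phase: a pool of bagged decision trees.
X, y = make_classification(n_samples=600, n_features=20, random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.4, random_state=0)
pool = BaggingClassifier(n_estimators=10, random_state=0).fit(X_tr, y_tr).estimators_

# Predictions of every pool member on the selection (validation) set.
pool_preds = np.stack([clf.predict(X_val) for clf in pool])   # shape: (n_pool, n_val)

def majority_vote_error(idx):
    """Error rate of the majority-vote combination of the members in idx."""
    votes = pool_preds[list(idx)]
    majority = np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)
    return float(np.mean(majority != y_val))

def mean_disagreement(idx):
    """Average pairwise disagreement: fraction of samples two members label differently."""
    pairs = list(combinations(idx, 2))
    return float(np.mean([np.mean(pool_preds[i] != pool_preds[j]) for i, j in pairs]))

def pareto_front(points):
    """Indices of non-dominated points, all objectives taken as 'smaller is better'."""
    pts = np.asarray(points, dtype=float)
    front = []
    for i, p in enumerate(pts):
        dominated = np.any(np.all(pts <= p, axis=1) & np.any(pts < p, axis=1))
        if not dominated:
            front.append(i)
    return front

# Choice phase: evaluate candidate ensembles on the three search criteria.
candidates, objectives = [], []
for size in (3, 5, 7):
    for idx in combinations(range(len(pool)), size):
        candidates.append(idx)
        # Error and size are minimized; diversity is maximized, so it is negated.
        objectives.append((majority_vote_error(idx), size, -mean_disagreement(idx)))

for i in pareto_front(objectives):
    err, size, neg_div = objectives[i]
    print(f"members={candidates[i]}  size={size}  error={err:.3f}  disagreement={-neg_div:.3f}")

Replacing the exhaustive enumeration with a multi-objective evolutionary search and plotting the retained (error rate, diversity) or (error rate, ensemble size) points would yield the kind of Pareto fronts whose spread the paper analyzes.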




    Published In

    GECCO '08: Proceedings of the 10th annual conference on Genetic and evolutionary computation
    July 2008, 1814 pages
    ISBN: 9781605581309
    DOI: 10.1145/1389095
    Conference Chair: Conor Ryan
    Editor: Maarten Keijzer

    Publisher

    Association for Computing Machinery, New York, NY, United States


    Author Tags

    1. Pareto analysis
    2. classifier ensembles
    3. diversity measures
    4. ensemble selection

    Qualifiers

    • Research-article

    Conference

    GECCO08

    Acceptance Rates

    Overall Acceptance Rate 1,669 of 4,410 submissions, 38%


    Cited By

    • (2023) A survey of evolutionary algorithms for supervised ensemble learning. The Knowledge Engineering Review, 38. DOI: 10.1017/S0269888923000024. Online publication date: 1-Mar-2023.
    • (2019) A Multi-objective Meta-Analytic Method for Customer Churn Prediction. Business and Consumer Analytics: New Ideas, pp. 781-813. DOI: 10.1007/978-3-030-06222-4_20. Online publication date: 31-May-2019.
    • (2018) Variable Hidden Neuron Ensemble for Mass Classification in Digital Mammograms [Application Notes]. IEEE Computational Intelligence Magazine, 8(1), pp. 68-76. DOI: 10.1109/MCI.2012.2228598. Online publication date: 17-Dec-2018.
    • (2018) Stability investigation of multi-objective heuristic ensemble classifiers. International Journal of Machine Learning and Cybernetics. DOI: 10.1007/s13042-018-0789-6. Online publication date: 25-Jan-2018.
    • (2016) Ensemble classifiers with improved overfitting. 2016 1st Conference on Swarm Intelligence and Evolutionary Computation (CSIEC), pp. 93-97. DOI: 10.1109/CSIEC.2016.7482130. Online publication date: Mar-2016.
    • (2013) Overproduce-and-select: The grim reality. 2013 IEEE Symposium on Computational Intelligence and Ensemble Learning (CIEL), pp. 52-59. DOI: 10.1109/CIEL.2013.6613140. Online publication date: Apr-2013.
    • (2012) A multilayered ensemble architecture for the classification of masses in digital mammograms. Proceedings of the 25th Australasian Joint Conference on Advances in Artificial Intelligence, pp. 85-94. DOI: 10.1007/978-3-642-35101-3_8. Online publication date: 4-Dec-2012.
    • (2010) Iterative Boolean combination of classifiers in the ROC space. Pattern Recognition, 43(8), pp. 2732-2752. DOI: 10.1016/j.patcog.2010.03.006. Online publication date: 1-Aug-2010.
    • (2009) Multi-objective evolution of the Pareto optimal set of neural network classifier ensembles. 2009 International Conference on Machine Learning and Cybernetics, pp. 74-79. DOI: 10.1109/ICMLC.2009.5212485. Online publication date: Jul-2009.
