skip to main content
10.1145/1068009.1068326acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
Article

DXCS: an XCS system for distributed data mining

Published: 25 June 2005 Publication History

Abstract

XCS is a flexible system for data mining due to its ability to deal with environmental changes, learn online with little prior knowledge and evolve accurate and maximally general classifiers. In this paper, we propose DXCS which is an XCS-based distributed data mining system. A MDL metric is proposed to quantify and analyze network load, and study the balance between network load and classifier accuracy in the presence of noise. The DXCS system shows promising results.

References

[1]
J. Bacardit and M. V. Butz. Data Mining in Learning Classifier Systems: Comparing XCS with GAssist, June 2004. IlliGAL Report No. 2004030.]]
[2]
J. Bacardit and J. M. Garrell. Bloat control and generalization pressure using the minimum description length principle for a pittsburgh approach learning classifier system. In Sixth International Workshop on Learning Classifier Systems (IWLCS-2003), Chicago, July 2003.]]
[3]
E. Bernadó, X. Llorà, and J. M. Garrell. XCS and GALE: a Comparative Study of Two Learning Classifier Systems with Six Other Learning Algorithms on Classification Tasks. In Proceedings of the 4th International Workshop on Learning Classifier Systems (IWLCS-2001), pages 337--341, 2001. Short version published in Genetic and Evolutionary Compution Conference (GECCO2001).]]
[4]
E. Bernado-Mansilla and J. Garrell-Guiu. Accuracy-based learning classifier systems: Models, analysis and applications to classification tasks. Evolutionary Computation, 11(3):209--238, 2003.]]
[5]
L. Breiman. Bagging predictors. Machine Learning, 24(2):123--140, 1996.]]
[6]
L. Breiman. Pasting small votes for classification in large databases and on-line. Machine Learning, 36(1--2):85--103, 1999.]]
[7]
L. Bull, J. Sha'Aban, A. Tomlinson, P. Addison, and B. Heydecker. Towards distributed adaptive control for road traffic junction signals using learning classifier systems. In L. Bull, editor, Applications of Learning Classifier Systems, pages 276--299. Springer-Verlag, 2004.]]
[8]
M. V. Butz. Rule-Based Evolutionary Online Learning Systems: Learning Bounds, Classification, and Prediction. PhD thesis, University of Illinois at Urbana-Champaign, 2004.]]
[9]
M. V. Butz, D. E. Goldberg, and K. Tharakunnel. Analysis and improvement of fitness exploitation in XCS: Bounding models, tournament selection, and bilateral accuracy. Evolutionary Computation, 11(3):239--277, 2003.]]
[10]
M. V. Butz and S. W. Wilson. An algorithmic description of XCS. In IWLCS '00: Revised Papers from the Third International Workshop on Advances in Learning Classifier Systems, pages 253--272. Springer-Verlag, 2001.]]
[11]
P. K. Chan and S. J. Stolfo. Toward parallel and distributed learning by meta-learning. In Working Notes AAAI Work. Knowledge Discovery in Databases, pages 227--240, Washington, DC, 1993.]]
[12]
P. W. Dixon, D. Corne, and M. J. Oates. A preliminary investigation of modified XCS as a generic data mining tool. In IWLCS '01: Revised Papers from the 4th International Workshop on Advances in Learning Classifier Systems, pages 133--150. Springer-Verlag, 2002.]]
[13]
U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy. From data mining to knowledge discovery: An overview. In Advances in Knowledge Discovery and Data Mining, pages 1--36. The MIT Press, 1996.]]
[14]
A. A. Freitas. A survey of evolutionary algorithms for data mining and knowledge discovery. Advances in evolutionary computing: theory and applications, pages 819--845, 2003.]]
[15]
C. Giannella, R. Bhargava, and H. Kargupta. Multi-agent systems and distributed data mining. In Cooperative Information Agents VIII: 8th International Workshop, CIA 2004, pages 1--15, Erfurt, Germany, 2004.]]
[16]
D. E. Goldberg. Genetic Algorithms in Search, Optimization, and Machine Learning. Addision-Wesley Publishing Company, INC., 1989.]]
[17]
C. Guerra-Salcedo and D. Whitley. Genetic approach to feature selection for ensemble creation. In GECCO'99, pages 236--243, 1999.]]
[18]
Y. Guo, S. Rueger, J. Sutiwaraphun, and J. Forbes-Millott. Meta-learning for parallel data mining. In Proceedings of the Seventh Parallel Computing Worksop, 1997.]]
[19]
Y. Guo and J. Sutiwaraphun. Distributed classification with knowledge probing. In Advances in Distributed and Parallel Knowledge Discovery, pages 115--132. AAAI Press/ The MIT Press, 2000.]]
[20]
C. Jones, J. Hall, and J. Hale. Secure distributed database mining: Principles of design. In Advances in Distributed and Parallel Knowledge Discovery, pages 277--294. The MIT Press, 2000.]]
[21]
K. Jong, J. Mary, A. Cornuejols, E. Marchiori, and M. Sebag. Ensemble feature ranking. In Proceeding ECML-PKDD'04, Pisa, Italy, 2004. IEEE.]]
[22]
L. I. Kuncheva. Combining Pattern Classifiers: Methods and Algorithms. Wiley, 2004.]]
[23]
J. Naisbitt. Megatrends : Ten New Directions Transforming Our Lives. Warner Books; Rei edition, 1988.]]
[24]
A. L. Prodromidis, P. K. Chan, and S. J. Stolfo. Meta-learning in distributed data mining systems: Issues and approaches. In Advances in Distributed and Parallel Knowledge Discovery, pages 81--114. The MIT Press, 2000.]]
[25]
S. W. Wilson. Classifier fitness based on accuracy. Evolutionary Computation, 3(2):149--175, 1995.]]
[26]
S. W. Wilson. Generalization in the XCS classifier system. In J. R. Koza, W. Banzhaf, K. Chellapilla, K. Deb, M. Dorigo, D. B. Fogel, M. H. Garzon, D. E. Goldberg, H. Iba, and R. Riolo, editors, Genetic Programming 1998: Proceedings of the Third Annual Conference, pages 665--674, University of Wisconsin, Madison, Wisconsin, USA, 22-25 1998. Morgan Kaufmann.]]

Cited By

View all

Index Terms

  1. DXCS: an XCS system for distributed data mining

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GECCO '05: Proceedings of the 7th annual conference on Genetic and evolutionary computation
    June 2005
    2272 pages
    ISBN:1595930108
    DOI:10.1145/1068009
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 June 2005

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. MDL
    2. XCS
    3. distributed data mining
    4. learning classifier system

    Qualifiers

    • Article

    Conference

    GECCO05
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 15 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2013)Large-scale data mining using genetics-based machine learningWiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery10.1002/widm.10783:1(37-61)Online publication date: 1-Jan-2013
    • (2012)A Novel Data Mining Method in Distributed NetworkSoftware Engineering and Knowledge Engineering: Theory and Practice10.1007/978-3-642-03718-4_15(115-123)Online publication date: 15-Jan-2012
    • (2012)Genetics-Based Machine LearningHandbook of Natural Computing10.1007/978-3-540-92910-9_30(937-986)Online publication date: 2012
    • (2011)Applying XCS on time variant problem: Separates thinking from doing2011 Third World Congress on Nature and Biologically Inspired Computing10.1109/NaBIC.2011.6089616(348-352)Online publication date: Oct-2011
    • (2011)Distributed data mining for e-businessInformation Technology and Management10.1007/s10799-011-0091-812:2(67-79)Online publication date: 1-Jun-2011
    • (2010)Self-adaptation of parameters in a learning classifier system ensemble machineInternational Journal of Applied Mathematics and Computer Science10.2478/v10006-010-0012-820:1(157-174)Online publication date: 1-Mar-2010
    • (2010)Data mining in ditributed data environment2010 International Conference on Machine Learning and Cybernetics10.1109/ICMLC.2010.5581024(421-426)Online publication date: Jul-2010
    • (2010)A multiple population XCS: Evolving condition-action rules based on feature space partitionsIEEE Congress on Evolutionary Computation10.1109/CEC.2010.5586521(1-8)Online publication date: Jul-2010
    • (2009)Learning classifier systemsJournal of Artificial Evolution and Applications10.5555/1644490.16444912009(1-25)Online publication date: 1-Jan-2009
    • (2009)A self-organized, distributed, and adaptive rule-based induction systemIEEE Transactions on Neural Networks10.1109/TNN.2008.200833420:3(446-459)Online publication date: 1-Mar-2009
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media