| Towards tight bounds for rule learning |
| Full text |
Pdf
(154 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 69
archive
Proceedings of the twenty-first international conference on Machine learning
table of contents
Banff, Alberta, Canada
Page: 90
Year of Publication: 2004
ISBN:1-58113-828-5
|
|
Authors
|
|
Ulrich Rückert
|
Technische Universität München, Garching b. München, Germany
|
|
Stefan Kramer
|
Technische Universität München, Garching b. München, Germany
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 16, Citation Count: 3
|
|
|
ABSTRACT
While there is a lot of empirical evidence showing that traditional rule learning approaches work well in practice, it is nearly impossible to derive analytical results about their predictive accuracy. In this paper, we investigate rule-learning from a theoretical perspective. We show that the application of McAllester's PAC-Bayesian bound to rule learning yields a practical learning algorithm, which is based on ensembles of weighted rule sets. Experiments with the resulting learning algorithm show not only that it is competitive with state-of-the-art rule learners, but also that its error rate can often be bounded tightly. In fact, the bound turns out to be tighter than one of the "best" bounds for a practical learning scheme known so far (the Set Covering Machine). Finally, we prove that the bound can be further improved by allowing the learner to abstain from uncertain predictions.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Blake, C., & Merz, C. (1998). UCI repository of machine learning databases.
|
| |
2
|
|
| |
3
|
William W. Cohen , Yoram Singer, A simple, fast, and effective rule learner, Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence, p.335-342, July 18-22, 1999, Orlando, Florida, United States
|
| |
4
|
Cohen, W. W. (1995). Fast effective rule induction. Proc. 12th International Conf. on Machine Learning (pp. 115--123). Morgan Kaufmann.
|
| |
5
|
|
| |
6
|
Mostefa Golea , Peter L. Bartlett , Wee Sun Lee , Llew Mason, Generalization in decision trees and DNF: does size matter?, Proceedings of the 1997 conference on Advances in neural information processing systems 10, p.259-265, July 1998, Denver, Colorado, United States
|
| |
7
|
Hoos, H. H. (1998). Stochastic local search -- methods, models, applications. Doctoral dissertation, TU Darmstadt.
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
| |
11
|
Marchand, M., Shah, M., Shawe-Taylor, J., & Sokolova, M. (2003). The set covering machine with data-dependent half-spaces. Proc. 20th International Conf. on Machine Learning (pp. 520--527). Morgan Kaufmann.
|
| |
12
|
|
 |
13
|
|
| |
14
|
|
| |
15
|
Rückert, U., & Kramer, S. (2003). Stochastic local search in k-term DNF learning. Proc. 20th International Conf. on Machine Learning (pp. 648--655). AAAI Press.
|
| |
16
|
Sokolova, M., Marchand, M., Japkowicz, N., & Shawe-Taylor, J. (2003). The decision list machine. Advances in Neural Information Processing Systems 15 (pp. 921--928). MIT-Press, Cambridge, MA, USA.
|
| |
17
|
|
| |
18
|
|
|