skip to main content
10.1145/1390334.1390442acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

trNon-greedy active learning for text categorization using convex ansductive experimental design

Published:20 July 2008Publication History

ABSTRACT

In this paper we propose a non-greedy active learning method for text categorization using least-squares support vector machines (LSSVM). Our work is based on transductive experimental design (TED), an active learning formulation that effectively explores the information of unlabeled data. Despite its appealing properties, the optimization problem is however NP-hard and thus--like most of other active learning methods--a greedy sequential strategy to select one data example after another was suggested to find a suboptimum. In this paper we formulate the problem into a continuous optimization problem and prove its convexity, meaning that a set of data examples can be selected with a guarantee of global optimum. We also develop an iterative algorithm to efficiently solve the optimization problem, which turns out to be very easy-to-implement. Our text categorization experiments on two text corpora empirically demonstrated that the new active learning algorithm outperforms the sequential greedy algorithm, and is promising for active text categorization applications.

References

  1. A. C. Atkinson and A. N. Donev. Optimum experiment designs. Oxford Statistical Science Series. Oxford University Press, 1992.Google ScholarGoogle Scholar
  2. O. Chapelle. Active learning for Parzen window classifier. In Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, pages 49--56, 2005.Google ScholarGoogle Scholar
  3. D. Cohn and Z. Ghahramani. Active learning with statistical models. Journal of Arti¯cial Intelligence Research, 4:129--145, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Donoho. For most large underdetermined systems of linear equations, the minimal l1-norm solution is also the sparsest solution. Communications on Pure and Applied Mathematics, 59(6), 2006.Google ScholarGoogle Scholar
  5. Y. Freund, H. S. Seung, E. Shamir, and N. Tishby. Selective sampling using the query by committee algorithm. Machine Learning, 28(2-3):133--168, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Guestrin, A. Krause, and A. Singh. Near-optimal sensor placements in gaussian processes. In Proc. of the International Conference on Machine Learning (ICML), 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. X. He, W. Min, D. Cai, and K. Zhou. Laplacian optimal design for image retrieval. In ACM SIGIR Conference, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. C. H. Hoi, R. Jin, J. Zhu, and M. R. Lyu. Batch mode active learning and its application to medical image classi¯cation. In International Conference on Machine Learning (ICML), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. D. Lewis, Y. Yang, T. Rose, and F. Li. RCV1: A new benchmark collection for text categorization research. Journal of Machine Learning Research, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. MacKay. Information-based objective functions for active data selection. Neural Computation, 4(4):590--604, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. MacKay. Information-based objective functions for active data selection. Neural Computation, 4(4):590--604, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Schein and L. Ungar. Optimality for active learning of logistic regression classi¯ers. Technical Report Technical Report MS-CIS-04-07, The University of Pennsylvania, Department of Computer and Information Science, 2004.Google ScholarGoogle Scholar
  13. G. Schohn and D. Cohn. Less is more: Active learning with support vector machines. In International Conference on Machine Learning, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Suykens and J. Vandewalle. Least squares support vector machine classifiers. Neural Processing Letters, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Tibshirani. Regression shrinkage and selection via the lasso. J. Royal. Statist. Soc B, 58(1), 1996.Google ScholarGoogle Scholar
  16. S. Tong and D. Koller. Support vector machine active learning with applications to text classification. Journal of Machine Learning Research, 2, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K. Yu, J. Bi, and V. Tresp. Active learning via transductive experimental design. In International Conference on Machine Learning (ICML), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Zhang and Y. Yang. Robustness of regularized linear classifcation methods in text categorization. In The 26th Annual International SIGIR Conference (SIGIR'99), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. T. Zhang and F. J. Oles. Text categorization based on regularized linear classi¯cation methods. Information Retrieval, (4):5--31, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. W. V. Zhang, X. He, B. Rey, and R. Jones. Query rewritting using active learning for sponsored search. In ACM SIGIR Conference, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. trNon-greedy active learning for text categorization using convex ansductive experimental design

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
      July 2008
      934 pages
      ISBN:9781605581644
      DOI:10.1145/1390334

      Copyright © 2008 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 July 2008

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader