ABSTRACT
No abstract available.
- W. Lam and C. Y. Ho. Using a generalized instance set for automatic text categorization, In Proc. of SIGIR'98, pp.81--89, ACM, 1998. Google ScholarDigital Library
- Y. Yang and X. Liu. A re-examination of text categorization, In Proc. of SIGIR'99, ACM, 1999. Google ScholarDigital Library
- Y. Yang, J. Zhang and B. Kisiel. A Scalability Analysis of Classifiers in Text Categorization, In Proc. of SIGIR'03, pp.96--103, ACM, 2003. Google ScholarDigital Library
- D. V. Khmelev and W. J. Teahan. A Repetition Based Measure for Verification of Text Collections and for Text Categorization, In Proc. of SIGIR'03, pp. 104--110, ACM, 2003. Google ScholarDigital Library
Index Terms
- On redundancy of training corpus for text categorization: a perspective of geometry
Recommendations
Using the Web as corpus for self-training text categorization
AbstractMost current methods for automatic text categorization are based on supervised learning techniques and, therefore, they face the problem of requiring a great number of training instances to construct an accurate classifier. In order to tackle this ...
Combining coregularization and consensus-based self-training for multilingual text categorization
SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrievalWe investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the framework of multiview learning, where different languages correspond to ...
Semi-supervised text categorization: Exploiting unlabeled data using ensemble learning algorithms
Text categorization is one of the fundamental tasks in text mining. Classical supervised methods need lot of labeled data to train a classifier. Since assigning labels to the large amount of data is very costly and time consuming, it is useful to use ...
Comments