DOI: 10.1145/1835449.1835643
poster

Short text classification in twitter to improve information filtering

Published: 19 July 2010

Abstract

In microblogging services such as Twitter, users may become overwhelmed by the raw data. One solution to this problem is the classification of short text messages. Because short texts do not provide sufficient word occurrences, traditional classification methods such as "Bag-Of-Words" have limitations. To address this problem, we propose to use a small set of domain-specific features extracted from the author's profile and text. The proposed approach effectively classifies the text into a predefined set of generic classes such as News, Events, Opinions, Deals, and Private Messages.
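
The idea in the abstract, replacing a sparse bag-of-words vector with a few hand-picked features of the tweet and its author, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the five features in `extract_features`, the toy tweets, and the choice of a small Bernoulli naive Bayes classifier are hypothetical and are not taken from the paper, which does not list its features or classifier in the abstract.

```python
import math
import re
from collections import defaultdict


def extract_features(text, author):
    """Map a tweet and its author name to five binary features.

    The cues below are illustrative assumptions, not the paper's feature set.
    """
    return (
        int(text.startswith("@")),  # message addressed to a user
        int(bool(re.search(r"[$€£%]", text))),  # currency or percent sign
        int(bool(re.search(r"\b(today|tonight|tomorrow|\d{1,2}(am|pm))\b",
                           text, re.I))),  # time phrase
        int(bool(re.search(r"\b(love|hate|great|awful|think)\b",
                           text, re.I))),  # opinion word
        int("news" in author.lower()),  # cue from the author's profile name
    )


class BernoulliNB:
    """Tiny Bernoulli naive Bayes with Laplace smoothing over binary features."""

    def fit(self, rows, labels):
        self.n = len(rows[0])
        self.total = len(rows)
        self.class_count = defaultdict(int)
        self.ones = defaultdict(lambda: [0] * self.n)
        for row, label in zip(rows, labels):
            self.class_count[label] += 1
            for i, value in enumerate(row):
                self.ones[label][i] += value
        return self

    def predict(self, row):
        def log_posterior(label):
            count = self.class_count[label]
            score = math.log(count / self.total)  # class prior
            for i, value in enumerate(row):
                p = (self.ones[label][i] + 1) / (count + 2)  # smoothed P(feature=1)
                score += math.log(p if value else 1 - p)
            return score

        return max(self.class_count, key=log_posterior)


# Invented toy examples, one per class named in the abstract.
TOY_TWEETS = [
    ("@bob see you at lunch?", "alice", "Private Messages"),
    ("50% off all shoes, only $20", "shopdeals", "Deals"),
    ("Concert in the park tonight at 8pm", "cityguide", "Events"),
    ("I think this phone is great", "carol", "Opinions"),
    ("Parliament passes new budget", "worldnews", "News"),
]

rows = [extract_features(text, author) for text, author, _ in TOY_TWEETS]
labels = [label for _, _, label in TOY_TWEETS]
model = BernoulliNB().fit(rows, labels)

print(model.predict(extract_features("@dave are you coming?", "erin")))
```

With only five binary features per message, the classifier never needs to see the same words twice, which is the property that makes this kind of representation attractive for very short texts. A real system would need far more training data and a validated feature set.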

      Published In

      SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
      July 2010
      944 pages
      ISBN:9781450301534
      DOI:10.1145/1835449
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. classification
      2. feature selection
      3. short text
      4. twitter

      Qualifiers

      • Poster

      Conference

      SIGIR '10
      Acceptance Rates

      SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;
      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months): 42
      • Downloads (Last 6 weeks): 3
      Reflects downloads up to 02 Mar 2025

      Citations

      Cited By

      • (2024) Fuzzy SVM With Mahalanobis Distance for Situational Awareness-Based Recognition of Public Health Emergencies. International Journal of Fuzzy System Applications 13:1 (1-21). DOI: 10.4018/IJFSA.342117. Online publication date: 15-May-2024
      • (2024) Generalized News Event Discovery via Dynamic Augmentation and Entropy Optimization. Proceedings of the 32nd ACM International Conference on Multimedia (10018-10026). DOI: 10.1145/3664647.3681157. Online publication date: 28-Oct-2024
      • (2024) Automatic Construction of Expiration Time Expression Dataset from Retweets. Companion Proceedings of the ACM Web Conference 2024 (545-548). DOI: 10.1145/3589335.3651471. Online publication date: 13-May-2024
      • (2024) Prompt-Learning for Short Text Classification. IEEE Transactions on Knowledge and Data Engineering 36:10 (5328-5339). DOI: 10.1109/TKDE.2023.3332787. Online publication date: Oct-2024
      • (2024) An Industrial Short Text Classification Method Based on Large Language Model and Knowledge Base. 2024 International Joint Conference on Neural Networks (IJCNN) (1-7). DOI: 10.1109/IJCNN60899.2024.10650933. Online publication date: 30-Jun-2024
      • (2024) Short text classification with Soft Knowledgeable Prompt-tuning. Expert Systems with Applications 246 (123248). DOI: 10.1016/j.eswa.2024.123248. Online publication date: Jul-2024
      • (2024) Measuring flight-destination similarity. Expert Systems with Applications: An International Journal 238:PA. DOI: 10.1016/j.eswa.2023.121802. Online publication date: 15-Mar-2024
      • (2024) Text Categorization: Conceptual View. Text Mining (81-102). DOI: 10.1007/978-3-031-75976-5_5. Online publication date: 8-Oct-2024
      • (2024) Sentiment Urgency Emotion Detection for Business Intelligence. Proceedings of International Conference on Intelligent Vision and Computing (ICIVC 2023) (163-171). DOI: 10.1007/978-3-031-71388-0_13. Online publication date: 29-Oct-2024
      • (2024) Spatial Analysis of Social Media’s Proxies for Human Emotion and Cognition. Wisdom, Well-Being, Win-Win (175-185). DOI: 10.1007/978-3-031-57860-1_13. Online publication date: 10-Apr-2024
