skip to main content
10.1145/1458527.1458534acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Automatic scoring of online discussion posts

Published: 30 October 2008 Publication History

Abstract

Online discussions forums, known as forums for short, are conversational social cyberspaces constituting rich repositories of content and an important source of collaborative knowledge. However, most of this knowledge is buried inside the forum infrastructure and its extraction is both complex and difficult. The ability to automatically rate postings in online discussion forums, based on the value of their contribution, enhances the ability of users to find knowledge within this content. Several key online discussion forums have utilized collaborative intelligence to rate the value of postings made by users. However, a large percentage of posts go unattended and hence lack appropriate rating.
In this paper, we focus on automatic rating of postings in online discussion forums. A set of features derived from the posting content and the threaded discussion structure are generated for each posting. These features are grouped into five categories, namely (i) relevance, (ii) originality, (iii) forum-specific features, (iv) surface features, and (v) posting-component features. Using a non-linear SVM classifier, the value of each posting is categorized into one of three levels High, Medium, or Low. This rating represents a seed value for each posting that is leveraged in filtering forum content. Experimental results have shown promising performance on forum data.

References

[1]
Borgs, C., Chayes, J., Mahdian, M., and Saberi, A., 2004. Exploring the Community Structure of Newsgroups, In Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Seattle, WA, USA, August 22-25, 2004) KDD'04, ACM Press, New York, NY, 783--787
[2]
Chang, C., and Lin, C. 2001. LibSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
[3]
Dikli, S., 2006. An Overview of Automatic Scoring of Essays. The Journal of Technology, Learning, and Assessment, Vol 5(1) August 2006, 3--35.
[4]
Fiore, A., Teirman, S., and Smith, M., 2002. Observed behavior and perceived value of authors in usenet newsgroups: bridging the gap. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Minneapolis, MN, USA, April 20-25, 2002). CHI'02. ACM Press, New York, NY, 323--330.
[5]
Fisher, D., Smith, M., and Welser, H., 2006. "You Are Who You Talk To: Detecting Roles in Usenet Newsgroups". In Proceedings of the 39th Hawaii International Conference on System Sciences (Kauai, HI, USA, January 4-7, 2006) Track 3, HICSS-39, IEEE Press, New Jersey, NJ, 59b.
[6]
Fortuna, B., Rodrigues, E., and Milic-Frayling, N. 2007. In Proceedings of the Conference on Information and Knowledge Management (Lisboan, Portugal, November 6-8, 2007). CIKM'07. ACM Press, New York, NY, 585--588
[7]
Glance, N., Hurst, M., Nigam, K., Siegler, M., Stockton, R., and Tomokiyo, T. 2005. Deriving Marketing Intelligence from Online Discussion. In Proceedings of the SIGKDD International Conference on Knowledge Discovery and Data Mining (Chicargo, IL, USA, August 21-24, 2005). KDD'05. ACM Press, New York, NY, 419--428
[8]
Gómez, V., Kaltenbrunner, A., and López, V. 2008. Statistical Analysis of the Social Network and Discussion Threads in Slashdot. In Proceedings of the 17th International World Wide Web Conference (Beijing, China, April 21-25, 2008). WWW2008. ACM Press, New York, NY, 645--654
[9]
Lampe, C. and Resnick, P. 2004. Bowman, M., Debray, S. K., and Peterson, L. L. 1993. Slash(dot) and Burn: Distributed Moderation in a Large Online Conversation Space. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Vienna, Austria, April 24-29, 2004). CHI'04. ACM Press, New York, NY, 543--550
[10]
Lui, A., Li, S., and Choy, S. 2007. An Evaluation of Automatic Text Categorization in Online Discussion Analysis. In Proceedings of the Seventh IEEE International Conference on Advanced Learning Technologies (Niigata, Japan, July 18-20, 2007) ICALT 2007, IEEE Computer Society Press, New Jersey, NJ, 205--209
[11]
Weimer, M., Gurevych, I., and Mühlhäuser, M. 2007. Automatically Assessing the Post Quality in Online Discussions on Software. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (Prague, Czech Republic, June 23-30, 2007). ACL2007 Volume P07-2, 125--128.
[12]
Wu, Q., Burges, C. Svore, K, and Gao, J, 2008, Ranking, Boosting, and Model Adaptation, Technical Report, MSR-TR-2008-109, Microsoft Corporation, Redmond, WA, August 2008.
[13]
Wu, Z., and Li, C., 2007. Topic Detection in Online Discussion using Non-Negative Matrix Factorization. In Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology- Workshops (Silicon Valley, CA, USA, November 2-5, 2007) WI-IATW 2007, IEEE Computer Society Press, New Jersey, NJ, 272-275

Cited By

View all
  • (2024)A Sustainable Collaborative Talent Management Through Collaborative Intelligence Mindset Theory: A Systematic ReviewSage Open10.1177/2158244024126185114:2Online publication date: 17-Jun-2024
  • (2024)Misleading information in crises: exploring content-specific indicators on Twitter from a user perspectiveBehaviour & Information Technology10.1080/0144929X.2024.2373166(1-34)Online publication date: 8-Jul-2024
  • (2024)Digital Resilience in Dealing with Misinformation on Social Media during COVID-19Information Systems Frontiers10.1007/s10796-022-10347-526:2(477-499)Online publication date: 1-Apr-2024
  • Show More Cited By

Index Terms

  1. Automatic scoring of online discussion posts

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WICOW '08: Proceedings of the 2nd ACM workshop on Information credibility on the web
      October 2008
      100 pages
      ISBN:9781605582597
      DOI:10.1145/1458527
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 October 2008

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. content filtering
      2. forums
      3. online communities

      Qualifiers

      • Research-article

      Conference

      CIKM08
      CIKM08: Conference on Information and Knowledge Management
      October 30, 2008
      California, Napa Valley, USA

      Acceptance Rates

      Overall Acceptance Rate 9 of 19 submissions, 47%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)21
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 17 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Sustainable Collaborative Talent Management Through Collaborative Intelligence Mindset Theory: A Systematic ReviewSage Open10.1177/2158244024126185114:2Online publication date: 17-Jun-2024
      • (2024)Misleading information in crises: exploring content-specific indicators on Twitter from a user perspectiveBehaviour & Information Technology10.1080/0144929X.2024.2373166(1-34)Online publication date: 8-Jul-2024
      • (2024)Digital Resilience in Dealing with Misinformation on Social Media during COVID-19Information Systems Frontiers10.1007/s10796-022-10347-526:2(477-499)Online publication date: 1-Apr-2024
      • (2023)A Machine Learning and Deep Learning Based Approach to Detect Inaccurate Health Information in Bengali Language2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)10.1109/ECCE57851.2023.10101612(01-06)Online publication date: 23-Feb-2023
      • (2023)Countering Fake News Technically – Detection and Countermeasure Approaches to Support UsersTruth and Fake in the Post-Factual Digital Age10.1007/978-3-658-40406-2_7(131-147)Online publication date: 25-May-2023
      • (2022)Exploring Resource-Sharing Behaviors for Finding Relevant Health Resources: Analysis of an Online Ovarian Cancer CommunityJMIR Cancer10.2196/331108:2(e33110)Online publication date: 12-Apr-2022
      • (2021)The Relationship between Renewable Energy and Economic Growth in a Time of Covid-19: A Machine Learning Experiment on the Brazilian EconomySustainability10.3390/su1303128513:3(1285)Online publication date: 26-Jan-2021
      • (2020)Content Noise Detection Model Using Deep Learning in Web ForumsSustainability10.3390/su1212507412:12(5074)Online publication date: 22-Jun-2020
      • (2020)Credibility Assessment of User Generated health information of the Bengali language in microblogging sites employing NLP techniques2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)10.1109/WIIAT50758.2020.00129(837-844)Online publication date: Dec-2020
      • (2019)Spelling Errors and Shouting Capitalization Lead to Additive Penalties to Trustworthiness of Online Health Information: Randomized Experiment With Laypersons (Preprint)Journal of Medical Internet Research10.2196/15171Online publication date: 26-Jun-2019
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media