research-article

Automatic scoring of online discussion posts

Authors:

Nayer Wanas,

Motaz El-Saban,

Heba Ashour,

Waleed Ammar

WICOW '08: Proceedings of the 2nd ACM workshop on Information credibility on the web

Pages 19 - 26

https://doi.org/10.1145/1458527.1458534

Published: 30 October 2008

Abstract

Online discussion forums, known as forums for short, are conversational social cyberspaces that constitute rich repositories of content and an important source of collaborative knowledge. However, most of this knowledge is buried inside the forum infrastructure, and extracting it is both complex and difficult. Automatically rating postings in online discussion forums, based on the value of their contribution, makes it easier for users to find knowledge within this content. Several key online discussion forums have used collaborative intelligence to rate the value of postings made by users. However, a large percentage of posts go unattended and hence lack an appropriate rating.

In this paper, we focus on automatic rating of postings in online discussion forums. A set of features derived from the posting content and the threaded discussion structure is generated for each posting. These features are grouped into five categories, namely (i) relevance, (ii) originality, (iii) forum-specific features, (iv) surface features, and (v) posting-component features. Using a non-linear SVM classifier, the value of each posting is categorized into one of three levels: High, Medium, or Low. This rating represents a seed value for each posting that is leveraged in filtering forum content. Experimental results have shown promising performance on forum data.
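
To make the five feature categories concrete, the following is a minimal sketch of how one feature per category might be computed for a single posting. The abstract does not define the individual features, so every proxy here (title overlap for relevance, dissimilarity to earlier posts for originality, a reply flag, an uppercase ratio, and link detection) is an assumption chosen for illustration, as are the names `Post`, `tokens`, `jaccard`, and `toy_features`. The classification step is not shown; the resulting vector is the kind of input a non-linear SVM would take.

```python
import re
from dataclasses import dataclass


@dataclass
class Post:
    text: str
    is_reply: bool  # hypothetical thread-structure signal


def tokens(text: str) -> set[str]:
    """Lowercased word set; a deliberately crude tokenizer."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Overlap between two word sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def toy_features(post: Post, thread_title: str, earlier: list[Post]) -> list[float]:
    """Return one illustrative value per feature category (all proxies are assumptions)."""
    words = tokens(post.text)
    # (i) relevance: word overlap with the thread title
    relevance = jaccard(words, tokens(thread_title))
    # (ii) originality: 1 minus the highest overlap with any earlier post
    originality = 1.0 - max((jaccard(words, tokens(p.text)) for p in earlier), default=0.0)
    # (iii) forum-specific: whether the post is a reply within the thread
    forum_specific = 1.0 if post.is_reply else 0.0
    # (iv) surface: fraction of letters that are uppercase
    letters = [c for c in post.text if c.isalpha()]
    surface = sum(c.isupper() for c in letters) / len(letters) if letters else 0.0
    # (v) posting component: whether the post contains a web link
    component = 1.0 if re.search(r"https?://", post.text) else 0.0
    return [relevance, originality, forum_specific, surface, component]


# Usage: build the feature vector for a reply in a two-post thread.
first = Post("Which kernel should I use for an SVM?", is_reply=False)
reply = Post("Try the RBF kernel, see http://example.org", is_reply=True)
features = toy_features(reply, "SVM kernel choice", [first])
```

In a full pipeline, vectors like `features` would be paired with High, Medium, or Low labels and passed to an SVM with a non-linear kernel; the choice of kernel and its parameters is not stated in the abstract.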

Index Terms

Automatic scoring of online discussion posts
1. Computing methodologies
  1. Machine learning
2. Information systems
  1. Information retrieval

Published In

WICOW '08: Proceedings of the 2nd ACM workshop on Information credibility on the web

October 2008

100 pages

ISBN:9781605582597

DOI:10.1145/1458527

General Chairs: Katsumi Tanaka (Kyoto University, Japan), Takashi Matsuyama (Kyoto University, Japan), Ee-Peng Lim (Singapore Management University, Singapore)

Program Chair: Adam Jatowt (Kyoto University, Japan)

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

CIKM08: Conference on Information and Knowledge Management

October 30, 2008

Napa Valley, California, USA

Acceptance Rates

Overall Acceptance Rate 9 of 19 submissions, 47%

Article Metrics

Total Citations: 42

Total Downloads: 887

Downloads (Last 12 months): 21

Downloads (Last 6 weeks): 2

Reflects downloads up to 17 Feb 2025

Cited By

Chew Y, Mohamed Zainal S (2024). A Sustainable Collaborative Talent Management Through Collaborative Intelligence Mindset Theory: A Systematic Review. Sage Open 14:2. Online publication date: 17-Jun-2024. https://doi.org/10.1177/21582440241261851

Hartwig K, Schmid S, Biselli T, Pleil H, Reuter C (2024). Misleading information in crises: exploring content-specific indicators on Twitter from a user perspective. Behaviour & Information Technology (1-34). Online publication date: 8-Jul-2024. https://doi.org/10.1080/0144929X.2024.2373166

Schmid S, Hartwig K, Cieslinski R, Reuter C (2024). Digital Resilience in Dealing with Misinformation on Social Media during COVID-19. Information Systems Frontiers 26:2 (477-499). Online publication date: 1-Apr-2024. https://dl.acm.org/doi/10.1007/s10796-022-10347-5

Taki N, Showan E, Chowdhury U, Tasnim F (2023). A Machine Learning and Deep Learning Based Approach to Detect Inaccurate Health Information in Bengali Language. 2023 International Conference on Electrical, Computer and Communication Engineering (ECCE) (01-06). Online publication date: 23-Feb-2023. https://doi.org/10.1109/ECCE57851.2023.10101612

Hartwig K, Reuter C (2023). Countering Fake News Technically – Detection and Countermeasure Approaches to Support Users. Truth and Fake in the Post-Factual Digital Age (131-147). Online publication date: 25-May-2023. https://doi.org/10.1007/978-3-658-40406-2_7

Thaker K, Chi Y, Birkhoff S, He D, Donovan H, Rosenblum L, Brusilovsky P, Hui V, Lee Y (2022). Exploring Resource-Sharing Behaviors for Finding Relevant Health Resources: Analysis of an Online Ovarian Cancer Community. JMIR Cancer 8:2 (e33110). Online publication date: 12-Apr-2022. https://doi.org/10.2196/33110

Magazzino C, Mele M, Morelli G (2021). The Relationship between Renewable Energy and Economic Growth in a Time of Covid-19: A Machine Learning Experiment on the Brazilian Economy. Sustainability 13:3 (1285). Online publication date: 26-Jan-2021. https://doi.org/10.3390/su13031285

Woo J, Yun J (2020). Content Noise Detection Model Using Deep Learning in Web Forums. Sustainability 12:12 (5074). Online publication date: 22-Jun-2020. https://doi.org/10.3390/su12125074

Benazir A, Sharmin S (2020). Credibility Assessment of User Generated health information of the Bengali language in microblogging sites employing NLP techniques. 2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) (837-844). Online publication date: Dec-2020. https://doi.org/10.1109/WIIAT50758.2020.00129

Witchel H, Thompson G, Jones C, Westling C, Romero J, Nicotra A, Maag B, Critchley H (2019). Spelling Errors and Shouting Capitalization Lead to Additive Penalties to Trustworthiness of Online Health Information: Randomized Experiment With Laypersons (Preprint). Journal of Medical Internet Research. Online publication date: 26-Jun-2019. https://doi.org/10.2196/15171
