Information credibility on twitter

Authors:
Carlos Castillo, Yahoo! Research, Barcelona, Spain
Marcelo Mendoza, Universidad Técnica Federico Santa Maria and Yahoo! Research Latin America, Santiago, Chile
Barbara Poblete, Yahoo! Research Latin America and Universidad de Chile, Santiago, Chile

WWW '11: Proceedings of the 20th international conference on World wide web, March 2011, Pages 675–684, https://doi.org/10.1145/1963405.1963500

Published: 28 March 2011

ABSTRACT

We analyze the information credibility of news propagated through Twitter, a popular microblogging service. Previous research has shown that most of the messages posted on Twitter are truthful, but the service is also used to spread misinformation and false rumors, often unintentionally.

In this paper we focus on automatic methods for assessing the credibility of a given set of tweets. Specifically, we analyze microblog postings related to "trending" topics and classify them as credible or not credible, based on features extracted from them. We use features from users' posting and re-posting ("re-tweeting") behavior, from the text of the posts, and from citations to external sources.
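To make the idea of topic-level features concrete, here is a minimal Python sketch. It is not the authors' feature set: the `Tweet` class, the `topic_features` function, and the five features it computes are hypothetical illustrations of the three feature families named above (re-posting behavior, text of the posts, citations to external sources).

```python
from dataclasses import dataclass


@dataclass
class Tweet:
    """Hypothetical minimal tweet record; the field names are assumptions."""
    text: str
    author_followers: int


def topic_features(tweets):
    """Aggregate simple per-tweet signals over all tweets of one topic.

    The features are illustrative stand-ins, not the paper's actual list.
    """
    n = len(tweets)
    if n == 0:
        raise ValueError("a topic needs at least one tweet")
    return {
        # Re-posting behavior: share of tweets that look like re-tweets.
        "frac_retweets": sum(t.text.startswith("RT @") for t in tweets) / n,
        # Citations to external sources: share of tweets containing a URL.
        "frac_with_url": sum(
            "http://" in t.text or "https://" in t.text for t in tweets
        ) / n,
        # Text of the posts: share with a question mark, and mean length.
        "frac_with_question_mark": sum("?" in t.text for t in tweets) / n,
        "avg_length": sum(len(t.text) for t in tweets) / n,
        # Users: mean follower count of the authors.
        "avg_author_followers": sum(t.author_followers for t in tweets) / n,
    }
```

A feature dictionary like this, computed once per topic, could then be passed to any standard supervised classifier trained on human credibility labels.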

We evaluate our methods using a significant number of human assessments about the credibility of items on a recent sample of Twitter postings. Our results show that there are measurable differences in the way messages propagate, and that these differences can be used to classify messages automatically as credible or not credible, with precision and recall in the range of 70% to 80%.
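For readers less familiar with the two evaluation measures, the sketch below shows how precision and recall are computed for a binary credible / not credible labeling. The function name, the label strings, and the choice of "credible" as the positive class are assumptions made for illustration; the example is not taken from the paper's evaluation.

```python
def precision_recall(true_labels, predicted_labels, positive="credible"):
    """Return (precision, recall) for the chosen positive class.

    precision = TP / (TP + FP): how many predicted positives are correct.
    recall    = TP / (TP + FN): how many actual positives are found.
    """
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    # Fall back to 0.0 when a denominator is zero (no predictions or no positives).
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall
```

With human assessments as `true_labels` and classifier output as `predicted_labels`, both values lie between 0 and 1; the range reported above corresponds to 0.7 to 0.8.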

Index Terms

Information credibility on twitter
1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Document filtering
      2. Information extraction

Published in
WWW '11: Proceedings of the 20th international conference on World wide web
March 2011
840 pages
ISBN: 9781450306324
DOI: 10.1145/1963405
General Chairs:
S. Sadagopan, IIIT-Bangalore, India
Krithi Ramamritham, IIT-Bombay, India
Arun Kumar, IBM Research, India
M. P. Ravindra, Infosys E & R, India
Program Chairs:
Elisa Bertino, Purdue University, USA
Ravi Kumar, Yahoo! Research, USA
Copyright © 2011 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
social media analytics
social media credibility
twitter
Qualifiers
- research-article
Acceptance Rates
Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
