DOI: 10.1145/1571941.1572090
poster

Is spam an issue for opinionated blog post search?

Published: 19 July 2009

Abstract

In opinion-finding, the retrieval system is tasked with retrieving not just relevant documents, but those that also express an opinion towards the query target entity. This task has been studied in the context of the blogosphere by groups participating in the 2006-2008 TREC Blog tracks. Spam blogs (splogs) are thought to be a problem on the blogosphere. In this paper, we investigate the extent to which spam has affected the participating groups' retrieval systems over the three years of the TREC Blog track opinion-finding task. Our results show that spam can be an issue, with most systems retrieving some spam for every topic. However, removing spam from the rankings does not markedly change the relative performance of opinion-finding approaches.
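The comparison described in the abstract (score each system, strip spam from its rankings, score again, and check whether the systems keep the same relative order) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the evaluation measure or the rank-correlation statistic, so mean average precision and Kendall's tau are stand-ins chosen here, and all system names, topic ids, and document ids are invented toy data.

```python
from itertools import combinations


def average_precision(ranking, relevant):
    """Average precision of one ranked list against a set of relevant doc ids."""
    if not relevant:
        return 0.0
    hits, total = 0, 0.0
    for rank, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


def remove_spam(ranking, spam):
    """Drop known spam documents, keeping the order of the rest."""
    return [doc for doc in ranking if doc not in spam]


def mean_ap(run, qrels, spam=frozenset()):
    """MAP of a run ({topic: ranked doc ids}) after filtering out `spam`."""
    scores = [
        average_precision(remove_spam(ranking, spam), qrels[topic])
        for topic, ranking in run.items()
    ]
    return sum(scores) / len(scores)


def kendall_tau(xs, ys):
    """Kendall's tau-a between two equal-length score lists (no tie correction)."""
    concordant = discordant = 0
    for i, j in combinations(range(len(xs)), 2):
        sign = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    pairs = len(xs) * (len(xs) - 1) / 2
    return (concordant - discordant) / pairs


# Hypothetical toy data: two topics, three systems, two spam documents.
qrels = {"t1": {"d1", "d3"}, "t2": {"d5"}}
spam = {"s1", "s2"}
runs = {
    "sysA": {"t1": ["d1", "s1", "d3"], "t2": ["s2", "d5"]},
    "sysB": {"t1": ["s1", "d3", "d1"], "t2": ["d5", "s2"]},
    "sysC": {"t1": ["s1", "s2", "d1"], "t2": ["s2", "s1", "d5"]},
}

systems = sorted(runs)
map_with_spam = [mean_ap(runs[s], qrels) for s in systems]
map_without_spam = [mean_ap(runs[s], qrels, spam) for s in systems]
tau = kendall_tau(map_with_spam, map_without_spam)

for name, before, after in zip(systems, map_with_spam, map_without_spam):
    print(f"{name}: MAP with spam {before:.3f}, without spam {after:.3f}")
print(f"Kendall's tau between the two system orderings: {tau:.3f}")
```

A tau close to 1 would mean the systems are ordered much the same with or without spam, which is the kind of outcome the abstract reports. The tau-a variant used here ignores ties, so a tie-corrected statistic would be a better fit for real runs with many equal scores.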

    Published In

    SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
    July 2009
    896 pages
    ISBN:9781605584836
    DOI:10.1145/1571941

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. blogs
    2. opinion-finding
    3. spam
    4. splogs

    Qualifiers

    • Poster

    Conference

    SIGIR '09
    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Cited By

    • (2018) Online social networking services and spam detection approaches in opinion mining-a review. International Journal of Web Based Communities 14:4 (353-378). DOI: 10.5555/3302823.3302826. Online publication date: 1-Jan-2018
    • (2012) Text mining and probabilistic language modeling for online review spam detection. ACM Transactions on Management Information Systems 2:4 (1-30). DOI: 10.1145/2070710.2070716. Online publication date: 5-Jan-2012
    • (2012) An Opinion Mining Technique For Chinese Blogs. Future Information Technology, Application, and Service (281-289). DOI: 10.1007/978-94-007-4516-2_28. Online publication date: 5-Jun-2012
    • (2011) Efficient and effective spam filtering and re-ranking for large web datasets. Information Retrieval 14:5 (441-465). DOI: 10.1007/s10791-011-9162-z. Online publication date: 30-Jan-2011
    • (2011) Multi-facets Quality Assessment of Online Opinionated Expressions. Web Information Systems Engineering – WISE 2010 Workshops (212-225). DOI: 10.1007/978-3-642-24396-7_17. Online publication date: 2011
    • (2010) Multi-facets quality assessment of online opinionated expressions. Proceedings of the 2010 international conference on Web information systems engineering (212-225). DOI: 10.5555/2044492.2044514. Online publication date: 12-Dec-2010
    • (2010) Blog track research at TREC. ACM SIGIR Forum 44:1 (58-75). DOI: 10.1145/1842890.1842899. Online publication date: 18-Aug-2010
    • (2010) Toward a Language Modeling Approach for Consumer Review Spam Detection. 2010 IEEE 7th International Conference on E-Business Engineering (1-8). DOI: 10.1109/ICEBE.2010.47. Online publication date: Nov-2010
