skip to main content
10.1145/2939672.2939837acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Public Access

From Truth Discovery to Trustworthy Opinion Discovery: An Uncertainty-Aware Quantitative Modeling Approach

Authors Info & Claims
Published:13 August 2016Publication History

ABSTRACT

In this era of information explosion, conflicts are often encountered when information is provided by multiple sources. Traditional truth discovery task aims to identify the truth the most trustworthy information, from conflicting sources in different scenarios. In this kind of tasks, truth is regarded as a fixed value or a set of fixed values. However, in a number of real-world cases, objective truth existence cannot be ensured and we can only identify single or multiple reliable facts from opinions. Different from traditional truth discovery task, we address this uncertainty and introduce the concept of trustworthy opinion of an entity, treat it as a random variable, and use its distribution to describe consistency or controversy, which is particularly difficult for data which can be numerically measured, i.e. quantitative information. In this study, we focus on the quantitative opinion, propose an uncertainty-aware approach called Kernel Density Estimation from Multiple Sources (KDEm) to estimate its probability distribution, and summarize trustworthy information based on this distribution. Experiments indicate that KDEm not only has outstanding performance on the classical numeric truth discovery task, but also shows good performance on multi-modality detection and anomaly detection in the uncertain-opinion setting.

References

  1. A. Berlinet and C. Thomas-Agnan. Reproducing kernel Hilbert spaces in probability and statistics. Springer Science & Business Media, 2011.Google ScholarGoogle Scholar
  2. D. P. Bertsekas. Nonlinear programming. Athena Scientific, 1999.Google ScholarGoogle Scholar
  3. X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: the role of source dependence. PVLDB, 2(1):550--561, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. X. L. Dong, L. Berti-Equille, and D. Srivastava. Data fusion: resolving conflicts from multiple sources. In WAIM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroborating information from disagreeing views. In WSDM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Hinneburg and H.-H. Gabriel. Denclue 2.0: Fast clustering based on kernel density estimation. In Advances in Intelligent Data Analysis VII, pages 70--80. Springer, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Kim and C. D. Scott. Robust kernel density estimation. JMLR, 13(1):2529--2565, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. L.-W. Ku, Y.-T. Liang, and H.-H. Chen. Opinion extraction, summarization and tracking in news and blog corpora. In AAAI spring symposium: Computational approaches to analyzing weblogs, volume 100107, 2006.Google ScholarGoogle Scholar
  9. Q. Li, Y. Li, J. Gao, L. Su, B. Zhao, M. Demirbas, W. Fan, and J. Han. A confidence-aware approach for truth discovery on long-tail data. PVLDB, 8(4):425--436, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Q. Li, Y. Li, J. Gao, B. Zhao, W. Fan, and J. Han. Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation. In SIGMOD, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. X. Li, W. Meng, and C. Yu. T-verifier: Verifying truthfulness of fact statements. In ICDE, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. Li, J. Gao, C. Meng, Q. Li, L. Su, B. Zhao, W. Fan, and J. Han. A survey on truth discovery. ACM SIGKDD Explorations Newsletter, 17(2):1--16, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. W. Ouyang, L. Kaplan, P. Martin, A. Toniolo, M. Srivastava, and T. J. Norman. Debiasing crowdsourced quantitative characteristics in local businesses and services. In IPSN, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. E. Parzen. On estimation of a probability density function and mode. The annals of mathematical statistics, pages 1065--1076, 1962.Google ScholarGoogle Scholar
  15. J. Pasternack and R. Dan. Making better informed trust decisions with generalized fact-finding. In IJCAI, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Pasternack and D. Roth. Knowing what to believe (when you already know something). In COLING, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Pasternack and D. Roth. Latent credibility analysis. In WWW, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. G. J. Qi, C. C. Aggarwal, J. Han, and T. Huang. Mining collective intelligence in diverse groups. In WWW, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. V. G. V. Vydiswaran, C. X. Zhai, and D. Roth. Content-driven trust propagation framework. In SIGKDD, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Wang, L. Kaplan, H. Le, and T. Abdelzaher. On truth discovery in social sensing: A maximum likelihood estimation approach. In IPSN, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis on review text data: a rating regression approach. In SIGKDD, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis without aspect keyword supervision. In SIGKDD, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. X. Yin, J. Han, and P. S. Yu. Truth discovery with multiple conflicting information providers on the web. Knowledge and Data Engineering, IEEE Transactions on, 20(6):796--808, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. B. Zhao and J. Han. A probabilistic model for estimating real-valued truth from conflicting sources. QDB Workshop, 2012.Google ScholarGoogle Scholar
  25. B. Zhao, B. I. P. Rubinstein, J. Gemmell, and J. Han. A bayesian approach to discovering truth from conflicting sources for data integration. PVLDB, 5(6):550--561, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. S. Zhi, B. Zhao, W. Tong, J. Gao, D. Yu, H. Ji, and J. Han. Modeling truth existence in truth discovery. In SIGKDD, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. D. Zhou, J. C. Platt, S. Basu, and Y. Mao. Learning from the wisdom of crowds by minimax entropy. In NIPS, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. From Truth Discovery to Trustworthy Opinion Discovery: An Uncertainty-Aware Quantitative Modeling Approach

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
        August 2016
        2176 pages
        ISBN:9781450342322
        DOI:10.1145/2939672

        Copyright © 2016 ACM

        © 2016 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the United States Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 13 August 2016

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        KDD '16 Paper Acceptance Rate66of1,115submissions,6%Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader