skip to main content
10.1145/2110363.2110390acmconferencesArticle/Chapter ViewAbstractPublication PagesihiConference Proceedingsconference-collections
research-article

Sentiment lexicons for health-related opinion mining

Authors Info & Claims
Published:28 January 2012Publication History

ABSTRACT

Opinion mining consists in extracting from a text opinions expressed by its author and their polarity. Lexical resources, such as polarized lexicons, are needed for this task. Opinion mining in the medical domain has not been well explored, partly because little credence is given to patients and their opinions (although more and more of them are using social media). We are interested in opinion mining of user-generated content on drugs/medication. We present in this paper the creation of our lexical resources and their adaptation to the medical domain. We first describe the creation of a general lexicon, containing opinion words from the general domain and their polarity. Then we present the creation of a medical opinion lexicon, based on a corpus of drug reviews. We show that some words have a different polarity in the general domain and in the medical one. Some words considered generally as neutral are opinionated in medical texts. We finally evaluate the lexicons and show with a simple algorithm that using our general lexicon gives better results than other well-known ones on our corpus and that adding the domain lexicon improves them as well.

References

  1. S. Baccianella, A. Esuli, and F. Sebastiani. Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In Proceedings of the 7th Conference on Language Resources and Evaluation, pages 2200--2204, 2010.Google ScholarGoogle Scholar
  2. K. Denecke and W. Nejdl. How valuable is medical social media data? content analysis of the medical web. Journal of Information Sciences, 179(12):1870--1880, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. W. Du and S. Tan. Building domain-oriented sentiment lexicon by improved information bottleneck. In Proceedings of the International Conference on Information and Knowledge Management, pages 1749--1752, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. W. Du, S. Tan, X. Cheng, and X. Yun. Adapting information bottleneck method for automatic construction of domain-oriented sentiment lexicon. In Proceedings of the third ACM International Conference on Web Search and Data Mining, pages 111--119, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. L. Goeuriot, J.-C. Na, W. Y. M. Kyaing, S. Foo, C. Khoo, Y.-L. Theng, and Y.-K. Chang. Textual and informational characteristics of health-related social media content: A study of drug review forums. In Proceedings of the Asia-Pacific Conference on Library and Information Education and Practice: Issues, Challenges and Opportunities, pages 548--557, 2011.Google ScholarGoogle Scholar
  6. L. Goeuriot, J.-C. Na, W. Y. M. Kyaing, C. Khoo, Y.-L. Theng, Y.-K. Chang, and S. Foo. Textual and informational characteristics of drug-related content on three types of websites: Drug reviews forum, discussion board and medical portal. International Journal of Organizational and Collective Intelligence on "Social Media in E-Health: Emergent Issues in Ethics, Trust and Privacy", 2011. In press.Google ScholarGoogle Scholar
  7. W. Himmel, U. Reincke, and H. W. Michelmann. Using text mining to classify lay requests to a medical expert forum and to prepare semiautomatic answers. Journal of Medical Internet Research, 11(3):e25, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  8. A. Jaloba. The club no one wants to join: Online behaviour on a breast cancer discussion forum. First Monday {Online}, 14(7), 2009.Google ScholarGoogle Scholar
  9. H. Kanayama and T. Nasukawa. Fully automatic lexicon expansion for domain-oriented sentiment analysis. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 355--363, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. X. Ma, G. Chen, and J. Xiao. Analysis on an online health social network. In Proceedings of the 1st ACM International Health Informatics Symposium, pages 297--306, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. A. Miller. Wordnet: A lexical database for english. Communications of the ACM, 38(11):39--41, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Névéol and Z. Lu. Automatic integration of drug indications from multiple health resources. In Proceedings of the 1st ACM International Health Informatics Symposium, pages 666--673, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Sarasohn-Kahn. The wisdom of patients: Health care meets online social media. Technical report, California Healthcare Foundation, 2008.Google ScholarGoogle Scholar
  14. M. C. Schraefel, R. W. White, P. André, and D. Tan. Investigating web search strategies and forum use to support diet and weight loss. In Proceedings of the 27th international conference extended abstracts on Human factors in computing systems, pages 3829--3834, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. T. Thet, J.-C. Na, and C. Khoo. Aspect-based sentiment analysis of movie reviews on discussion boards. Journal of Information Science, 36(6):823--848, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. L. Velikovitch, S. Blair-Goldenshon, and R. McDonald. The viability of web-derived polarity lexicons. In Proceedings of Human Language Technologies Conference, pages 777--785, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of Human Language Technologies Conference/Conference on Empirical Methods in Natural Language Processing, pages 347--354, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. L. Xia, A. L. Gentile, J. Munro, and J. Iria. Improving patient opinion mining through multi-step classification. Lecture Notes in Artificial Intelligence, 5729:70--76, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Sentiment lexicons for health-related opinion mining

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        IHI '12: Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
        January 2012
        914 pages
        ISBN:9781450307819
        DOI:10.1145/2110363

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 28 January 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader