skip to main content
10.1145/1593254.1593264acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicecConference Proceedingsconference-collections
research-article

Extracting customer knowledge from online consumer reviews: a collaborative-filtering-based opinion sentence identification approach

Published:12 August 2009Publication History

ABSTRACT

Due to the popularity of online retail stores, consumers are not only shopping and comparing consumer products on the Web but also providing their consumer reviews on the Internet platform. The Web has become the largest repository of consumer reviews. Consumer reviews are beneficial to consumers, merchants, and manufacturers. Consumers may read the comments of other consumers and decide whether the product is good in the specific product features that they are interested in. For merchants or product manufacturers, consumer reviews help them understand general responses of customers on their products for product or marketing campaign improvement. In addition, consumer reviews can enable merchants better understand specific preferences of individual customers and facilitates effective marketing decisions. However, the large volume of consumer reviews makes it impossible for any individual consumer, merchant, or manufacturer to extract important knowledge efficiently. In this study, we concentrate on opinion sentence identification of focused sentiment analysis and propose a collaborative-filtering-based opinion sentence identification (CF-OSI) technique. The proposed CF-OSI technique considers opinion sentence identification as the sentence retrieval problem. In addition, a collaborative-filtering-based query expansion approach is incorporated into the CF-OSI technique to address possible effectiveness degradation caused by short user queries (i.e., limited number of query terms in query queries). Experiments have been conducted to empirically evaluate the effectiveness of our proposed technique. Our evaluation results show that the performance of our proposed CF-OSI technique is promising.

References

  1. Agrawal, R. and Srikant, R. 1994. Fast algorithms for mining association rules in large databases. In Proceedings of 20th International Conference on Very Large Data Bases, Santiago, Chile, 487--499. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Agarwal, R., Sinha, A., and Tanniru, M. 1996. Cognitive fit in requirements modeling: A study of object and process methodologies. Journal of Management Information Systems 13, 2 (1996), 137--162. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Anderberg, M. R. 1973. Cluster analysis for applications. Academic Press, New York, NY.Google ScholarGoogle Scholar
  4. Attar, R. and Fraenkel, A. S. 1977. Local feedback in full-text retrieval systems. Journal of the ACM 24, 3 (Jul. 1977), 397--417. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Balabanovic, M. and Shoham, Y. 1997. Fab: Content-based, collaborative recommendation. Communications of the ACM 40, 3 (1997), 66--72. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Bhargava, H. K., Choudhary, V., and Krishnan, R. 2000. Pricing and product design: Intermediary strategies in an electronic market. International Journal of Electronic Commerce 5, 1 (Fall 2000), 37--56. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Buckley, C., Salton, G., and Allan, J. 1994. The effect of adding relevance information in a relevance feedback environment. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Dublin, Ireland). SIGIR'94. ACM Press, New York, NY, 292--300. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Dave, K., Lawrence, S., and Pennock, D. M. 2003. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of the 12th International Conference on World Wide Web (Budapest, Hungary). WWW'03. ACM Press, New York, NY, 519--528. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Etzioni, O., Cafarella, M., Downey, D., Popescu, A. M., Shaked, T., Soderland, S., Weld, D. S., and Yates, A. 2005. Unsupervised named-entity extraction from the Web: An experimental study. Artificial Intelligence 165, 1 (Jun. 2005), 91--134. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Hawking, D. 2006. Web search engines: Part 2. IEEE Computer 39, 8 (Aug. 2006), 88--90. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Herlocker, J. L., Konstan, J. A., Borchers, A., and Riedl, J. 1999. An algorithmic framework for performing collaborative filtering. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Berkeley, CA). SIGIR'99. ACM Press, New York, NY, 230--237. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Hu, M. and Liu, B. 2004a. Mining and summarizing customer reviews. In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Seattle, WA). ACM Press, New York, NY, 168--177. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Hu, M. and Liu, B. 2004b. Mining opinion features in customer reviews. In Proceedings of the 19th National Conference on Artificial Intelligence. (San Jose, CA). AAAI'04. AAAI Press, Menlo Park, CA, 755--760. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Kobayashi, N., Iida, R., Inui, K., and Matsumotto, Y. 2005. Opinion extraction using a learning-based anaphora resolution technique. In Proceedings of the Second International Joint Conference on Natural Language Processing (Jeju Island, Korea). IJCNLP'05. Springer, Berlin/Heidelberg, German, 173--178.Google ScholarGoogle Scholar
  15. Kobayashi, N., Inui, K., and Matsumotto, Y. 2004. Collecting evaluative expressions for opinion extraction. In Proceedings of the First International Joint Conference on Natural Language Processing (Sanya City, Hainan Island, China). IJCNLP'04. Springer, Berlin/Heidelberg, German, 596--605. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Liu, B., Hu, M., and Cheng, J. 2005. Opinion observer: Analyzing and comparing opinions on the Web. In Proceedings of the 14th International Conference on World Wide Web (Chiba, Japan). WWW'05. ACM Press, New York, NY, 342--351. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Menczer, F., Street, W. N., Monge, A. E. 2002. Adaptive assistants for customized e-shopping. IEEE Intelligent Systems 17, 6 (Nov./Dec. 2002), 12--19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Pang, B., Lee, L., and Vaithyanathan, S. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of 2002 Conference on Empirical Methods in Natural Language Processing (Philadelphia, PA). EMNLP 2002. ACL, East Stroudsburg, PA, 79--86. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Popescu, A. and Etzioni, O. 2005. Extracting product features and opinions from reviews. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (Vancouver, B.C., Canada). HLT/EMNLP 2005. ACL, East Stroudsburg, PA, 339--346. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Rocchio, J. 1971. Relevance feedback in information retrieval. In The SMART Retrieval System: Experiments in Automatic Document Processing, G. Salton, Ed. Prentice-Hall, Englewood Cliffs, NJ, 313--323.Google ScholarGoogle Scholar
  21. Salton, G., Wong, A., and Yang, C. S. 1975. A vector space model for automatic indexing. Communications of the ACM 18, 11 (1975), 613--620. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Sarwar, B. M., Karypis, G., Konstan, J. A., and Riedl, J. 2000. Analysis of recommendation algorithms for E-commerce. In Proceedings of the 2nd ACM Conference on Electronic Commerce (Minneapolis, MN). EC-00. ACM Press, New York, NY, 158--167. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Sebastiani, F. 2002. Machine learning in automated text categorization. ACM Computing Surveys 34, 1 (2002), 1--47. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Srikant, R. and Agrawal, R. 1995. Mining generalized association rules. In Proceedings of the 1995 International Conference on Very Large Data Bases (Zurich, Switzerland). VLDB'95. Morgan Kaufmann, San Francisco, CA, 407--419. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Turney, P. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Conference on Association for Computational Linguistics (Philadelphia, PA). ACL-02. ACL, East Stroudsburg, PA, 417--424. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Wong, R. and Yang, C. C. 2005. Collaborative infomediary for financial news. In Proceedings of the Fourth Workshop on e-Business (Las Vegas, NV) WeB'05.Google ScholarGoogle Scholar
  27. Wei, C., Yang, C. S., and Huang, C. N. 2006. Turning online product reviews to customer knowledge: A semantic-based sentiment classification approach. In Proceedings of 10th Pacific Asia Conference on Information Systems (Kuala Lumpur, Malaysia) PACIS'06.Google ScholarGoogle Scholar
  28. Wei, C., Chen, Y. M., Yang, C. S., Yang, C. C. forthcoming. Understanding what concerns consumers: A semantic approach to product feature extraction from consumer reviews. Journal of Information Systems and E-Business Management (forthcoming).Google ScholarGoogle Scholar
  29. Xu, J. and Croft, W. B. 2000. Improving the effectiveness of information retrieval with local context analysis. ACM Transactions on Information Systems 18, 1 (Jan. 2000), 79--112. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Yang, C. C. and Wong, R. 2006. Measuring success factors of E-commerce infomediary. In Proceedings of the Pacific Asia Conference on Information Systems (Kuala Lumpur, Malaysia) PACIS'06.Google ScholarGoogle Scholar
  31. Yu, C. T., Luk, W. S., and Cheung, T. Y. 1976. A statistical model for relevance feedback in information retrieval. Journal of ACM 23, 2 (Apr. 1976), 273--286. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Yuan, S. T. 2003. A personalized and integrative comparison-shopping engine and its applications. Decision Support Systems 34, 2 (2003), 139--156. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Extracting customer knowledge from online consumer reviews: a collaborative-filtering-based opinion sentence identification approach

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          ICEC '09: Proceedings of the 11th International Conference on Electronic Commerce
          August 2009
          407 pages
          ISBN:9781605585864
          DOI:10.1145/1593254

          Copyright © 2009 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 12 August 2009

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate150of244submissions,61%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader