ABSTRACT
Due to the popularity of online retail stores, consumers are not only shopping and comparing consumer products on the Web but also providing their consumer reviews on the Internet platform. The Web has become the largest repository of consumer reviews. Consumer reviews are beneficial to consumers, merchants, and manufacturers. Consumers may read the comments of other consumers and decide whether the product is good in the specific product features that they are interested in. For merchants or product manufacturers, consumer reviews help them understand general responses of customers on their products for product or marketing campaign improvement. In addition, consumer reviews can enable merchants better understand specific preferences of individual customers and facilitates effective marketing decisions. However, the large volume of consumer reviews makes it impossible for any individual consumer, merchant, or manufacturer to extract important knowledge efficiently. In this study, we concentrate on opinion sentence identification of focused sentiment analysis and propose a collaborative-filtering-based opinion sentence identification (CF-OSI) technique. The proposed CF-OSI technique considers opinion sentence identification as the sentence retrieval problem. In addition, a collaborative-filtering-based query expansion approach is incorporated into the CF-OSI technique to address possible effectiveness degradation caused by short user queries (i.e., limited number of query terms in query queries). Experiments have been conducted to empirically evaluate the effectiveness of our proposed technique. Our evaluation results show that the performance of our proposed CF-OSI technique is promising.
- Agrawal, R. and Srikant, R. 1994. Fast algorithms for mining association rules in large databases. In Proceedings of 20th International Conference on Very Large Data Bases, Santiago, Chile, 487--499. Google ScholarDigital Library
- Agarwal, R., Sinha, A., and Tanniru, M. 1996. Cognitive fit in requirements modeling: A study of object and process methodologies. Journal of Management Information Systems 13, 2 (1996), 137--162. Google ScholarDigital Library
- Anderberg, M. R. 1973. Cluster analysis for applications. Academic Press, New York, NY.Google Scholar
- Attar, R. and Fraenkel, A. S. 1977. Local feedback in full-text retrieval systems. Journal of the ACM 24, 3 (Jul. 1977), 397--417. Google ScholarDigital Library
- Balabanovic, M. and Shoham, Y. 1997. Fab: Content-based, collaborative recommendation. Communications of the ACM 40, 3 (1997), 66--72. Google ScholarDigital Library
- Bhargava, H. K., Choudhary, V., and Krishnan, R. 2000. Pricing and product design: Intermediary strategies in an electronic market. International Journal of Electronic Commerce 5, 1 (Fall 2000), 37--56. Google ScholarDigital Library
- Buckley, C., Salton, G., and Allan, J. 1994. The effect of adding relevance information in a relevance feedback environment. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Dublin, Ireland). SIGIR'94. ACM Press, New York, NY, 292--300. Google ScholarDigital Library
- Dave, K., Lawrence, S., and Pennock, D. M. 2003. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of the 12th International Conference on World Wide Web (Budapest, Hungary). WWW'03. ACM Press, New York, NY, 519--528. Google ScholarDigital Library
- Etzioni, O., Cafarella, M., Downey, D., Popescu, A. M., Shaked, T., Soderland, S., Weld, D. S., and Yates, A. 2005. Unsupervised named-entity extraction from the Web: An experimental study. Artificial Intelligence 165, 1 (Jun. 2005), 91--134. Google ScholarDigital Library
- Hawking, D. 2006. Web search engines: Part 2. IEEE Computer 39, 8 (Aug. 2006), 88--90. Google ScholarDigital Library
- Herlocker, J. L., Konstan, J. A., Borchers, A., and Riedl, J. 1999. An algorithmic framework for performing collaborative filtering. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Berkeley, CA). SIGIR'99. ACM Press, New York, NY, 230--237. Google ScholarDigital Library
- Hu, M. and Liu, B. 2004a. Mining and summarizing customer reviews. In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Seattle, WA). ACM Press, New York, NY, 168--177. Google ScholarDigital Library
- Hu, M. and Liu, B. 2004b. Mining opinion features in customer reviews. In Proceedings of the 19th National Conference on Artificial Intelligence. (San Jose, CA). AAAI'04. AAAI Press, Menlo Park, CA, 755--760. Google ScholarDigital Library
- Kobayashi, N., Iida, R., Inui, K., and Matsumotto, Y. 2005. Opinion extraction using a learning-based anaphora resolution technique. In Proceedings of the Second International Joint Conference on Natural Language Processing (Jeju Island, Korea). IJCNLP'05. Springer, Berlin/Heidelberg, German, 173--178.Google Scholar
- Kobayashi, N., Inui, K., and Matsumotto, Y. 2004. Collecting evaluative expressions for opinion extraction. In Proceedings of the First International Joint Conference on Natural Language Processing (Sanya City, Hainan Island, China). IJCNLP'04. Springer, Berlin/Heidelberg, German, 596--605. Google ScholarDigital Library
- Liu, B., Hu, M., and Cheng, J. 2005. Opinion observer: Analyzing and comparing opinions on the Web. In Proceedings of the 14th International Conference on World Wide Web (Chiba, Japan). WWW'05. ACM Press, New York, NY, 342--351. Google ScholarDigital Library
- Menczer, F., Street, W. N., Monge, A. E. 2002. Adaptive assistants for customized e-shopping. IEEE Intelligent Systems 17, 6 (Nov./Dec. 2002), 12--19. Google ScholarDigital Library
- Pang, B., Lee, L., and Vaithyanathan, S. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of 2002 Conference on Empirical Methods in Natural Language Processing (Philadelphia, PA). EMNLP 2002. ACL, East Stroudsburg, PA, 79--86. Google ScholarDigital Library
- Popescu, A. and Etzioni, O. 2005. Extracting product features and opinions from reviews. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (Vancouver, B.C., Canada). HLT/EMNLP 2005. ACL, East Stroudsburg, PA, 339--346. Google ScholarDigital Library
- Rocchio, J. 1971. Relevance feedback in information retrieval. In The SMART Retrieval System: Experiments in Automatic Document Processing, G. Salton, Ed. Prentice-Hall, Englewood Cliffs, NJ, 313--323.Google Scholar
- Salton, G., Wong, A., and Yang, C. S. 1975. A vector space model for automatic indexing. Communications of the ACM 18, 11 (1975), 613--620. Google ScholarDigital Library
- Sarwar, B. M., Karypis, G., Konstan, J. A., and Riedl, J. 2000. Analysis of recommendation algorithms for E-commerce. In Proceedings of the 2nd ACM Conference on Electronic Commerce (Minneapolis, MN). EC-00. ACM Press, New York, NY, 158--167. Google ScholarDigital Library
- Sebastiani, F. 2002. Machine learning in automated text categorization. ACM Computing Surveys 34, 1 (2002), 1--47. Google ScholarDigital Library
- Srikant, R. and Agrawal, R. 1995. Mining generalized association rules. In Proceedings of the 1995 International Conference on Very Large Data Bases (Zurich, Switzerland). VLDB'95. Morgan Kaufmann, San Francisco, CA, 407--419. Google ScholarDigital Library
- Turney, P. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Conference on Association for Computational Linguistics (Philadelphia, PA). ACL-02. ACL, East Stroudsburg, PA, 417--424. Google ScholarDigital Library
- Wong, R. and Yang, C. C. 2005. Collaborative infomediary for financial news. In Proceedings of the Fourth Workshop on e-Business (Las Vegas, NV) WeB'05.Google Scholar
- Wei, C., Yang, C. S., and Huang, C. N. 2006. Turning online product reviews to customer knowledge: A semantic-based sentiment classification approach. In Proceedings of 10th Pacific Asia Conference on Information Systems (Kuala Lumpur, Malaysia) PACIS'06.Google Scholar
- Wei, C., Chen, Y. M., Yang, C. S., Yang, C. C. forthcoming. Understanding what concerns consumers: A semantic approach to product feature extraction from consumer reviews. Journal of Information Systems and E-Business Management (forthcoming).Google Scholar
- Xu, J. and Croft, W. B. 2000. Improving the effectiveness of information retrieval with local context analysis. ACM Transactions on Information Systems 18, 1 (Jan. 2000), 79--112. Google ScholarDigital Library
- Yang, C. C. and Wong, R. 2006. Measuring success factors of E-commerce infomediary. In Proceedings of the Pacific Asia Conference on Information Systems (Kuala Lumpur, Malaysia) PACIS'06.Google Scholar
- Yu, C. T., Luk, W. S., and Cheung, T. Y. 1976. A statistical model for relevance feedback in information retrieval. Journal of ACM 23, 2 (Apr. 1976), 273--286. Google ScholarDigital Library
- Yuan, S. T. 2003. A personalized and integrative comparison-shopping engine and its applications. Decision Support Systems 34, 2 (2003), 139--156. Google ScholarDigital Library
Index Terms
Extracting customer knowledge from online consumer reviews: a collaborative-filtering-based opinion sentence identification approach
Recommendations
Product Comparison Networks for Competitive Analysis of Online Word-of-Mouth
Enabled by Web 2.0 technologies social media provide an unparalleled platform for consumers to share their product experiences and opinions---through word-of-mouth (WOM) or consumer reviews. It has become increasingly important to understand how WOM ...
Deriving the Pricing Power of Product Features by Mining Consumer Reviews
Increasingly, user-generated product reviews serve as a valuable source of information for customers making product choices online. The existing literature typically incorporates the impact of product reviews on sales based on numeric variables ...
Show me the money!: deriving the pricing power of product features by mining consumer reviews
KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data miningThe increasing pervasiveness of the Internet has dramatically changed the way that consumers shop for goods. Consumer-generated product reviews have become a valuable source of information for customers, who read the reviews and decide whether to buy ...
Comments