ABSTRACT
Aspect-oriented opinion mining detects the reviewers' sentiment orientation (e.g. positive, negative or neutral) towards different product-features. Domain customization is a big challenge for opinion mining due to the accuracy loss across domains. In this paper, we show our experiences and lessons learned in the domain customization for the aspect-oriented opinion analysis system OpinionIt. We present a customization method for sentiment classification with multi-level latent sentiment clues. We first construct Latent Semantic Association model to capture latent association among product-features from the unlabeled corpus. Meanwhile, we present an unsupervised method to effectively extract various domain-specific sentiment clues from the unlabeled corpus. In the customization, we tune the sentiment classifier on the labeled source domain data by incorporating the multi-level latent sentiment clues (e.g. latent association among product-features, domain-specific and generic sentiment clues). Experimental results show that the proposed method significantly reduces the accuracy loss of sentiment classification without any labeled target domain data.
- D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3(5):993--1022, 2003. Google ScholarDigital Library
- J. Blitzer, M. Dredze, and F. Pereira. Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. In Proceedings of ACL'07, pages 440--447, 2007.Google Scholar
- K. W. Church and P. Hanks. Word association norms, mutual information and lexicography. Computational Linguistics, 16(1):22--29, 1990. Google ScholarDigital Library
- H. Guo, H. Zhu, Z. Guo, X. Zhang, and Z. Su. Product feature categorization with multi-level latent semantic association. In Proceedings of CIKM09, 2009. Google ScholarDigital Library
- H. Guo, H. Zhu, Z. Guo, X. Zhang, and Z. Su. Opinionit: A text mining system for cross-lingual opinion analysis. In Proceedings of CIKM10, 2010. Google ScholarDigital Library
- T. Li, V. Sindhwani, C. Ding, and Y. Zhang. Knowledge transformation for cross-domain sentiment classification. In Proceedings of SIGIR'09, 2009. Google ScholarDigital Library
- S. J. Pan, X. Ni, J. Sun, Q. Yang, and Z. Chen. Cross-domain sentiment classification via spectral feature alignment. In Proceedings of WWW2010, 2010. Google ScholarDigital Library
- P. Pang and L. Lee. A sentiment education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of ACL04, 2004. Google ScholarDigital Library
- P. Turney. Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In Proceedings of ACL'02, 2002. Google ScholarDigital Library
- T. Zhang, F. Damerau, and D. E. Johnson. Text chunking based on a generalization of Winnow. Journal of Machine Learning Research, 2:615--637, 2002. Google ScholarDigital Library
Index Terms
- Domain customization for aspect-oriented opinion analysis with multi-level latent sentiment clues
Recommendations
Exploring weakly supervised latent sentiment explanations for aspect-level review analysis
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge ManagementIn sentiment analysis, aspect-level review analysis has been an important task because it can catalogue, aggregate, or summarize various opinions according to a product's properties. In this paper, we explore a new concept for aspect-level review ...
Joint sentiment/topic model for sentiment analysis
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge managementSentiment analysis or opinion mining aims to use automated tools to detect subjective information such as opinions, attitudes, and feelings expressed in text. This paper proposes a novel probabilistic modeling framework based on Latent Dirichlet ...
Meta-level sentiment models for big social data analysis
People react to events, topics and entities by expressing their personal opinions and emotions. These reactions can correspond to a wide range of intensities, from very mild to strong. An adequate processing and understanding of these expressions has ...
Comments