An Opinion Document Clustering Technique for Product Characterization

Jae-Young Chang


Opinion Mining is one of the application domains of text mining which extracting opinions from documents, and much researches are currently underway. Most of related researches focused on the sentiment classification which classifies the documents into positive/negative opinions. However, there is a little interest in extracting the features characterizing the individual product. In this paper, we propose the technique classifying the opinion documents according to the product features, and selecting the those features characterizing each product. In the proposed method, we utilize the document clustering technique and develope a new algorithm for evaluating the similarity between documents. In addition, through experiments, we prove the usefulness of proposed method.

Full Text:



Liu, B., Hu, M., and Cheng, J., “Opinion observer : analyzing and comparing opinions on the Web,” Proceedings of the 14th international conference on WWW, pp. 10-14, 2005.

Scaffdi, C., Bierhoff, K., Chang, E., Felker, M., Ng, H., and Jin, C., “Red Opal : Product-Feature Scoring from Reviews,” Proceedings of the 8th ACM conference on Electronic commerce, pp. 11-15, 2007.

Xiaowen Ding, and Bing Lui, “The Utility of Linguistic Rules in Opinion Mining,” SIGIR 2007, pp. 811-812, 2007.

Courses, E. and Surveys, T., “Using SentiWordNet for multilingual sentiment analysis,” IEEE 24th International Conference on Data Engineering Workshop, ICDEW 2008, 2008.

Popescu, A. O., “Extracting product features and opinions from reviews,” Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 339-396, 2005.

Liu, J., Cao, Y., Lin, C., Huang, Y., and Zhou, M., “Low-Quality Product Review Detection in Opinion Summarization,” Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 334-342, 2007.

Pak, A. and Paroubek, P., “Twitter as a Corpus for Sentiment Analysis and Opinion Mining,” Proceedings of The International Conference on Language Resources and Evaluation, pp. 1320-1326, 2010.

Tan, P., Steinbach, M., and Kumar, V., Introduction to Data Mining, Addision- Wesley, 2006.

Zhai, Z., Liu, B., Xu, H., and Jia, P., “Clustering Product Features for Opinion Mining,” Proceedings of the fourth ACM international conference on Web search and data mining, pp. 347-354, 2011.

Ahmad, T., “Clustering Technique for Feature Segregation in Opinion Analysis,” International Journal of Computer Applications, Vol. 76, No. 17, pp. 43-49, 2013.

Hu, M. and Liu, B., “Mining opinion features in customer reviews,” Proceedings of the 19th national conference on Artificial intelligence, pp. 755-760.

Liu, B., Web Data Mining : Exploring hyperlinks, contents, and usage data, Springer, 2006.

Mo-The movie ontology,

Unified Medical Language System, http://

Rho, J.-H., Kim, H., and Chang, J.-Y., Improving Hypertext Classification Systems through WordNet-based Feature Abstraction, The Journal of Society for e-Business Studies, Vol. 18, No. 2, 2013.


  • There are currently no refbacks.