Product Feature Extraction and Rating Distribution Using User Reviews

Soobin Son, Jonghoon Chun

Abstract


We propose a method to analyze the user reviews and ratings of the products in the online shopping mall and automatically extracts the features of the products to determine the characteristics of a product. By judging whether a rating is given by a specific feature of a product, our method distributes the score to each feature. Conventional methods force users to wastes time reading overflowing number of reviews and ratings to decide whether to buy the product or not. Moreover, it is difficult to grasp the merits and demerits of the product, because of the way reviews and ratings are provided. It is structured in a way that it is impossible to decide which rating is given to the which characteristics of the product. Therefore, in this paper, to resolve this problem, we propose a method to automatically extract the feature of the product from the user review and distribute the score to appropriate characteristics of the product by calculating the rating of each feature from the overall rating. proposed method collects product reviews and ratings, conducts morphological analysis, and extracts features and emotional words of the products. In addition, a method for determining the polarity of a sentence in which the feature appears is given a weight value for each feature. results of the experiment and the questionnaires comparing the existing methods show the usefulness of the proposed method. We also validates the results by comparing the analysis conducted by the product review experts.


Full Text:

PDF

References


Ahan, J. K. and Kim, H. W., “Building a Korean Sentiment Dictionary and Applications of Natural Languages Processing,” Korea Intelligent Information Systems Society, 2014.

Hogenboom, A., Bal, D., and Frasincar, F., Exploiting Emoticons in Sentiment Analysis, SAC, 2013.

Hu, M. and Liu, B., “Mining and Summarizing Customer Reviews,” KDD, pp. 168-177, 2004.

Hu, M. H., Sun, A. and Lim, E. P., “Comments-Oriented Blog Summarization by Sentence Extraction,” CIKM, pp. 901-904, 2007.

Internet Price Comparision Site (www.bb.co.kr).

Khabiri, E., Caverlee, J., and Hsu, C. F., “Summarizing User-Contributed Comments,” Association for the Advancement of Artificial Intelligence, 2011.

Kim, J. O., Lee, S. S., and Yong, H. S., “Automatic Opinion Classfication of Korean Text,” Korean Institute of Information Scientists and Engineers, Vol. 38, No. 6, pp. 423-428, 2011.

Kokoma Morphological Analyzer http://kkma.snu.ac.kr/.

Korean Information Science Society: Linguistic Engineering Research Association https://sites.google.com/site/sighclt/.

Lee, K. S., Song, H. J., Son, Y. W., Hwang, M. J., and Park, Y. S., “Cognitive Psychological Approach to Statistics,” Statistical Survey, 2008.

Lee, W. C., Lee, H. A., and Lee, K. J., “Automatic Product Feature Extraction for Efficient Analysis of Product Reviews Using Term Statistics,” Journal of Korean Institute of Information Scientists and Engineers, Vol. 16B, , No. 6, pp. 497-502, 2009.

Lee, Y. J., Ji, J. H., Woo, G., and Cho, H. G., “Analysis and Visualization for Comment Messages of Internet Posts,” Journal of The Korea Contents Association, Vol. 9, No. 7, pp. 45-56, 2009.

Menupan.com, http://www.menupan.com/.

Myung, J. S., Lee, D. J., and Lee, S. G., “A Korean Product Review Analysis System Using a Semi-Automatically Constructed Semantic Dictionary,” Journal of KISS: Software and Applications. Vol. 35, No. 6, pp. 392-403, 2008.

National Statistics Portal, http://kosis.kr/.

Naver Shopping, http://shopping.naver.com/.

Oh, P. H., Hwang, B. Y., “Real-time Spatial Recommendation System based on Sentiment Analysis of Twitter,” The Journal of Society for e-Business Studies, Vol. 21, No. 3, pp. 15-28, 2016.

SeleniumHQ Browser Automation (http://www.seleniumhq.org).

SentiWordNet http://sentiwordnet.isti.cnr.it/.

Sipos, R. and Joachims, T., “Generating Comparative Summaries from Reviews,” CIKM, pp. 1853-1856, 2013.

Socher, R., Perelygin, A., Wu, J. Y., Chuang, J., Manning, C. D., Ng, A. Y., and Potts, C., Recursive Deep Models for Semantic Compositionality Over a Sentiment, Treebank, EMNLP, 2013.

Somprasertsri, G. and Lalitrojwong, P., “Mining Feature-Opinion in Online Customer Reviews for Opinion Summarization,” Journal of Universal Computer Science, Vol. 16, No. 6, pp. 938-955, 2010.

Song, J. S. and Lee, S. W., “Automatic Construction of Positive/Negative Feature-Predicate Dictionary for Polarity Classification of Product Reviews,” Korean Institute of Information Scientists and Engineers, Vol. 38, No. 3, pp. 157-168, 2011.

Song, S. I., Lee, D. J., Lee, S. G., “Identifying Sentiment Polarity of Korean Vocabulary Using PMI,” Korea Computer Integrated Circuits Conference Vol. 37, No. 1, pp. 260-265, 2010.

Wikipedia https://en.wikipedia.org/wiki/Normalization_(statistics).

Yatani, K., Novati, M., Trusty, A., and Truong, K. N., “Review Spotlight: A User Interface for Summarizing User-generated Reviews Using Adjective-Noun Word Pairs,” CHI, pp. 1541-1550, 2011.

Zhai, Z., Liu, B., Xu, H., and Jia, P., “Clustering Product Features for Opinion Mining,” ACM, pp. 347-354, 2011.


Refbacks

  • There are currently no refbacks.