Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front. Comput. Sci.    2015, Vol. 9 Issue (2) : 210-223    https://doi.org/10.1007/s11704-014-3492-0
RESEARCH ARTICLE
Product-oriented review summarization and scoring
Rong ZHANG1(),Wenzhe YU1(),Chaofeng SHA2(),Xiaofeng HE1,*(),Aoying ZHOU1()
1. Institute of Data Science and Engineering, Shanghai Key Laboratory of Trustworthy Computing, East China Normal University, Shanghai 200062, China
2. School of Computer Science, Fudan University, Shanghai 200433, China
 Download: PDF(610 KB)  
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

Currently, there are many online review web sites where consumers can freely write comments about different kinds of products and services. These comments are quite useful for other potential consumers. However, the number of online comments is often large and the number continues to grow as more and more consumers contribute. In addition, one comment may mention more than one product and contain opinions about different products, mentioning something good and something bad. However, they share only a single overall score. Therefore, it is not easy to know the quality of an individual product from these comments.

This paper presents a novel approach to generate review summaries including scores and description snippets with respect to each individual product. From the large number of comments, we first extract the context (snippet) that includes a description of the products and choose those snippets that express consumer opinions on them. We then propose several methods to predict the rating (from 1 to 5 stars) of the snippets. Finally, we derive a generic framework for generating summaries from the snippets. We design a new snippet selection algorithm to ensure that the returned results preserve the opinion-aspect statistical properties and attribute-aspect coverage based on a standard seat allocation algorithm. Through experiments we demonstrate empirically that our methods are effective. We also quantitatively evaluate each step of our approach.

Keywords online transaction      diversification      review summarization      review scoring     
Corresponding Author(s): Xiaofeng HE   
Issue Date: 07 April 2015
 Cite this article:   
Rong ZHANG,Wenzhe YU,Chaofeng SHA, et al. Product-oriented review summarization and scoring[J]. Front. Comput. Sci., 2015, 9(2): 210-223.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-014-3492-0
https://academic.hep.com.cn/fcs/EN/Y2015/V9/I2/210
1 Liu J, Cao Y, Lin C Y, Huang Y, Zhou M. Low-quality product review detection in opinion summarization. In: Proceedings of the Joint Meeting of the Conference on Empirical Methods on Natural Language Processing and the Conference on Natural Language Learing. 2007, 334-342
2 Lappas T, Crovella M, Terzi E. Selecting a characteristic set of reviews. In: Proceedings of the 18th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2012, 832-840
https://doi.org/10.1145/2339530.2339663
3 Dang V, Croft W B. Diversity by proportionality: an election-based approach to search result diversification. In: Proceedings of the 35th ACM SIGIR Conference on Research and Development in Information Retrieval. 2012, 65-74
https://doi.org/10.1145/2348283.2348296
4 Tsaparas P, Ntoulas A, Terzi E. Selecting a comprehensive set of reviews. In: Proceedings of the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2011, 168-176
https://doi.org/10.1145/2020408.2020440
5 Yu W, Zhang R, He X. Selecting a diversified set of reviews. Lecture Notes in Computer Science, 2013, 7808, 721-733
https://doi.org/10.1007/978-3-642-37401-2_70
6 Sinha P, Mehrotra S, Jain R. Summarization of personal photologs using multidimensional content and context. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval. 2011, 4
https://doi.org/10.1145/1991996.1992000
7 Lu Y, Tsaparas P, Ntoulas A, Polanyi L. Exploiting social context for review quality prediction. In: Proceedings of the 19th international conference on World Wide Web. 2010, 691-700
https://doi.org/10.1145/1772690.1772761
8 O’Mahony M P, Smyth B. Learning to recommend helpful hotel reviews. In: Proceedings of the 3rd ACM Conference on Recommender Systems. 2009, 305-308
https://doi.org/10.1145/1639714.1639774
9 Kim S M, Pantel P, Chklovski T, Pennacchiotti M. Automatically assessing review helpfulness. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. 2006, 423-430
https://doi.org/10.3115/1610075.1610135
10 Liu Y, Huang X, An A, Yu X. Modeling and predicting the helpfulness of online reviews. In: Proceedings of the 8th IEEE International Conference on Data Mining. 2008, 443-452
https://doi.org/10.1109/ICDM.2008.94
11 Zhang R, Sha C F, Zhou M Q, Zhou A Y. Exploiting shopping and reviewing behavior to re-score online evaluations. In: Proceedings of the 21st International Conference Companion on World Wide Web. 2012, 649-650
https://doi.org/10.1145/2187980.2188171
12 Lappas T, Gunopulos D. Efficient confident search in large review corpora. Lecture Notes in Computer Science, 2010, 6322: 195-210
https://doi.org/10.1007/978-3-642-15883-4_13
13 Ganesan K, Zhai C, Viegas E. Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions. In: Proceedings of the 21st International Conference Companion on World Wide Web. 2012, 869-878
https://doi.org/10.1145/2187836.2187954
14 Hu M, Liu B. Mining and summarizing customer reviews. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2004, 168-177
https://doi.org/10.1145/1014052.1014073
15 Zhuang L, Jing F, Zhu X Y. Movie review mining and summarization. In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management. 2006, 43-50
https://doi.org/10.1145/1183614.1183625
16 Meng X, Wang H. Mining user reviews: from specification to summarization. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers. 2009, 177-180
https://doi.org/10.3115/1667583.1667637
17 Moghaddam S, Ester M. Opinion digger: an unsupervised opinion miner from unstructured product reviews. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management. 2010, 1825-1828
https://doi.org/10.1145/1871437.1871739
18 Shimada K, Tadano R, Endo T. Multi-aspects review summarization with objective information. Procedia-Social and Behavioral Sciences, 2011, 27: 140-149
https://doi.org/10.1016/j.sbspro.2011.10.592
19 Zhan J, Loh H T, Liu Y. Gather customer concerns from online product reviews- a text summarization approach. Expert Systems with Applications, 2009, 36(2): 2107-2115
https://doi.org/10.1016/j.eswa.2007.12.039
20 Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the ACL- 02 Conference on Empirical Methods in Natural Language Processing. 2002, 79-86
21 Turney P D. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. 2002, 417-424
22 Pang B, Lee L. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2008, 2(1-2): 1-135
https://doi.org/10.1561/1500000011
23 Kruengkrai C, Uchimoto K, Kazama J, Wang Y, Torisawa K, Isahara H. An error-driven word-character hybrid model for joint Chinese word segmentation and POS tagging. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. 2009, 513-521
https://doi.org/10.3115/1687878.1687951
24 McCallum A, Nigam K. A comparison of event models for naive bayes text classification. AAAI-98 Workshop on Learning for Text Categorization, 1998, 752: 41-48
25 Berger A L, Pietra S A D, Pietra V J D. A maximum entropy approach to natural language processing. Journal of Computational Linguistics, 1996, 22(1): 39-71
26 Vapnik V. The Nature of Statistical Learning Theory. New York: Springer-Verlag, 1995
https://doi.org/10.1007/978-1-4757-2440-0
27 Qiu G, Liu B, Bu J, Chen C. Opinion word expansion and target extraction through double propagation. Computational Linguistics, 2011, 37(1): 9-27
https://doi.org/10.1162/coli_a_00034
28 Zhai Z, Liu B, Zhang L, Xu H, Jia P. Identifying evaluative sentences in online discussions. In: Proceedings of the 25th AAAI Conference on Artificial Intelligence. 2011, 933-938
29 Petrov S, McDonald R. Overview of the 2012 shared task on parsing the web. Notes of the First Workshop on Syntactic Analysis of Non-Canonical Language, 2012, 59
30 Koo T, Carreras X, Collins M. Simple semi-supervised dependency parsing. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics. 2008, 595-603
31 Zhao Y, Karypis G. Criterion Functions for Document Clustering: Experiments and Analysis. Technical Report. 2001
32 Lapata M. Automatic evaluation of information ordering: Kendall’stau. Computational Linguistics, 2006, 32(4): 471-484
https://doi.org/10.1162/coli.2006.32.4.471
33 Gunawardana A, Shani G. A survey of accuracy evaluation metrics of recommendation tasks. The Journal of Machine Learning Research, 2009, 10: 2935-2962
34 Herlocker J L, Konstan J A, Terveen L G, Riedl J T. Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems, 2004, 22(1): 5-53
https://doi.org/10.1145/963770.963772
35 Chapelle O, Metlzer D, Zhang Y, Grinspan P. Expected reciprocal rank for graded relevance. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management. 2009, 621-630
https://doi.org/10.1145/1645953.1646033
36 Manning C D, Raghavan P, Schütze H. Introduction to Information Retrieval. New York: Cambridge University Press, 2008
https://doi.org/10.1017/CBO9780511809071
[1] Supplementary Material-Highlights in 3-page ppt
Download
[1] Meifan ZHANG, Hongzhi WANG, Jianzhong LI, Hong GAO. Diversification on big data in query processing[J]. Front. Comput. Sci., 2020, 14(4): 144607-.
[2] Wanyu CHEN, Fei CAI, Honghui CHEN, Maarten DE RIJKE. Personalized query suggestion diversification in information retrieval[J]. Front. Comput. Sci., 2020, 14(3): 143602-.
[3] Chaofeng SHA,Keqiang WANG,Dell ZHANG,Xiaoling WANG,Aoying ZHOU. Optimizing top-k retrieval: submodularity analysis and search strategies[J]. Front. Comput. Sci., 2016, 10(3): 477-487.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed