Evaluating and improving the interpretability of item embeddings using item-tag relevance information

doi:10.1007/s11704-019-7427-7

Front. Comput. Sci.

2020, Vol. 14

Issue (3) : 143603 https://doi.org/10.1007/s11704-019-7427-7

RESEARCH ARTICLE

Evaluating and improving the interpretability of item embeddings using item-tag relevance information

Tao LIAN¹, Lin DU², Mingfu ZHAO³, Chaoran CUI⁴, Zhumin CHEN⁵(

), Jun MA⁵

¹. College of Data Science, Taiyuan University of Technology, Jinzhong 030600, China
². Software College, Shandong University, Jinan 250101, China
³. School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing 101408, China
⁴. School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan 250014, China
⁵. School of Computer Science and Technology, Shandong University, Qingdao 266237, China

Download: PDF(1946 KB)
Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks

Abstract

Matrix factorization (MF) methods have superior recommendation performance and are flexible to incorporate other side information, but it is hard for humans to interpret the derived latent factors. Recently, the item-item cooccurrence information is exploited to learn item embeddings and enhance the recommendation performance. However, the item-item co-occurrence information, constructed from the sparse and long-tail distributed user-item interaction matrix, is over-estimated for rare items, which could lead to bias in learned item embeddings. In this paper, we seek to evaluate and improve the interpretability of item embeddings by leveraging a dense item-tag relevance matrix. Specifically, we design two metrics to quantitatively evaluate the interpretability of item embeddings from different viewpoints: interpretability of individual dimensions of item embeddings and semantic coherence of local neighborhoods in the latent space.We also propose a tag-informed item embedding (TIE) model that jointly factorizes the user-item interaction matrix, the item-item co-occurrence matrix and the item-tag relevance matrix with shared item embeddings so that different forms of information can co-operate with each other to learn better item embeddings. Experiments on the MovieLens20M dataset demonstrate that compared with other state-of-the-art MF methods, TIE achieves better top-N recommendations, and the relative improvement is larger when the user-item interaction matrix becomes sparser. By leveraging the itemtag relevance information, individual dimensions of item embeddings are more interpretable and local neighborhoods in the latent space are more semantically coherent; the bias in learned item embeddings are also mitigated to some extent.

Keywords recommender system matrix factorization item embedding item-tag relevance interpretability

Corresponding Author(s): Zhumin CHEN

Issue Date: 10 January 2020

Cite this article:

Tao LIAN,Lin DU,Mingfu ZHAO, et al. Evaluating and improving the interpretability of item embeddings using item-tag relevance information[J]. Front. Comput. Sci., 2020, 14(3): 143603.

URL:

https://academic.hep.com.cn/fcs/EN/10.1007/s11704-019-7427-7
https://academic.hep.com.cn/fcs/EN/Y2020/V14/I3/143603

1	R Salakhutdinov, A Mnih. Probabilistic matrix factorization. In: Proceedings of the 20th International Conference on Neural Information Processing Systems. 2007, 1257–1264
2	Y Hu, Y Koren, C Volinsky. Collaborative filtering for implicit feedback datasets. In: Proceedings of the 2008 IEEE International Conference on Data Mining. 2008, 263–272
3	R Pan, Y Zhou, B Cao, N N Liu, R Lukose, M Scholz, Q Yang. Oneclass collaborative filtering. In: Proceedings of the 2008 IEEE International Conference on Data Mining. 2008, 502–511
4	Y Koren, R Bell, C Volinsky. Matrix factorization techniques for recommender systems. Computer, 2009, 42(8): 30–37
5	T Mikolov, I Sutskever, K Chen, G Corrado, J Dean. Distributed representations of words and phrases and their compositionality. In: Proceedings of the 26th International Conference on Neural Information Processing Systems. 2013, 3111–3119
6	O Levy, Y Goldberg. Neural word embedding as implicit matrix factorization. In: Proceedings of the 27th International Conference on Neural Information Processing Systems. 2014, 2177–2185
7	J Pennington, R Socher, C D Manning. GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014, 1532–1543
8	N Zhou, W X Zhao, X Zhang, J R Wen, S Wang. A general multicontext embedding model for mining human trajectory data. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(8): 1945–1958
9	D Liang, J Altosaar, L Charlin, D M Blei. Factorization meets the item embedding: regularizing matrix factorization with item co-occurrence. In: Proceedings of the 10th ACM Conference on Recommender Systems. 2016, 59–66
10	C Park, D Kim, J Oh, H Yu. Do “also-viewed” products help user rating prediction? In: Proceedings of the 26th International Conference on World Wide Web. 2017, 1113–1122
11	D Cao, L Nie, X He, X Wei, S Zhu, T S Chua. Embedding factorization models for jointly recommending items and user generated lists. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2017, 585–594
12	P D Turney, P Pantel. From frequency to meaning: vector space models of semantics. Journal of Artificial Intelligence Research, 2010, 37: 141–188
13	J Vig, S Sen, J Riedl. The tag genome: encoding community knowledge to support novel interaction. ACM Transactions on Interactive Intelligent Systems, 2012, 2(3): 13
14	J Chang, J Boyd-Graber, S Gerrish, C Wang, D M Blei. Reading tea leaves: how humans interpret topic models. In: Proceedings of the 22nd International Conference on Neural Information Processing Systems. 2009, 288–296
15	B Murphy, P P Talukdar, T Mitchell. Learning effective and interpretable semantic models using non-negative sparse embedding. In: Proceedings of the 24th International Conference on Computational Linguistics. 2012, 1933–1950
16	M Faruqui, Y Tsvetkov, D Yogatama, C Dyer, N A Smith. Sparse overcomplete word vector representations. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. 2015, 1491–1500
17	F Sun, J Guo, Y Lan, J Xu, X Cheng. Sparse word embeddings using ℓ 1 regularized online learning. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2016, 2915–2921
18	P Cremonesi, Y Koren, R Turrin. Performance of recommender algorithms on top-n recommendation tasks. In: Proceedings of the 4th ACM Conference on Recommender Systems. 2010, 39–46
19	X He, H Zhang, M Y Kan, T S Chua. Fast matrix factorization for online recommendation with implicit feedback. In: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2016, 549–558
20	D Lian, Y Ge, F Zhang, N J Yuan, X Xie, T Zhou, Y Rui. Scalable content-aware collaborative filtering for location recommendation. IEEE Transactions on Knowledge and Data Engineering, 2018, 30(6): 1122–1135
21	C Anderson. The long tail. Wired Magazine, 2004, 12(10): 170–177
22	S Sen, F M Harper, A LaPitz, J Riedl. The quest for quality tags. In: Proceedings of the 2007 International ACM Conference on Supporting Group Work. 2007, 361–370
23	H F Yu, C J Hsieh, S Si, I Dhillon. Scalable coordinate descent approaches to parallel matrix factorization for recommender systems. In: Proceedings of the 2012 IEEE International Conference on Data Mining. 2012, 765–774
24	M Levy, M Sandler. A semantic space for music derived from social tags. In: Proceedings of the 8th International Conference on Music Information Retrieval. 2007, 411–416
25	R Sinha, K Swearingen. The role of transparency in recommender systems. In: Proceedings of the 2002 Conference on Human Factors in Computing Systems. 2002, 830–831
26	F M Harper, J A Konstan. The movielens datasets: history and context. ACM Transactions on Interactive Intelligent Systems, 2015, 5(4): 19
27	A P Singh, G J Gordon. Relational learning via collective matrix factorization. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2008, 650–658
28	I Pilászy, D Tikk. Recommending new movies: even a few ratings are more valuable than metadata. In: Proceedings of the 3rd ACM Conference on Recommender Systems. 2009, 93–100
29	H Abdollahpouri, R Burke, B Mobasher. Controlling popularity bias in learning-to-rank recommendation. In: Proceedings of the 11th ACM Conference on Recommender Systems. 2017, 42–46
30	C Marlow, M Naaman, D Boyd, M Davis. HT06, tagging paper, taxonomy, flickr, academic article, to read. In: Proceedings of the 17th Conference on Hypertext and Hypermedia. 2006, 31–40
31	M Gupta, R Li, Z Yin, J Han. Survey on social tagging techniques. SIGKDD Explorations Newsletter, 2010, 12(1): 58–72
32	K H L Tso-Sutter, L B Marinho, L Schmidt-Thieme. Tag-aware recommender systems by fusion of collaborative filtering algorithms. In: Proceedings of the 2008 ACM Symposium on Applied Computing. 2008, 1995–1999
33	T Bogers, A van den Bosch. Collaborative and content-based filtering for item recommendation on social bookmarking websites. In: Proceedings of the Workshop on Recommender Systems and the Social Web. 2009, 9–16
34	D Cai, X He, J Han, T S Huang. Graph regularized nonnegative matrix factorization for data representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(8): 1548–1560
35	T C Zhou, H Ma, I King, MR Lyu . TagRec: leveraging tagging wisdom for recommendation. In: Proceedings of the 2009 International Conference on Computational Science and Engineering. 2009, 194–199
36	Y Zhen, W J Li, D Y Yeung. TagiCoFi: tag informed collaborative filtering. In: Proceedings of the 3rd ACM Conference on Recommender Systems. 2009, 69–76
37	L Wu, E Chen, Q Liu, L Xu, T Bao, L Zhang. Leveraging tagging for neighborhood-aware probabilistic matrix factorization. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management. 2012, 1854–1858
38	S Rendle. Factorization machines with libFM. ACM Transactions on Intelligent Systems and Technology, 2012, 3(3): 57
39	T Chen, W Zhang, Q Lu, K Chen, Z Zheng, Y Yu. SVDFeature: a toolkit for feature-based collaborative filtering. Journal of Machine Learning Research, 2012, 13(1): 3619–3622
40	Z Gantner, L Drumond, C Freudenthaler, S Rendle, L Schmidt-Thieme. Learning attribute-to-feature mappings for cold-start recommendations. In: Proceedings of the 2010 IEEE International Conference on Data Mining. 2010, 176–185
41	D Cohen, M Aharon, Y Koren, O Somekh, R Nissim. Expediting exploration by attribute-to-feature mapping for cold-start recommendations. In: Proceedings of the 11th ACM Conference on Recommender Systems. 2017, 184–192
42	D Lian, Y Ge, F Zhang, N J Yuan, X Xie, T Zhou, Y Rui. Content-aware collaborative filtering for location recommendation based on human mobility data. In: Proceedings of the 2015 IEEE International Conference on Data Mining. 2015, 261–270

[1]

Article highlights

Download

[1]	Yiteng PAN, Fazhi HE, Haiping YU. A correlative denoising autoencoder to model social influence for top-N recommender system[J]. Front. Comput. Sci., 2020, 14(3): 143301-.
[2]	Guijuan ZHANG, Yang LIU, Xiaoning JIN. A survey of autoencoder-based recommender systems[J]. Front. Comput. Sci., 2020, 14(2): 430-450.
[3]	Ming HE, Hao GUO, Guangyi LV, Le WU, Yong GE, Enhong CHEN, Haiping MA. Leveraging proficiency and preference for online Karaoke recommendation[J]. Front. Comput. Sci., 2020, 14(2): 273-290.
[4]	Liang SUN, Hongwei GE, Wenjing KANG. Non-negative matrix factorization based modeling and training algorithm for multi-label learning[J]. Front. Comput. Sci., 2019, 13(6): 1243-1254.
[5]	Dakun LIU,Xiaoyang TAN. Max-margin non-negative matrix factorization with flexible spatial constraints based on factor analysis[J]. Front. Comput. Sci., 2016, 10(2): 302-316.
[6]	Richong ZHANG,Han BAO,Hailong SUN,Yanghao WANG,Xudong LIU. Recommender systems based on ranking performance optimization[J]. Front. Comput. Sci., 2016, 10(2): 270-280.
[7]	Suhrid BALAKRISHNAN, Sumit CHOPRA. Two of a kind or the ratings game? Adaptive pairwise preferences and latent factor models[J]. Front Comput Sci, 2012, 6(2): 197-208.
[8]	Jiliang TANG, Xufei WANG, Huiji GAO, Xia HU, Huan LIU. Enriching short text representation in microblog for clustering[J]. Front Comput Sci, 2012, 6(1): 88-101.

Viewed

Full text

Abstract

Cited

Shared

Discussed