Multi-label learning is more complicated than single-label learning because the semantics of instances usually overlap rather than being identical. Many algorithms lose effectiveness when the correlations in the feature space and the label space are not fully exploited. To this end, we propose a novel non-negative matrix factorization (NMF) based modeling and training algorithm that learns from both the adjacencies of the instances and the labels of the training set. In the modeling process, a set of generators is constructed, and associations among generators, instances, and labels are established, on which label prediction is based. In the training process, the parameters involved in the modeling process are determined. Specifically, an NMF based algorithm is proposed to determine the associations between generators and instances, and a non-negative least squares optimization algorithm is applied to determine the associations between generators and labels. The proposed algorithm takes full advantage of the smoothness assumption, so that the labels are properly propagated. Experiments were carried out on six benchmark datasets. The results demonstrate the effectiveness of the proposed algorithm.
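The two training steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact objective, so the sketch assumes a Gaussian similarity matrix as the instance adjacency, the standard Lee-Seung multiplicative updates for NMF, and SciPy's `nnls` solver for the non-negative least squares step. The function names `nmf` and `fit_generator_labels`, the number of generators `k=4`, and the random toy data are hypothetical choices made only for illustration.

```python
import numpy as np
from scipy.optimize import nnls


def nmf(A, k, n_iter=200, eps=1e-9, seed=0):
    """Factorize a non-negative matrix A (n x m) as W @ H with
    Lee-Seung multiplicative updates (Frobenius-norm objective)."""
    rng = np.random.default_rng(seed)
    n, m = A.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        H *= (W.T @ A) / (W.T @ W @ H + eps)
        W *= (A @ H.T) / (W @ H @ H.T + eps)
    return W, H


def fit_generator_labels(W, Y):
    """For each label column y, solve min ||W b - y||_2 subject to b >= 0."""
    B = np.zeros((W.shape[1], Y.shape[1]))
    for j in range(Y.shape[1]):
        B[:, j], _ = nnls(W, Y[:, j])
    return B


# Toy data (assumed): 30 instances, 5 features, 3 binary labels.
rng = np.random.default_rng(0)
X = rng.random((30, 5))
Y = (rng.random((30, 3)) > 0.5).astype(float)

# Assumed adjacency: Gaussian similarity between all pairs of instances.
sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
A = np.exp(-sq_dists)

# Step 1: instance-generator associations W via NMF of the adjacency.
W, H = nmf(A, k=4)

# Step 2: generator-label associations B via non-negative least squares.
B = fit_generator_labels(W, Y)

# Label scores are propagated from generators to instances.
scores = W @ B
```

In this sketch, instances with similar rows in `W` receive similar label scores, which is one simple way to reflect the smoothness assumption; thresholding `scores` to obtain label sets is left out because the abstract does not specify how that is done.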