|
|
Towards kernelizing the classifier for hyperbolic data |
Meimei YANG1,2, Qiao LIU1,2, Xinkai SUN1,2, Na SHI1,2, Hui XUE1,2() |
1. School of Computer Science and Engineering, Southeast University, Nanjing 210096, China 2. MOE Key Laboratory of Computer Science and Information Integration (Southeast University), Nanjing 210096, China |
|
|
Abstract Data hierarchy, as a hidden property of data structure, exists in a wide range of machine learning applications. A common practice to classify such hierarchical data is first to encode the data in the Euclidean space, and then train a Euclidean classifier. However, such a paradigm leads to a performance drop due to distortion of data embedding in the Euclidean space. To relieve this issue, hyperbolic geometry is investigated as an alternative space to encode the hierarchical data for its higher ability to capture the hierarchical structures. Those methods cannot explore the full potential of the hyperbolic geometry, in the sense that such methods define the hyperbolic operations in the tangent plane, causing the distortion of data embeddings. In this paper, we develop two novel kernel formulations in the hyperbolic space, with one being positive definite (PD) and another one being indefinite, to solve the classification tasks in hyperbolic space. The PD one is defined via mapping the hyperbolic data to the Drury-Arveson (DA) space, which is a special reproducing kernel Hilbert space (RKHS). To further increase the discrimination of the classifier, an indefinite kernel is further defined in the Kreĭn spaces. Specifically, we design a 2-layer nested indefinite kernel which first maps hyperbolic data into the DA spaces, followed by a mapping from the DA spaces to the Kreĭn spaces. Extensive experiments on real-world datasets demonstrate the superiority of the proposed kernels.
|
Keywords
data hierarchy
hyperbolic geometry
drury-arveson space
kreĭn space
|
Corresponding Author(s):
Hui XUE
|
About author: Changjian Wang and Zhiying Yang contributed equally to this work. |
Just Accepted Date: 26 October 2022
Issue Date: 27 February 2023
|
|
1 |
J, Gong Z, Teng Q, Teng H, Zhang L, Du S, Chen Z A, Bhuiyan J, Li M, Liu H Ma . Hierarchical graph transformer-based deep learning model for large-scale multi-label text classification. IEEE Access, 2020, 8: 30885–30896
|
2 |
Q, Wang Z, Mao B, Wang L Guo . Knowledge graph embedding: a survey of approaches and applications. IEEE Transactions on Knowledge and Data Engineering, 2017, 29( 12): 2724–2743
|
3 |
X, Du Y Xia . Natural images enhancement using structure extraction and retinex. In: Proceedings of the 20th International Conference on Advanced Concepts for Intelligent Vision Systems. 2020, 408−420
|
4 |
S, Kim C, Song J, Jang J Paik . Edge-aware image filtering using a structure-guided CNN. IET Image Processing, 2020, 14( 3): 472–479
|
5 |
J, Long X, Feng X, Zhu J, Zhang G Gou . Efficient superpixel-guided interactive image segmentation based on graph theory. Symmetry, 2018, 10( 5): 169
|
6 |
H, Wang F, Zhang M, Zhang J, Leskovec M, Zhao W, Li Z Wang . Knowledge-aware graph neural networks with label smoothness regularization for recommender systems. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019, 968−977
|
7 |
R, Ying R, He K, Chen P, Eksombatchai W L, Hamilton J Leskovec . Graph convolutional neural networks for web-scale recommender systems. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2018, 974−983
|
8 |
S, Lee S, Park M, Kahng S G Lee . PathRank: a novel node ranking measure on a heterogeneous graph for recommender systems. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management. 2012, 1637−1641
|
9 |
R, Ma P, Fang T, Drummond M Harandi . Adaptive poincaré point to set distance for few-shot classification. In: Proceedings of the 36th AAAI Conference on Artificial Intelligence. 2022, 1926−1934
|
10 |
J, Sun Y, Xie H, Zhang C Faloutsos . Less is more: compact matrix decomposition for large sparse graphs. In: Proceedings of the 7th SIAM International Conference on Data Mining. 2007, 366−377
|
11 |
B, Perozzi R, Al-Rfou S Skiena . DeepWalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2014, 701−710
|
12 |
E, Hajiramezanali A, Hasanzadeh N, Duffield K, Narayanan M, Zhou X Qian . Variational graph recurrent neural networks. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems. 2019, 960
|
13 |
N, Linial E, London Y Rabinovich . The geometry of graphs and some of its algorithmic applications. Combinatorica, 1995, 15( 2): 215–245
|
14 |
D, Krioukov F, Papadopoulos M, Kitsak A, Vahdat M Boguñá . Hyperbolic geometry of complex networks. Physical Review E, 2010, 82( 3): 036106
|
15 |
M, Nickel D Kiela . Poincaré embeddings for learning hierarchical representations. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. 2017, 6341−6350
|
16 |
G, Alanis-Lobato P, Mier M A Andrade-Navarro . Efficient embedding of complex networks to hyperbolic space via their Laplacian. Scientific Reports, 2016, 6: 30108
|
17 |
B P, Chamberlain J, Clough M P Deisenroth . Neural embeddings of graphs in hyperbolic space. 2017, arXiv preprint arXiv: 1705.10359
|
18 |
O E, Ganea G, Bécigneul T Hofmann . Hyperbolic entailment cones for learning hierarchical embeddings. In: Proceedings of the 35th International Conference on Machine Learning. 2018, 1632−1641
|
19 |
M, Nickel D Kiela . Learning continuous hierarchies in the Lorentz model of hyperbolic geometry. In: Proceedings of the 35th International Conference on Machine Learning. 2018, 3779−3788
|
20 |
Sa C, De A, Gu C, Ré F Sala . Representation tradeoffs for hyperbolic embeddings. In: Proceedings of Machine Learning Research, 2018, 80: 4460−4469
|
21 |
R, Suzuki R, Takahama S Onoda . Hyperbolic disk embeddings for directed acyclic graphs. In: Proceedings of International Conference on Machine Learning. 2019, 6066−6075
|
22 |
I, Balažević C, Allen T Hospedales . Multi-relational Poincaré graph embeddings. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems. 2019, 401
|
23 |
R, Sonthalia A C Gilbert . Tree! I am no tree! I am a low dimensional hyperbolic embedding. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020, 72
|
24 |
M, Weber M, Zaheer A S, Rawat A, Menon S Kumar . Robust large-margin learning in hyperbolic space. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. 2020, 1499
|
25 |
H, Cho B, DeMeo J, Peng B Berger . Large-margin classification in hyperbolic space. In: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics. 2019, 1832−1840
|
26 |
P, Fang M, Harandi L Petersson . Kernel methods in hyperbolic spaces. In: Proceedings of 2021 IEEE/CVF International Conference on Computer Vision (ICCV). 2021, 10665−10674
|
27 |
R Rochberg . Complex hyperbolic geometry and Hilbert spaces with complete pick kernels. Journal of Functional Analysis, 2019, 276( 5): 1622–1679
|
28 |
S Cucerzan . Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings of 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL). 2007, 708−716
|
29 |
B, Rozemberczki R, Davies R, Sarkar C Sutton . GEMSEC: graph embedding with self clustering. In: Proceedings of 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 2019, 65−72
|
30 |
B, Zhao P, Sen L Getoor . Entity and relationship labeling in affiliation networks. In: Proceedings of ICML Workshop on Statistical Network Analysis. 2006
|
31 |
O, Shchur M, Mumme A, Bojchevski S Günnemann . Pitfalls of graph neural network evaluation. 2018, arXiv preprint arXiv: 1811.05868
|
32 |
A, Bojchevski S Günnemann . Deep Gaussian embedding of graphs: unsupervised inductive learning via ranking. In: Proceedings of the 6th International Conference on Learning Representations. 2018
|
33 |
J R Parker . Notes on complex hyperbolic geometry. Preprint, 2003
|
34 |
J G Ratcliffe . Foundations of Hyperbolic Manifolds. New York: Springer, 1994
|
35 |
W M Goldman . Complex Hyperbolic Geometry. Oxford: Oxford University Press, 1999
|
36 |
J, Kim C D Scott . L2 kernel classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32( 10): 1822–1831
|
37 |
B, Bordelon A, Canatar C Pehlevan . Spectrum dependent learning curves in kernel regression and wide neural networks. In: Proceedings of the 37th International Conference on Machine Learning. 2020, 96
|
38 |
Z, Kang L, Wen W, Chen Z Xu . Low-rank kernel learning for graph-based clustering. Knowledge-Based Systems, 2019, 163: 510–517
|
39 |
S W, Ober C E, Rasmussen der Wilk M van . The promises and pitfalls of deep kernel learning. In: Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence. 2021, 1206−1216
|
40 |
P, Fang J, Zhou S K, Roy P, Ji L, Petersson M Harandi . Attention in attention networks for person retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44( 9): 4626–4641
|
41 |
C S, Ong X, Mary S, Canu A J Smola . Learning with non-positive kernels. In: Proceedings of the 21st International Conference on Machine Learning. 2004
|
42 |
A A Ungar . The hyperbolic Pythagorean theorem in the Poincaré disc model of hyperbolic geometry. The American Mathematical Monthly, 1999, 106( 8): 759–763
|
43 |
O, Shalit O Shalit . Operator theory and function theory in Drury–Arveson space and its quotients. In: Alpay D, ed. Operator Theory. Basel: Springer, 2014, 1−50
|
44 |
N, Arcozzi R, Rochberg E, Sawyer B D Wick . Distance functions for reproducing kernel Hilbert spaces. Contemp., 2011, 547: 25−53
|
45 |
A A Ungar . From Pythagoras to Einstein: the hyperbolic Pythagorean theorem. Foundations of Physics, 1998, 28( 8): 1283–1321
|
46 |
G S, Birman A A Ungar . The hyperbolic derivative in the Poincaré ball model of hyperbolic geometry. Journal of Mathematical Analysis and Applications, 2001, 254( 1): 321–333
|
47 |
T Dray . The Geometry of Special Relativity. Boca Raton: CRC Press, 2012
|
48 |
G, Loosli S, Canu C S Ong . Learning SVM in Kreĭn spaces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38( 6): 1204–1216
|
49 |
H M, Xu H, Xue X H, Chen Y Y Wang . Solving indefinite kernel support vector machine with difference of convex functions programming. In: Proceedings of the 31st AAAI Conference on Artificial Intelligence. 2017, 2782−2788
|
50 |
D, Oglic T Gärtner . Learning in reproducing kernel Krein spaces. In: Proceedings of the International Conference on Machine Learning. 2018, 3856−3864
|
51 |
J, McAuley C, Targett Q, Shi den Hengel A van . Image-based recommendations on styles and substitutes. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2015, 43−52
|
52 |
A K, McCallum K, Nigam J, Rennie K Seymore . Automating the construction of internet portals with machine learning. Information Retrieval, 2000, 3( 2): 127–163
|
53 |
A A Deshmukh . Kernel approximation. Stats 608, 2015, 1–3
|
54 |
J Platt . Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in Large Margin Classifiers, 1999, 10( 3): 61–74
|
55 |
B, Schölkopf A J Smola . Learning with Kernels: support vector machines, regularization, optimization, and beyond. MIT Press, 2002
|
56 |
C C, Chang C J Lin . LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2011, 2( 3): 27
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|