Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front. Comput. Sci.    2020, Vol. 14 Issue (5) : 145314    https://doi.org/10.1007/s11704-019-9097-x
RESEARCH ARTICLE
Probabilistic robust regression with adaptive weights–a case study on face recognition
Jin LI, Quan CHEN, Jingwen LENG, Weinan ZHANG, Minyi GUO()
Department of Computer Science, Shanghai Jiao Tong University, Shanghai 200240, China
 Download: PDF(584 KB)  
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

Robust regression plays an important role in many machine learning problems. A primal approach relies on the use of Huber loss and an iteratively reweighted l2 method. However, because the Huber loss is not smooth and its corresponding distribution cannot be represented as a Gaussian scale mixture, such an approach is extremely difficult to handle using a probabilistic framework. To address those limitations, this paper proposes two novel losses and the corresponding probability functions. One is called Soft Huber, which is well suited for modeling non-Gaussian noise. Another is Nonconvex Huber, which can help produce much sparser results when imposed as a prior on regression vector. They can represent any lq loss (12q<2) with tuning parameters, which makes the regression modelmore robust. We also show that both distributions have an elegant form, which is a Gaussian scale mixture with a generalized inverse Gaussian mixing density. This enables us to devise an expectation maximization (EM) algorithm for solving the regression model.We can obtain an adaptive weight through EM, which is very useful to remove noise data or irrelevant features in regression problems. We apply our model to the face recognition problem and show that it not only reduces the impact of noise pixels but also removes more irrelevant face images. Our experiments demonstrate the promising results on two datasets.

Keywords robust regression      nonconvex loss      face recognition     
Corresponding Author(s): Minyi GUO   
Issue Date: 20 January 2020
 Cite this article:   
Jin LI,Quan CHEN,Jingwen LENG, et al. Probabilistic robust regression with adaptive weights–a case study on face recognition[J]. Front. Comput. Sci., 2020, 14(5): 145314.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-019-9097-x
https://academic.hep.com.cn/fcs/EN/Y2020/V14/I5/145314
1 R Andersen. Modern Methods for Robust Regression. Sage, 2008
https://doi.org/10.4135/9781412985109
2 I Ben-Gal. Outlier Detection. Data Mining and Knowledge Discovery Handbook. Springer, 2010, 117–130
https://doi.org/10.1007/978-0-387-09823-4_7
3 S M Stigler. Gauss and the invention of least squares. The Annals of Statistics, 1981, 9(3): 465–474
https://doi.org/10.1214/aos/1176345451
4 P J Rousseeuw, M Hubert. Robust statistics for outlier detection. Wiley Interdisciplinary Reviews: DataMining and Knowledge Discovery, 2011, 1(1): 73–79
https://doi.org/10.1002/widm.2
5 P J Huber. Robust regression: asymptotics, conjectures and monte carlo. The Annals of Statistics, 1973, 1(5): 799–821
https://doi.org/10.1214/aos/1176342503
6 P J Huber, E M Ronchetti. Robust Statistics. 2nd ed. New Jersey: John Wiley & Sons, 2009
https://doi.org/10.1002/9780470434697
7 R Hartley, A Zisserman. Multiple View Geometry in Computer Vision. 2nd ed. Cambridge: Cambridge University Press, 2004
https://doi.org/10.1017/CBO9780511811685
8 M Figueiredo. Adaptive sparseness using jeffreys prior. In: Dietterich T G, Becker S, Ghahramani Z, eds. Advances in Neural Information Processing Systems. MIT Press, 2002, 697–704
9 A Kabán. On Bayesian classification with laplace priors. Pattern Recognition Letters, 2007, 28(10): 1271–1282
https://doi.org/10.1016/j.patrec.2007.02.010
10 K L Lange, R J A Little, J M G Taylor. Robust statistical modeling using the t distribution. Journal of the American Statistical Association, 1989, 84(408): 881–896
https://doi.org/10.1080/01621459.1989.10478852
11 P Jylänki, J Vanhatalo, A Vehtari. Robust gaussian process regression with a student-t likelihood. Journal of Machine Learning Research, 2011, 12: 3227–3257
12 K Lange, J S Sinsheimer. Normal/independent distributions and their applications in robust regression. Journal of Computational and Graphical Statistics, 1993, 2(2): 175–198
https://doi.org/10.1080/10618600.1993.10474606
13 M Gao, K Wang, L He. Probabilistic model checking and scheduling implementation of an energy router system in energy internet for green cities. IEEE Transactions on Industrial Informatics, 2018, 14(4): 1501–1510
https://doi.org/10.1109/TII.2018.2791537
14 J M Bernardo, A F M Smith. Bayesian Theory. New York: JohnWilley and Sons, 1994
https://doi.org/10.1002/9780470316870
15 L Xu, M I Jordan. On convergence properties of the EM algorithm for gaussian mixtures. Neural Computation, 1996, 8(1): 129–151
https://doi.org/10.1162/neco.1996.8.1.129
16 I Naseem, R Togneri, M Bennamoun. Robust regression for face recognition. Pattern Recognition, 2012, 45(1): 104–118
https://doi.org/10.1016/j.patcog.2011.07.003
17 M Yang, L Zhang, J Yang, D Zhang. Robust sparse coding for face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2011, 625–632
https://doi.org/10.1109/CVPR.2011.5995393
18 H Hu, K Wang, C Lv, J Wu, Z Yang. Semi-supervised metric learningbased anchor graph hashing for large-scale image retrieval. IEEE Transactions on Image Processing, 2019, 28(2): 739–754
https://doi.org/10.1109/TIP.2018.2860898
19 P J Huber. Robust Estimation of a Location Parameter. Breakthroughs in Statistics. Springer, New York, 1992, 492–518
https://doi.org/10.1007/978-1-4612-4380-9_35
20 R Tibshirani. Regression shrinkage and selection via the lasso: a retrospective. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 2011, 73(3): 273–282
https://doi.org/10.1111/j.1467-9868.2011.00771.x
21 C F J Wu. On the convergence properties of the EM algorithm. Annals of Statistics, 1983, 11: 95–103
https://doi.org/10.1214/aos/1176346060
22 t J Wrigh, A Y Yang, A Ganesh, S S Sastry, Y Ma. Robust face recognition via sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 31(2): 210–227
https://doi.org/10.1109/TPAMI.2008.79
23 A M Martinez. The AR face database. CVC Technical Report, 1998
24 A S Georghiades, P N Belhumeur, D J Kriegman. From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2001, 23(6): 643–660
https://doi.org/10.1109/34.927464
25 K C Lee, J Ho, D J Kriegman. Acquiring linear subspaces for face recognition under variable lighting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(5): 684–698
https://doi.org/10.1109/TPAMI.2005.92
26 A Krizhevsky, I Sutskever, G E Hinton. Imagenet classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems. 2012, 1097–1105
27 M Yang, L Zhang, J Yang, D Zhang. Regularized robust coding for face recognition. IEEE Transactions on Image Processing, 2013, 22(5): 1753–1766
https://doi.org/10.1109/TIP.2012.2235849
28 D F Andrews, C L Mallows. Scale mixtures of normal distributions. Journal of the Royal Statistical Society: Series B (Methodological), 1974, 36(1): 99–102
https://doi.org/10.1111/j.2517-6161.1974.tb00989.x
[1] Article highlights Download
[1] Yan LI, Shiguang SHAN, Ruiping WANG, Zhen CUI, Xilin CHEN. Fusing magnitude and phase features with multiple face models for robust face recognition[J]. Front. Comput. Sci., 2018, 12(6): 1173-1191.
[2] Xin LIU,Meina KAN,Wanglong WU,Shiguang SHAN,Xilin CHEN. VIPLFaceNet: an open source deep face recognition SDK[J]. Front. Comput. Sci., 2017, 11(2): 208-218.
[3] Qicong WANG,Binbin WANG,Xinjie HAO,Lisheng CHEN,Jingmin CUI,Rongrong JI,Yunqi LEI. Face recognition by decision fusion of two-dimensional linear discriminant analysis and local binary pattern[J]. Front. Comput. Sci., 2016, 10(6): 1118-1129.
[4] Lishan QIAO, Limei ZHANG, Songcan CHEN. Dimensionality reduction with adaptive graph[J]. Front Comput Sci, 2013, 7(5): 745-753.
[5] Pu HUANG, Zhenmin TANG, Caikou CHEN, Xintian CHENG. Nearest-neighbor classifier motivated marginal discriminant projections for face recognition[J]. Front Comput Sci Chin, 2011, 5(4): 419-428.
[6] YANG Jian, YANG Jingyu, ZHANG David. Median Fisher Discriminator: a robust feature extraction method with applications to biometrics[J]. Front. Comput. Sci., 2008, 2(3): 295-305.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed