Please wait a minute...
Frontiers of Mathematics in China

ISSN 1673-3452

ISSN 1673-3576(Online)

CN 11-5739/O1

Postal Subscription Code 80-964

2018 Impact Factor: 0.565

Front Math Chin    2013, Vol. 8 Issue (3) : 665-694    https://doi.org/10.1007/s11464-013-0267-0
RESEARCH ARTICLE
Probability density estimation with surrogate data and validation sample
Qihua WANG1, Wenquan CUI2()
1. Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; 2. Department of Statistics and Finance, University of Science and Technology of China, Hefei 230026, China
 Download: PDF(254 KB)   HTML
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

The probability density estimation problem with surrogate data and validation sample is considered. A regression calibration kernel density estimator is defined to incorporate the information contained in both surrogate variates and validation sample. Also, we define two weighted estimators which have less asymptotic variances but have bigger biases than the regression calibration kernel density estimator. All the proposed estimators are proved to be asymptotically normal. And the asymptotic representations for the mean squared error and mean integrated square error of the proposed estimators are established, respectively. A simulation study is conducted to compare the finite sample behaviors of the proposed estimators.

Keywords Measurement error      asymptotic normality      convergent rate     
Corresponding Author(s): CUI Wenquan,Email:qhwang@amss.ac.cn   
Issue Date: 01 June 2013
 Cite this article:   
Qihua WANG,Wenquan CUI. Probability density estimation with surrogate data and validation sample[J]. Front Math Chin, 2013, 8(3): 665-694.
 URL:  
https://academic.hep.com.cn/fmc/EN/10.1007/s11464-013-0267-0
https://academic.hep.com.cn/fmc/EN/Y2013/V8/I3/665
1 Bowman A. An alternative method of cross-validation for the smoothing of density estimates. Biometrika , 1984, 71: 353-360
doi: 10.1093/biomet/71.2.353
2 Carroll R J, Hall P. Optimal rates of convergence for deconvolving a density. J Amer Statist Assoc , 1988, 83: 1184-1186
doi: 10.1080/01621459.1988.10478718
3 Carroll R J, Wand M P. Semiparametric estimation in logistic measure error models. J Roy Statist Soc, Ser B , 1991, 53: 539-572
4 Chen Y-H. Cox regression in cohort studies with validation sampling. J Roy Statist Soc, Ser B , 2002, 64: 51-62
doi: 10.1111/1467-9868.00324
5 Devroye L, Gy?rfi L. The equivalence of weak, strong and complete convergence in L1 for kernel density estimates. Ann Statist , 1983, 11: 896-904
doi: 10.1214/aos/1176346255
6 Devroye L, Gy?rfi L. Nonparametric Density Estimation. New York: John Wiley & Sons, 1985
7 Duncan G, Hill D. An investigations of the extent and consequences of measurement error in labor-economics survey data. J Labor Economics , 1985, 3: 508-532
doi: 10.1086/298067
8 Fan J. On the optimal rates of convergence for nonparametric deconvolution problems. Ann Statist , 1991, 19: 1257-1272
doi: 10.1214/aos/1176348248
9 Parzen E. On estimation of a probability density function and mode. Ann Statist , 1962, 33: 1065-1076
doi: 10.1214/aoms/1177704472
10 Pepe M S. Inference using surrogate outcome data and a validation sample. Biometrika , 1992, 79: 355-365
doi: 10.1093/biomet/79.2.355
11 Pepe M S, Fleming T R. A general nonparametric method for dealing with errors in missing or surrogate covariate data. J Amer Statist Assoc , 1991, 86: 108-113
doi: 10.1080/01621459.1991.10475009
12 Pepe M S, Reilly M, Fleming T R. Auxiliary outcome data and the mean score method. J Statist Plann Inference , 1994, 42: 137-160
doi: 10.1016/0378-3758(94)90194-5
13 Reilly M, Pepe M S. A mean score method for missing and auxiliary covariate data in regression models. Biometrika , 1995, 82: 299-314
doi: 10.1093/biomet/82.2.299
14 Rosenblatt M. Remarks on some nonparametric estimates of a density function. Ann Math Stat , 1956, 27: 832-837
doi: 10.1214/aoms/1177728190
15 Rosner B, Willett W C, Spiegelman D. Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error. Stat Med , 1989, 8: 1075-1093
16 Rudemo M. Empirical choice of histograms and kernel density estimators. Scand J Stat , 1982, 9: 65-78
17 Sepanski J H, Lee L F. Semiparametric Estimation of nonlinear error-in-variables models with validation study. J Nonparametr Stat , 1995, 4: 365-394
doi: 10.1080/10485259508832627
18 Silverman B W. On the estimation of a probability density function by the maximum penalized likelihood method. Ann Statist , 1982, 10: 795-810
doi: 10.1214/aos/1176345872
19 Stefanski L A. Rates of convergence of some estimators in a class of deconvolution problems. Statist Probab Lett , 1990, 9: 229-235
doi: 10.1016/0167-7152(90)90061-B
20 Stefanski L A, Carroll R J. Conditional scores and optimal scores for generalized linear measurement error models. Biometrika , 1987, 74: 703-716
21 Stefanski L A, Carroll R J. Deconvoluting kernel density estimators. Statistics , 1990, 21: 169-184
doi: 10.1080/02331889008802238
22 Wahba G. Optimal convergence properties of variable knot, kernel and orthogonal series methods for density estimation. Ann Statist , 1975, 3: 15-29
doi: 10.1214/aos/1176342997
23 Wang Q H. Estimation of partial linear error-in-variables model with validation data. J Multivariate Anal , 1999, 69: 30-64
doi: 10.1006/jmva.1998.1797
24 Wang Q H. Estimation of linear error-in-covariables models with validation data under random censorship. J Multivariate Anal , 2000, 74: 245-266
doi: 10.1006/jmva.1999.1869
25 Wang Q H, H?rdle W. Empirical likelihood-based dimension reduction inference for linear error-in-responses models with validation study. Sci China Ser A , 2004, 47: 921-939
doi: 10.1360/03ys0148
26 Wang Q H, Rao J N K. Empirical likelihood-based in linear errors-in-covariables models with validation data. 2003, 89: 345-358
27 Wittes J, Lakatos E, Probstfied J. Surrogate endpoints in clinical trails: Cardiovascular diseases. Stat Med , 1989, 8: 415-425
doi: 10.1002/sim.4780080405
[1] Xinjie CHEN, Guohua ZOU, Xinyu ZHANG. Frequentist model averaging for linear mixed-effects models[J]. Front Math Chin, 2013, 8(3): 497-515.
[2] Ying YANG, Fei YE. General relative error criterion and M-estimation[J]. Front Math Chin, 2013, 8(3): 695-715.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed