Small-Sample Comparison of the Gamma Kernel and the Orthogonal Series Methods of Density Estimation

Authors
Razi University
Abstract
Introduction

Estimation of a probability density function is an important area of nonparametric statistical inference that has received much attention in recent decades. The kernel method is widely used in nonparametric estimation of the probability density function of an absolutely continuous distribution with support on the whole real line. However, for a distribution with support on a subset of the real line, the kernel density estimator with fixed symmetric kernels encounters bias at the boundaries of the support, which is known as the boundary bias issue. This is due to smoothing data near the boundary points by the fixed symmetric kernel that leads to allocating probability density to outside of the distribution’s support (see Silverman, 1986).

There are many applications, such as reliability, insurance and life testing, dealing with non-negative data and estimating the probability density function of distributions with support on the non-negative real line is the object of interest. Using the kernel estimator with fixed symmetric kernels in these cases results in the boundary bias issue at the origin. A number of methods have been proposed to avoid the boundary bias issue at the origin. A simple remedy is to replace symmetric kernels by asymmetric kernels which never assign density to negative values. The Gamma kernels proposed by Chen (2000) are the effective asymmetric kernels to estimate the probability density function of distributions on the non-negative real line.

Orthogonal series estimators form another class of nonparametric probability density estimators, which go back to Cencov (1964). In this approach, as reviewed in Efromovich (2010), the target probability function is expanded in terms of a sequence of orthogonal basis functions. After selecting a suitable sequence of orthogonal basis functions, the observed data are used to estimated the coefficients of the expansion in order to obtain the orthogonal series density estimator. Similar to kernel estimators, under some mild conditions the orthogonal series estimators have appealing large sample properties. Moreover, the boundary issue can be avoided by using orthogonal density estimators with suitable basis functions.

Although small sample properties of asymmetric kernel estimators with the Gamma kernels and orthogonal series estimators are well-studied separately, but to the best of our knowledge, there have been no reports of comparing their performance in estimating the probability density function of distributions on the non-negative real line. In this paper‎, a simulation study is conducted to compare the small-sample performance of the Gamma kernel estimators and orthogonal series estimators for a set of distributions on the positive real line.



Material and methods

Following Malec and Schienle (2014), we consider six parameter settings for the generalized F distribution to obtain probability density functions with different shapes, near-origin behaviors and tail decays (Figure 2). Based on 5000 simulations from any of these density functions with sample sizes , we estimate the target density function using the type I and II Gamma kernel estimators and the orthogonal series estimators with Hermite and Laguerre basis functions and compute the mean integrated squared error (MISE).

The bandwidth parameter in the Gamma kernel estimators and the cutoff and smoothing parameters in the orthogonal series estimators are significantly affect the performance of the estimators. We use optimal bandwidths for the Gamma kernel estimators and optimal cutoff and smoothing parameters of the orthogonal series estimators to avoid variations due to uncertainty of tuning parameters. To obtain the optimal tuning parameters for each target density, we compute and minimize the MISE with respect to the tuning parameters based on additional 5000 simulations from the true density function.

Results and discussion

For each density function, the optimal tuning parameters and the MISE’s of the estimators are reported (Table 2). As expected from the large sample properties, increasing the sample size improves the performance of all estimators. The performances of estimators vary from cases to cases and there no considered estimator is the best in all cases. In all cases except one, the type II Gamma kernel estimator is superior to the type I Gamma kernel estimator, which is in agreement with Chen (2000) suggestion of preferring type II to type I Gamma kernel estimator. However, in one case the type I Gamma kernel estimator is better than all other estimators. In cases where the shape and near-origin behavior of the target density is similar to the Hermit or Laguerre basis functions, the corresponding orthogonal series estimator outperforms all the other competing estimators.

Conclusion

The following conclusions were drawn from this research to choose among the considered estimators.

If the basis functions of the orthogonal series estimator are chosen to have similar shape and near-origin behavior to the target density function, then the corresponding orthogonal series estimator can outperform the Gamma kernel estimators.
If there is no prior knowledge about the shape and near-origin behavior of the target density function and the sample size is relatively large (n=400), then the type II Gamma kernel estimator can outperform the orthogonal series estimators.


Keywords

1. Tapia R. A., Thompson J. R., "Nonparametric Probability Density Estimation", Johns Hopkins University Press, Baltimore (1978).## 2. Silverman B. W., "Density estimation for statistics and data analysis", Chapman & Hall/CRC press, London (1986). ## 3. Izenman A. J., "Review Papers: Recent Developments in Nonparametric Density Estimation", Journal of the American Statistical Association, 86(413) (1991) 205-224. ## 4. Wand M. P., Jones M. C., "Kernel Smoothing", Chapman and Hall/CRC, London (1995). ## 5. Devroye L., Lugosi G., "Combinatorial Methods in Density Estimation", Springer Science & Business Media, New York, (2012). ## 6. Parzen E., "On Estimation of a Probability Density Function and Mode", The Annals of Mathematical Statistics, 33 (3) (1962) 1065-1076. ## 7. Marron J., Ruppert D., "Transformations to Reduce Boundary Bias in Kernel Density Estimation", Journal of the Royal Statistical Society. Series B (Methodological), 56 (4) (1994) 653-671. ## 8. Schuster E. F., "Incorporating support constraints into nonparametric estimators of densities", Communications in Statistics - Theory and Methods, 14(5) (1985) 1123-1136. ## 9. Cline D. B. H., Hart J. D., "Kernel Estimation of Densities with Discontinuities or Discontinuous Derivatives", Statistics, 22(1) (1991) 69-84. ## 10. Gasser T., Muller H. G., Mammitzsch V., "Kernels for Nonparametric Curve Estimation", Journal of the Royal Statistical Society. Series B (Methodological), 47 (2) (1985) 238-252. ## 11. Muller H. G., "Smooth Optimum Kernel Estimators Near Endpoints", Biometrika, 78 (3) (1991) 521-530. ## 12. Muller H. G., "On the boundary kernel method for non-parametric curve estimation near endpoints", Scand. J. Stat., 20 (1993) 313-328. ## 13. Jones M. C., "Simple boundary correction for kernel density estimation", Statistics and Computing, 3(3) (1993) 135-146. ## 14. Wand M. P., Marron J. S., Ruppert D., "Transformations in Density Estimation", Journal of the American Statistical Association, 86 (414) (1991) 343-353. ## 15. Chen S. X., "Beta kernel estimators for density functions", Computational Statistics & Data Analysis, 31 (2) (1999) 131-145. ## 16. Chen S. X., "Probability Density Function Estimation Using Gamma Kernels", Annals of the Institute of Statistical Mathematics, 52 (3) (2000) 471-480. ## 17. Jones M., Foster P., "A Simple Nonnegative Boundary Correction Method for Kernel Density Estimation", Statistica Sinica, 6(4) (1996) 1005-1013. ## 18. Muller H. G., Wang J. L., "Hazard Rate Estimation under Random Censoring with Varying Kernels and Bandwidths", Biometrics, 50(1) (1994) 61-76. ## 19. Malec P., Schienle M., "Nonparametric Kernel Density Estimation Near the Boundary", Computational Statistics & Data Analysis, 72 (2014) 57-76. ## 20. Karunamuni R. J., Alberts T., "On boundary correction in kernel density estimation", Statistical Methodology, 2 (3) (2005) 191-212. ## 21. Gasser T., Muller H. G., "Kernel estimation of regression functions. In Gasser, T., Rosenblatt, M. (Eds.): Smoothing Techniques for Curve Estimation", Lecture Notes in Mathematics, vol. 757. Springer, Heidelberg (1979) 23-68. ## 22. Cencov N. N., "Estimation of an unknown density function from observations", In Dokl. Akad. Nauk, SSSR Vol. 147 (1964) 45-48. ## 23. Wahba G., "Data-Based Optimal Smoothing of Orthogonal Series Density Estimates", The Annals of Statistics, 9 (1) (1981) 146-156. ## 24. Hall P., "Cross-validation and the smoothing of orthogonal series density estimators", Journal of Multivariate Analysis, 21 (2) (1987) 189-206. ## 25. Efromovich S., "Orthogonal series density estimation. Wiley Interdisciplinary Reviews: Computational Statistics", 2 (4) (2010) 467-476. ## 26. Kreyszig E., "Introductory functional analysis with applications", Wiley, New York (1989). 27. Watson G. S., "Density Estimation by Orthogonal Series", The Annals of Mathematical Statistics, 40(4) (1969) 1496-1498. ## 28. Efromovich S., "Nonparametric curve estimation: methods, theory, and applications", Springer Science & Business Media, (2008). ## 29. Walter G. G., "Properties of Hermite Series Estimation of Probability Density", The Annals of Statistics, 5 (6) (1977) 1258-1264. ## 30. Cox C., "The generalized F distribution: An umbrella for parametric survival analysis", Statistics in Medicine, 27 (21) (2008) 4301-4312. ## 31. Scarrott C. J., Hu Y., "evmix 0.2.7: Extreme Value Mixture Modelling, Threshold Estimation and Boundary Corrected Kernel Density Estimation", Availableon CRAN: http://www.math.canterbury.ac.nz/~c.scarrott/evmix (2017). ## 32. Novomestky F., "orthopolynom 1.0-5: Collection of functions for orthogonal and orthonormal polynomials", Available on CRAN: https://CRAN.R-project org/package=orthopolynom (2013). ## 33. Jackson C., "flexsurv: A Platform for Parametric Survival Modeling in R", Journal of Statistical Software, 70 (8) (2016) 1-33. ##