EVALUATION OF THE FLEXIBILITY OF NADARAYA-WATSON KERNEL AND PENALIZED SPLINE ESTIMATORS IN BIVARIATE RESPONSE NONPARAMETRIC REGRESSION MODELS

  • Cinta Rizki Oktarina Department of Statistics, Faculty of Mathematics and Natural Sciences, Universitas Bengkulu, Indonesia https://orcid.org/0009-0007-3186-7220
  • Sigit Nugroho Department of Statistics, Faculty of Mathematics and Natural Sciences, Universitas Bengkulu, Indonesia https://orcid.org/0000-0003-4535-2045
  • Idhia Sriliana Department of Statistics, Faculty of Mathematics and Natural Sciences, Universitas Bengkulu, Indonesia https://orcid.org/0000-0003-3926-4129
Keywords: Bandwidth, Bivariate nonparametric regression, Kernel Nadaraya-Watson, Knot, Penalized Spline

Abstract

Nonparametric regression is a flexible approach used when the functional relationship between predictors and responses is unknown. In the context of multiple responses, bivariate nonparametric regression allows modeling two correlated response variables, such as stunting and wasting prevalence, which remain critical issues in public health. This study aims to evaluate the flexibility and performance of two nonparametric estimators, the Nadaraya-Watson Kernel and the Penalized Spline, for modeling bivariate response data. The research was conducted in two stages: (1) simulation using variations in sample sizes (50, 100, 150, 200) and error variances based on exponential and trigonometric functions, and (2) application to real data on stunting and wasting prevalence in Indonesia (2024) obtained from Statistics Indonesia (BPS), with socioeconomic and health-related predictors. Model performance was assessed using RMSE, MSE, and R-squared, complemented by MANOVA, orthogonal polynomial contrasts, and Tukey’s post-hoc test to examine significant differences across scenarios. Simulation results indicate that the Nadaraya-Watson Kernel estimator consistently outperformed the Penalized Spline, providing lower RMSE and MSE values and greater stability, particularly for larger sample sizes and smaller error variances. Orthogonal polynomial analysis revealed a quadratic relationship between sample size and RMSE, with occasional cubic patterns, while error variance consistently exhibited a quadratic trend. In the applied study, the Nadaraya-Watson Kernel with a Gaussian kernel achieved high accuracy, with an MSE of 0.00086 and an R-squared value indicating a strong model fit. However, this high R-squared value may reflect potential overfitting, which warrants further validation through cross-validation. These findings demonstrate that the Nadaraya-Watson Kernel offers an effective approach for bivariate nonparametric regression, supporting data-driven policy decisions in nutrition and public health.

Downloads

Download data is not yet available.

References

D. C. Montgomery, E. A. Peck, and G. VINNING, LINEAR REGRESSION ANALYSIS, 6th ed. John Wiley & Sons, Inc, 2021. doi: https://doi.org/10.2307/1268395

I. Sriliana, I. N. Budiantara, and V. Ratnasari, “THE MIXED ESTIMATOR OF TRUNCATED SPLINE AND LOCAL LINEAR IN MULTIVARIABLE NONPARAMETRIC REGRESSION,” AIP Conf. Proc., vol. 2554, no. 1, 2023. doi: https://doi.org/10.1063/5.0104167

R. L. Eubank, NONPARAMETRIC REGRESSION AND SPLINE SMOOTHING, 2nd ed. New York: Marcel Dekker, 1999. doi: https://doi.org/10.1201/9781482273144

I. N. Budiantara, REGRESI NONPARAMETRIK SPLINE TRUNCATED. Surabaya: ITS Press, 2019.

P. P. Gabrela, J. D. T. Purnomo, and I. N. Budiantara, “THE ESTIMATION OF MIXED TRUNCATED SPLINE AND FOURIER SERIES ESTIMATOR IN BI-RESPONSE NONPARAMETRIC REGRESSION,” AIP Conf. Proc., vol. 2903, no. 1, 2023. doi: https://doi.org/10.1063/5.0177224.

I. Sriliana, I. N. Budiantara, and V. Ratnasari, “A TRUNCATED SPLINE AND LOCAL LINEAR MIXED ESTIMATOR IN NONPARAMETRIC REGRESSION FOR LONGITUDINAL DATA AND ITS APPLICATION,” Symmetry (Basel)., vol. 14, no. 12, 2022. doi: https://doi.org/10.3390/sym14122687.

R. Hidayat, I. N. Budiantara, B. W. Otok, and V. Ratnasari, “THE REGRESSION CURVE ESTIMATION BY USING MIXED SMOOTHING SPLINE AND KERNEL (MSS-K) MODEL,” Commun. Stat. - Theory Methods, vol. 50, no. 17, pp. 3942–3953, 2021. doi: https://doi.org/10.1080/03610926.2019.1710201

R. Pahlepi, I. Sriliana, W. Agwil, and C. R. Oktarina, “BIRESPONSE SPLINE TRUNCATED NONPARAMETRIC REGRESSION MODELING FOR LONGITUDINAL DATA ON MONTHLY STOCK PRICES OF THREE PRIVATE BANKS IN INDONESIA,” Barekeng, vol. 19, no. 4, pp. 2467–2480, 2025. doi: https://doi.org/10.30598/barekengvol19iss4pp2467-2480

G. Wahba, SPLINE MODELS FOR OBSERVATIONAL DATA. PENNSYLVANIA: SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS, 1990. doi: https://doi.org/10.1137/1.9781611970128

Y. R. Yue, D. Simpson, F. Lindgren, and H. Rue, “BAYESIAN ADAPTIVE SMOOTHING SPLINES USING STOCHASTIC DIFFERENTIAL EQUATIONS,” Bayesian Anal., vol. 9, no. 2, pp. 397–424, 2014. doi: https://doi.org/10.1214/13-BA866

K. Xie, F. Meng, and D. Zhang, “REGIONAL FORECASTING OF PM2.5 CONCENTRATIONS: A NOVEL MODEL BASED ON THE EMPIRICAL ORTHOGONAL FUNCTION ANALYSIS AND NADARAYA–WATSON KERNEL REGRESSION ESTIMATOR,” Environ. Model. Softw., vol. 170, no. January, p. 105857, 2023. doi: https://doi.org/10.1016/j.envsoft.2023.105857

A. R. Fadilah, A. Fitrianto, and I. M. Sumertajaya, “OUTLIER IDENTIFICATION ON PENALIZED SPLINE REGRESSION MODELING FOR POVERTY GAP INDEX IN JAVA,” BAREKENG J. Ilmu Mat. dan Terap., vol. 16, no. 4, pp. 1231–1240, 2022. doi: https://doi.org/10.30598/barekengvol16iss4pp1231-1240

D. Ruppert, M. P. Wand, and R. Carroll, SEMIPARAMETRIC REGRESSION. Cambridge University Press, 2003. doi: https://doi.org/10.1201/9781420091984-c17

N. Y. Adrianingsih, I. N. Budiantara, and J. D. T. Purnomo, “MIXTURE MODEL NONPARAMETRIC REGRESSION AND ITS APPLICATION,” J. Phys. Conf. Ser., vol. 1842, no. 1, 2021. doi: https://doi.org/10.1088/1742-6596/1842/1/012044

J. Fan and I. Gijbels, LOKAL POLYNOMIAL MODELLING AND ITS APPLICATIONS, 1st ed., no. 1985. Springer Science Business Media, 1996.

T. H. Ali, “MODIFICATION OF THE ADAPTIVE NADARAYA-WATSON KERNEL METHOD FOR NONPARAMETRIC REGRESSION (SIMULATION STUDY),” Commun. Stat. Simul. Comput., vol. 51, no. 2, pp. 391–403, 2019. doi: https://doi.org/10.1080/03610918.2019.1652319

C. P. A. Moraes, D. G. Fantinato, and A. Neves, “EPANECHNIKOV KERNEL FOR PDF ESTIMATION APPLIED TO EQUALIZATION AND BLIND SOURCE SEPARATION,” Signal Processing, vol. 189, p. 108251, 2021. doi: https://doi.org/10.1016/j.sigpro.2021.108251.

A. C. Guidoum, “KERNEL ESTIMATOR AND BANDWIDTH SELECTION FOR DENSITY AND ITS DERIVATIVES: THE KEDD PACKAGE,” vol. 3, no. 1, pp. 1–22, 2020. doi: https://doi.org/10.48550/arXiv.2012.06102.

I. Sriliana, I. N. Budiantara, and V. Ratnasari, “THE PERFORMANCE OF MIXED TRUNCATED SPLINE-LOCAL LINEAR NONPARAMETRIC REGRESSION MODEL FOR LONGITUDINAL DATA,” MethodsX, vol. 12, no. July 2023, 2024. doi: https://doi.org/10.1016/j.mex.2024.102652

Suparti, R. Santoso, A. Prahutama, and A. R. Devi, REGRESI NONPARAMETRIK, 1st ed. Ponorogo: Wade Group, 2017.

Sifriyani, A. R. M. Sari, A. T. R. Dani, and S. Jalaluddin, “BI-RESPONSE TRUNCATED SPLINE NONPARAMETRIC REGRESSION WITH OPTIMAL KNOT POINT SELECTION USING GENERALIZED CROSS-VALIDATION IN DIABETES MELLITUS PATIENT’S BLOOD SUGAR LEVELS,” Commun. Math. Biol. Neurosci., vol. 2023, pp. 1–18, 2023. doi: https://doi.org/10.28919/cmbn/7903.

C. R. Oktarina, S. Nugroho, and I. Sriliana, “Estimation of Stunting and Wasting Prevalence in Southern Part of Sumatra Using Nadaraya-Watson Kernel and Penalized Spline,” vol. 10, no. 2, pp. 647–660, 2026.

S. Tosatto, R. Akrour, and J. Peters, “AN UPPER BOUND OF THE BIAS OF NADARAYA-WATSON KERNEL REGRESSION UNDER LIPSCHITZ ASSUMPTIONS,” Stats, vol. 4, no. 1, pp. 1–17, 2021. doi: https://doi.org/10.3390/stats4010001

T. Misiakiewicz and B. Saeed, “A NON-ASYMPTOTIC THEORY OF KERNEL RIDGE REGRESSION: DETERMINISTIC EQUIVALENTS, TEST ERROR, AND GCV ESTIMATOR,” pp. 1–131, 2024, [Online]. Available:

P. Green and B. Silverman, NONPARAMETRIC REGRESSION AND GENERALIZED LINEAR MODELS. Springer Science Business Media, 1994. doi: https://doi.org/10.1007/978-1-4899-4473-3

A. Islamiyati et al., “THE USE OF PENALIZED WEIGHTED LEAST SQUARE TO OVERCOME CORRELATIONS BETWEEN TWO RESPONSES,” BAREKENG J. Ilmu Mat. dan Terap., vol. 16, no. 4, pp. 1497–1504, 2022. doi: https://doi.org/10.30598/barekengvol16iss4pp1497-1504

L. Yang and Y. Hong, “ADAPTIVE PENALIZED SPLINES FOR DATA SMOOTHING,” Comput. Stat. Data Anal., vol. 108, pp. 70–83, 2017. doi: https://doi.org/10.1016/j.csda.2016.10.022

D. Chicco, M. J. Warrens, and G. Jurman, “THE COEFFICIENT OF DETERMINATION R-SQUARED IS MORE INFORMATIVE THAN SMAPE, MAE, MAPE, MSE AND RMSE IN REGRESSION ANALYSIS EVALUATION,” PeerJ Comput. Sci., vol. 7, no. e623, pp. 1–24, 2021. doi: https://doi.org/10.7717/peerj-cs.623

T. O. Hodson, “ROOT-MEAN-SQUARE ERROR (RMSE) OR MEAN ABSOLUTE ERROR (MAE): WHEN TO USE THEM OR NOT,” Geosci. Model Dev., vol. 15, no. 14, pp. 5481–5487, 2022. doi: https://doi.org/10.5194/gmd-15-5481-2022

S. Dazzi, R. Vacondio, and P. Mignosa, “FLOOD STAGE FORECASTING USING MACHINE-LEARNING METHODS: A CASE STUDY ON THE PARMA RIVER (ITALY),” Water (Switzerland), vol. 13, no. 12, pp. 1–22, 2021. doi: https://doi.org/10.3390/w13121612

A. M. Sadek and L. A. Mohammed, “EVALUATION OF THE PERFORMANCE OF KERNEL NON-PARAMETRIC REGRESSION AND ORDINARY LEAST SQUARES REGRESSION,” Int. J. Informatics Vis., vol. 8, no. 3, pp. 1352–1360, 2024. doi: https://doi.org/10.62527/joiv.8.3.2430

C. R. Oktarina, I. Sriliana, and S. Nugroho, “PENALIZED SPLINE SEMIPARAMETRIC REGRESSION FOR BIVARIATE RESPONSE IN MODELING MACRO POVERTY INDICATORS,” Indones. J. Appl. Stat., vol. 8, no. 1, pp. 13–23, 2025. doi: https://doi.org/10.12962/j27213862.v8i3.23330.

Published
2026-04-08
How to Cite
[1]
C. R. Oktarina, S. Nugroho, and I. Sriliana, “EVALUATION OF THE FLEXIBILITY OF NADARAYA-WATSON KERNEL AND PENALIZED SPLINE ESTIMATORS IN BIVARIATE RESPONSE NONPARAMETRIC REGRESSION MODELS”, BAREKENG: J. Math. & App., vol. 20, no. 3, pp. 2179-2194, Apr. 2026.