EVALUATION OF THE FLEXIBILITY OF NADARAYA-WATSON KERNEL AND PENALIZED SPLINE ESTIMATORS IN BIVARIATE RESPONSE NONPARAMETRIC REGRESSION MODELS
Abstract
Nonparametric regression is a flexible approach used when the functional relationship between predictors and responses is unknown. In the context of multiple responses, bivariate nonparametric regression allows modeling two correlated response variables, such as stunting and wasting prevalence, which remain critical issues in public health. This study aims to evaluate the flexibility and performance of two nonparametric estimators, the Nadaraya-Watson Kernel and the Penalized Spline, for modeling bivariate response data. The research was conducted in two stages: (1) simulation using variations in sample sizes (50, 100, 150, 200) and error variances based on exponential and trigonometric functions, and (2) application to real data on stunting and wasting prevalence in Indonesia (2024) obtained from Statistics Indonesia (BPS), with socioeconomic and health-related predictors. Model performance was assessed using RMSE, MSE, and R-squared, complemented by MANOVA, orthogonal polynomial contrasts, and Tukey’s post-hoc test to examine significant differences across scenarios. Simulation results indicate that the Nadaraya-Watson Kernel estimator consistently outperformed the Penalized Spline, providing lower RMSE and MSE values and greater stability, particularly for larger sample sizes and smaller error variances. Orthogonal polynomial analysis revealed a quadratic relationship between sample size and RMSE, with occasional cubic patterns, while error variance consistently exhibited a quadratic trend. In the applied study, the Nadaraya-Watson Kernel with a Gaussian kernel achieved high accuracy, with an MSE of 0.00086 and an R-squared value indicating a strong model fit. However, this high R-squared value may reflect potential overfitting, which warrants further validation through cross-validation. These findings demonstrate that the Nadaraya-Watson Kernel offers an effective approach for bivariate nonparametric regression, supporting data-driven policy decisions in nutrition and public health.
Downloads
References
D. C. Montgomery, E. A. Peck, and G. VINNING, LINEAR REGRESSION ANALYSIS, 6th ed. John Wiley & Sons, Inc, 2021. doi: https://doi.org/10.2307/1268395
I. Sriliana, I. N. Budiantara, and V. Ratnasari, “THE MIXED ESTIMATOR OF TRUNCATED SPLINE AND LOCAL LINEAR IN MULTIVARIABLE NONPARAMETRIC REGRESSION,” AIP Conf. Proc., vol. 2554, no. 1, 2023. doi: https://doi.org/10.1063/5.0104167
R. L. Eubank, NONPARAMETRIC REGRESSION AND SPLINE SMOOTHING, 2nd ed. New York: Marcel Dekker, 1999. doi: https://doi.org/10.1201/9781482273144
I. N. Budiantara, REGRESI NONPARAMETRIK SPLINE TRUNCATED. Surabaya: ITS Press, 2019.
P. P. Gabrela, J. D. T. Purnomo, and I. N. Budiantara, “THE ESTIMATION OF MIXED TRUNCATED SPLINE AND FOURIER SERIES ESTIMATOR IN BI-RESPONSE NONPARAMETRIC REGRESSION,” AIP Conf. Proc., vol. 2903, no. 1, 2023. doi: https://doi.org/10.1063/5.0177224.
I. Sriliana, I. N. Budiantara, and V. Ratnasari, “A TRUNCATED SPLINE AND LOCAL LINEAR MIXED ESTIMATOR IN NONPARAMETRIC REGRESSION FOR LONGITUDINAL DATA AND ITS APPLICATION,” Symmetry (Basel)., vol. 14, no. 12, 2022. doi: https://doi.org/10.3390/sym14122687.
R. Hidayat, I. N. Budiantara, B. W. Otok, and V. Ratnasari, “THE REGRESSION CURVE ESTIMATION BY USING MIXED SMOOTHING SPLINE AND KERNEL (MSS-K) MODEL,” Commun. Stat. - Theory Methods, vol. 50, no. 17, pp. 3942–3953, 2021. doi: https://doi.org/10.1080/03610926.2019.1710201
R. Pahlepi, I. Sriliana, W. Agwil, and C. R. Oktarina, “BIRESPONSE SPLINE TRUNCATED NONPARAMETRIC REGRESSION MODELING FOR LONGITUDINAL DATA ON MONTHLY STOCK PRICES OF THREE PRIVATE BANKS IN INDONESIA,” Barekeng, vol. 19, no. 4, pp. 2467–2480, 2025. doi: https://doi.org/10.30598/barekengvol19iss4pp2467-2480
G. Wahba, SPLINE MODELS FOR OBSERVATIONAL DATA. PENNSYLVANIA: SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS, 1990. doi: https://doi.org/10.1137/1.9781611970128
Y. R. Yue, D. Simpson, F. Lindgren, and H. Rue, “BAYESIAN ADAPTIVE SMOOTHING SPLINES USING STOCHASTIC DIFFERENTIAL EQUATIONS,” Bayesian Anal., vol. 9, no. 2, pp. 397–424, 2014. doi: https://doi.org/10.1214/13-BA866
K. Xie, F. Meng, and D. Zhang, “REGIONAL FORECASTING OF PM2.5 CONCENTRATIONS: A NOVEL MODEL BASED ON THE EMPIRICAL ORTHOGONAL FUNCTION ANALYSIS AND NADARAYA–WATSON KERNEL REGRESSION ESTIMATOR,” Environ. Model. Softw., vol. 170, no. January, p. 105857, 2023. doi: https://doi.org/10.1016/j.envsoft.2023.105857
A. R. Fadilah, A. Fitrianto, and I. M. Sumertajaya, “OUTLIER IDENTIFICATION ON PENALIZED SPLINE REGRESSION MODELING FOR POVERTY GAP INDEX IN JAVA,” BAREKENG J. Ilmu Mat. dan Terap., vol. 16, no. 4, pp. 1231–1240, 2022. doi: https://doi.org/10.30598/barekengvol16iss4pp1231-1240
D. Ruppert, M. P. Wand, and R. Carroll, SEMIPARAMETRIC REGRESSION. Cambridge University Press, 2003. doi: https://doi.org/10.1201/9781420091984-c17
N. Y. Adrianingsih, I. N. Budiantara, and J. D. T. Purnomo, “MIXTURE MODEL NONPARAMETRIC REGRESSION AND ITS APPLICATION,” J. Phys. Conf. Ser., vol. 1842, no. 1, 2021. doi: https://doi.org/10.1088/1742-6596/1842/1/012044
J. Fan and I. Gijbels, LOKAL POLYNOMIAL MODELLING AND ITS APPLICATIONS, 1st ed., no. 1985. Springer Science Business Media, 1996.
T. H. Ali, “MODIFICATION OF THE ADAPTIVE NADARAYA-WATSON KERNEL METHOD FOR NONPARAMETRIC REGRESSION (SIMULATION STUDY),” Commun. Stat. Simul. Comput., vol. 51, no. 2, pp. 391–403, 2019. doi: https://doi.org/10.1080/03610918.2019.1652319
C. P. A. Moraes, D. G. Fantinato, and A. Neves, “EPANECHNIKOV KERNEL FOR PDF ESTIMATION APPLIED TO EQUALIZATION AND BLIND SOURCE SEPARATION,” Signal Processing, vol. 189, p. 108251, 2021. doi: https://doi.org/10.1016/j.sigpro.2021.108251.
A. C. Guidoum, “KERNEL ESTIMATOR AND BANDWIDTH SELECTION FOR DENSITY AND ITS DERIVATIVES: THE KEDD PACKAGE,” vol. 3, no. 1, pp. 1–22, 2020. doi: https://doi.org/10.48550/arXiv.2012.06102.
I. Sriliana, I. N. Budiantara, and V. Ratnasari, “THE PERFORMANCE OF MIXED TRUNCATED SPLINE-LOCAL LINEAR NONPARAMETRIC REGRESSION MODEL FOR LONGITUDINAL DATA,” MethodsX, vol. 12, no. July 2023, 2024. doi: https://doi.org/10.1016/j.mex.2024.102652
Suparti, R. Santoso, A. Prahutama, and A. R. Devi, REGRESI NONPARAMETRIK, 1st ed. Ponorogo: Wade Group, 2017.
Sifriyani, A. R. M. Sari, A. T. R. Dani, and S. Jalaluddin, “BI-RESPONSE TRUNCATED SPLINE NONPARAMETRIC REGRESSION WITH OPTIMAL KNOT POINT SELECTION USING GENERALIZED CROSS-VALIDATION IN DIABETES MELLITUS PATIENT’S BLOOD SUGAR LEVELS,” Commun. Math. Biol. Neurosci., vol. 2023, pp. 1–18, 2023. doi: https://doi.org/10.28919/cmbn/7903.
C. R. Oktarina, S. Nugroho, and I. Sriliana, “Estimation of Stunting and Wasting Prevalence in Southern Part of Sumatra Using Nadaraya-Watson Kernel and Penalized Spline,” vol. 10, no. 2, pp. 647–660, 2026.
S. Tosatto, R. Akrour, and J. Peters, “AN UPPER BOUND OF THE BIAS OF NADARAYA-WATSON KERNEL REGRESSION UNDER LIPSCHITZ ASSUMPTIONS,” Stats, vol. 4, no. 1, pp. 1–17, 2021. doi: https://doi.org/10.3390/stats4010001
T. Misiakiewicz and B. Saeed, “A NON-ASYMPTOTIC THEORY OF KERNEL RIDGE REGRESSION: DETERMINISTIC EQUIVALENTS, TEST ERROR, AND GCV ESTIMATOR,” pp. 1–131, 2024, [Online]. Available:
P. Green and B. Silverman, NONPARAMETRIC REGRESSION AND GENERALIZED LINEAR MODELS. Springer Science Business Media, 1994. doi: https://doi.org/10.1007/978-1-4899-4473-3
A. Islamiyati et al., “THE USE OF PENALIZED WEIGHTED LEAST SQUARE TO OVERCOME CORRELATIONS BETWEEN TWO RESPONSES,” BAREKENG J. Ilmu Mat. dan Terap., vol. 16, no. 4, pp. 1497–1504, 2022. doi: https://doi.org/10.30598/barekengvol16iss4pp1497-1504
L. Yang and Y. Hong, “ADAPTIVE PENALIZED SPLINES FOR DATA SMOOTHING,” Comput. Stat. Data Anal., vol. 108, pp. 70–83, 2017. doi: https://doi.org/10.1016/j.csda.2016.10.022
D. Chicco, M. J. Warrens, and G. Jurman, “THE COEFFICIENT OF DETERMINATION R-SQUARED IS MORE INFORMATIVE THAN SMAPE, MAE, MAPE, MSE AND RMSE IN REGRESSION ANALYSIS EVALUATION,” PeerJ Comput. Sci., vol. 7, no. e623, pp. 1–24, 2021. doi: https://doi.org/10.7717/peerj-cs.623
T. O. Hodson, “ROOT-MEAN-SQUARE ERROR (RMSE) OR MEAN ABSOLUTE ERROR (MAE): WHEN TO USE THEM OR NOT,” Geosci. Model Dev., vol. 15, no. 14, pp. 5481–5487, 2022. doi: https://doi.org/10.5194/gmd-15-5481-2022
S. Dazzi, R. Vacondio, and P. Mignosa, “FLOOD STAGE FORECASTING USING MACHINE-LEARNING METHODS: A CASE STUDY ON THE PARMA RIVER (ITALY),” Water (Switzerland), vol. 13, no. 12, pp. 1–22, 2021. doi: https://doi.org/10.3390/w13121612
A. M. Sadek and L. A. Mohammed, “EVALUATION OF THE PERFORMANCE OF KERNEL NON-PARAMETRIC REGRESSION AND ORDINARY LEAST SQUARES REGRESSION,” Int. J. Informatics Vis., vol. 8, no. 3, pp. 1352–1360, 2024. doi: https://doi.org/10.62527/joiv.8.3.2430
C. R. Oktarina, I. Sriliana, and S. Nugroho, “PENALIZED SPLINE SEMIPARAMETRIC REGRESSION FOR BIVARIATE RESPONSE IN MODELING MACRO POVERTY INDICATORS,” Indones. J. Appl. Stat., vol. 8, no. 1, pp. 13–23, 2025. doi: https://doi.org/10.12962/j27213862.v8i3.23330.
Copyright (c) 2026 Cinta Rizki Oktarina, Sigit Nugroho, Idhia Sriliana

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this Journal agree to the following terms:
- Author retain copyright and grant the journal right of first publication with the work simultaneously licensed under a creative commons attribution license that allow others to share the work within an acknowledgement of the work’s authorship and initial publication of this journal.
- Authors are able to enter into separate, additional contractual arrangement for the non-exclusive distribution of the journal’s published version of the work (e.g. acknowledgement of its initial publication in this journal).
- Authors are permitted and encouraged to post their work online (e.g. in institutional repositories or on their websites) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published works.




1.gif)


