THE ORDINAL LOGISTIC REGRESSION MODEL WITH SAMPLING WEIGHTS ON DATA FROM THE NATIONAL SOCIO-ECONOMIC SURVEY

  • Reni Amelia Department of Statistics, Faculty of Mathematics and Natural Sciences, IPB University
  • Indahwati Indahwati Department of Statistics, Faculty of Mathematics and Natural Sciences, IPB University
  • Erfiani Erfiani Department of Statistics, Faculty of Mathematics and Natural Sciences, IPB University
Keywords: Ordinal logistic regression, sampling weight, SUSENAS, pseudo maximum likelihood, Taylor linearization

Abstract

Ordinal logistic regression is a method describing the relationship between an ordered categorical response variable and one or more explanatory variables. The parameter estimation of this model uses the maximum likelihood estimation having assumption that each sample unit having an equal chance of being selected, or using simple random sampling (SRS) design. This study uses data from the National Socio-Economic Survey (SUSENAS) having two-stage one-phase sampling (not SRS). So, the parameter estimation should consider the sampling weights. This study describes the parameter estimation of the ordinal logistic regression with sampling weight using the pseudo maximum likelihood method, especially in SUSENAS sampling design framework. The variance estimation method uses Taylor linearization. This study also provides numerical examples using ordinal logistic regression with sampling weight. Data used is 121,961 elderly spread over 514 districts/cities. Testing data (20%) is used to obtain the accuracy of the prediction results. The variables used in this study are the health status of the elderly as the response variable, and nine explanatory variables. The results of this study indicate that the ordinal logistic regression model with sampling weights is more representative of the population and more capable to predict minority categories of the response variable (poor and moderate health status) than is without sampling weights.

Downloads

Download data is not yet available.

References

M. W. Fagerland and D. W. Hosmer, "How to test for goodness of fit in ordinal logistic regression models," Stata Journal, vol. 17, no. 3, pp. 668-686, 2017.

K. J. Archer and S. Lemeshow, "Goodness of Fit Test for a Logistic Regression Model Fitted Using Survey Sample Data," The Stata Journal, vol. 6, no. 1, pp. 97-105, 2006.

Badan Pusat Statistik, Pedoman Kepala BPS Provinsi, Kepala Bidang Statistik Sosial, dan Kepala BPS Kabupaten/Kota: SUSENAS Maret 2020, Jakarta: Badan Pusat Statistik, 2019.

M. A. Ciol, J. M. Hoffman, B. J. Dudgeon, A. Shumway-Cook, K. M. Yorkston and L. Chan, "Understanding the use of weights in the analysis of data from multistage surveys," Arch Phys Med Rehabil, vol. 87, no. 2, pp. 299-303, 2006.

S. R. Cassy, I. Natario and M. R. O. Martins, "Logistic Regression Modelling for Complex Survey Data with an Application for Bed Net Use in Mozambique," Open Journal of Statistics, vol. 6, no. 5, pp. 898-907, 2016.

K. S. Barasa and C. Muchwanju, "Incorporating Survey Weights into Binary and Multinomial Logistic Regression Models," Science Journal of Applied Mathematics and Statistics, vol. 3, no. 6, pp. 243-249, 2015.

B. Anthony, "Performing Logistic Regression on Survey Data with the New SURVEYLOGISTIC Procedure," in The Twenty-Seventh Annual SAS®Users Group International Conference, Orlando, 2002.

J. Jajang, N. Nurhayati and S. J. Mufida, "Ordinal Logistic Regression Model and Classification Tree on Ordinal Response Data," BAREKENG: J. Il. Mat. & Ter., vol. 16, no. 1, pp. 075-082, 2022.

G. Solomon and L. Weissfeld, "Pseudo maximum likelihood approach for the analysis of multivariate left-censored longitudinal data," Stat Med., vol. 36, no. 1, pp. 81-91, 2017.

A. Guolo, "Pseudo-Likelihood inference for regression models with misclassified and mismeasured variables," Statistica Sinica, vol. 21, pp. 1639-1663, 2011.

M. Abdalmoaty and H. Hjalmarsson, "Simulated Pseudo Maximum Likelihood Identification of Nonlinear Models," in The 20th IFAC World Congress, Elsevier, 2017.

G. Fiorentini and E. Sentana, "Consistent non-Gaussian pseudo maximum likelihood estimators," Journal of Econometrics, vol. 213, no. 2, pp. 321-358, 2019.

C. Gourieroux, A. Monfort and E. M. Renault, "Consistent Pseudo-Maximum Likelihood Estimators," Annals of Economics and Statistics, vol. 125/126, pp. 187-218, 2017.

J. Wang, "The Pseudo Maximum Likelihood Estimator for Quantiles of Survey Variables Get access Arrow," Journal of Survey Statistics and Methodology, vol. 9, no. 1, p. 185–201, 2021.

M. Denuit, D. Hainaut and J. Trufin, Effective Statistical Learning Methods for Actuaries I: GLMs and Extensions, Switzerland: Springer, 2019.

S. L. Lohr, Sampling: Design and Analysis, Second Edition, Boston: Brooks/Cole, 2010.

A. Demnati and J. N. K. Rao, "Linearization variance estimators for model parameters from complex survey data," Survey Methodology, vol. 36, no. 2, pp. 193-201, 2010.

F. L. Haung, "Analyzing Group Level Effects with Clustered Data Using Taylor Series Linearization," Practical Assessment, Research, and Evaluation, vol. 19, no. 1, pp. 1-9, 2014.

R. Prasetya and A. Ridwan, "Data Mining Application on Weather Prediction Using Classification Tree, Naïve Bayes and K-Nearest Neighbor Algorithm with Model Testing of Supervised Learning Probabilistic Brier Score, Confusion Matrix and ROC," Journal of Applied Communication and Information Technologies, vol. 4, no. 2, pp. 25-33, 2019.

S. Aisyah, S. Wahyuningsih and F. Amijaya, "Peramalan Jumlah Titik Panas Provinsi Kalimantan Timur Menggunakan Metode Radial Basis Function Neural Network," Jambura Journal of Probability and Statistics, vol. 2, no. 2, pp. 64-74, 2021.

United Nations Population Fund (UNFPA) and HelpAge International, "Ageing in the Twenty-First Century: A Celebration and A Challenge," UNFPA and HelpAge International, New York and London, 2012.

Published
2022-12-15
How to Cite
[1]
R. Amelia, I. Indahwati, and E. Erfiani, “THE ORDINAL LOGISTIC REGRESSION MODEL WITH SAMPLING WEIGHTS ON DATA FROM THE NATIONAL SOCIO-ECONOMIC SURVEY”, BAREKENG: J. Math. & App., vol. 16, no. 4, pp. 1355-1364, Dec. 2022.