APPLICATION OF BAGGING CART IN THE CLASSIFICATION OF ON-TIME GRADUATION OF STUDENTS IN THE STATISTICS STUDY PROGRAM OF TANJUNGPURA UNIVERSITY
Abstract
The timeliness of graduation is used as the success of students in pursuing education which can be seen from the time taken and measured by the predicate of graduation obtained. The characteristics of students who tend to graduate not or on time can be analyzed using classification techniques. Classification and Regression Tree (CART) is one of the classification tree methods. There is a weakness in CART, which is less stable in predicting a single classification tree. The weaknesses in CART can be improved by using Ensemble methods, one of which is Bootstrap Aggregating (Bagging) which can reduce classification errors and increase accuracy in a single classification model. This study aims to classify and determine the accuracy of Bagging CART in the case of the accuracy of student graduation classification. The number of samples used is 140 data on the graduation status of Untan Statistics Study Program students from Period I of the 2017/2018 academic year to Period II of the 2022/2023 academic year. The variables used are the timeliness of graduation which is categorized into two namely Not and On Time, Gender, Semester 1 GPA, Semester 2 GPA, Semester 3 GPA, Semester 4 GPA, Region of Origin Domicile, High School Accreditation, Entry Path, Scholarship, and first TUTEP. A good classification can be seen from the accuracy value. The CART method obtained an accuracy value of 70%. While using the CART Bagging method obtained an accuracy value of 85.71%. Based on the accuracy value obtained, the application of the CART Bagging method can increase accuracy and correct classification errors on a single CART classification tree by 15.71% by resampling 25 times.
Downloads
References
A. Wibowo, D. Manongga, and H. D. Purnomo, “The Utilization of Naive Bayes and C.45 in Predicting The Timeliness of Students’ Graduation,” Scientific Journal of Informatics, vol. 7, no. 1, pp. 99–112, 2020, doi: 10.15294/sji.v7i1.24241.
J. Ha, M. Kambe, and J. Pe, Data Mining: Concepts and Techniques. 2011. doi: 10.1016/C2009-0-61819-5.
W. Agwil, H. Fransiska, and N. Hidayati, “Analisis Ketepatan Waktu Lulus Mahasiswa Dengan Menggunakan Bagging Cart,” FIBONACCI: Jurnal Pendidikan Matematika dan Matematika, vol. 6, no. 2, p. 155, 2020, doi: 10.24853/fbc.6.2.155-166.
N. Z. Zacharis, “Classification and regression trees (CART) for predictive modeling in blended learning,” International Journal of Intelligent Systems and Applications, vol. 10, no. 3, pp. 1–9, 2018, doi: 10.5815/ijisa.2018.03.01.
D. Ratnaningrum, M. A. Mukid, and T. Wuryandari, “Analisis Klasifikasi Nasabah Kredit Menggunakan Bootstrap Aggregating Classification And Regression Trees (Bagging CART),” Jurnal Gaussian, vol. 5, no. 1, pp. 81–90, 2016, [Online]. Available: http://ejournal-s1.undip.ac.id/index.php/gaussian
S. Gocheva-Ilieva, H. Kulina, and A. Ivanov, “Assessment of students’ achievements and competencies in mathematics using cart and cart ensembles and bagging with combined model improvement by mars,” Mathematics, vol. 9, no. 1, pp. 1–17, 2021, doi: 10.3390/math9010062.
G. Kesavaraj and S. Sukumaran, “06726842,” 2013.
A. Maesya and T. Hendiyanti, “Forecasting Student Graduation with Classification and Regression Tree (CART) Algorithm,” IOP Conference Series: Materials Science and Engineering, vol. 621, no. 1, 2019, doi: 10.1088/1757-899X/621/1/012005.
Y. Y. Song and Y. Lu, “Decision tree methods: applications for classification and prediction,” Shanghai Archives of Psychiatry, vol. 27, no. 2, pp. 130–135, 2015, doi: 10.11919/j.issn.1002-0829.215044.
M. M. Ghiasi, S. Zendehboudi, and A. A. Mohsenipour, “Decision tree-based diagnosis of coronary artery disease: CART model,” Computer Methods and Programs in Biomedicine, vol. 192, p. 105400, 2020, doi: 10.1016/j.cmpb.2020.105400.
S. Innassuraiya, T. Widiharih, and I. T. Utami, “ANALISIS KLASIFIKASI MENGGUNAKAN METODE REGRESI LOGISTIK BINER DAN BOOTSTRAP AGGREGATING CLASSIFICATION AND REGRESSION TREES (BAGGING CART) (Studi Kasus: Nasabah Koperasi Simpan Pinjam Dan Pembiayaan Syariah (KSPPS)),” vol. 11, no. 2, pp. 183–194, 2022, [Online]. Available: https://ejournal3.undip.ac.id/index.php/gaussian/
S. H. Sumartini, “Penggunaan Metode Classification and Regression Trees (CART) untuk Klasifikasi Rekurensi Pasien Kanker Serviks di RSUD Dr. Soetomo Surabaya,” Jurnal Sains dan Seni ITS, vol. 4, no. 2, pp. 211–216, 2015, [Online]. Available: https://www.neliti.com/publications/15687/penggunaan-metode-classification-and-regression-trees-cart-untuk-klasifikasi-re
X. Li, X. Liu, and P. Gong, “Integrating ensemble-urban cellular automata model with an uncertainty map to improve the performance of a single model,” International Journal of Geographical Information Science, vol. 29, no. 5, pp. 762–785, 2015, doi: 10.1080/13658816.2014.997237.
P. Radha and B. Srinivasan, “Predicting Diabetes by cosequencing the various Data Mining Classification Techniques,” vol. 1, no. 6, pp. 334–339, 2014.
D. Kusnandar, N. N. Debataraja, S. W. Rizki, and E. Saputri, “Water quality mapping in pontianak city using multiple discriminant analysis,” AIP Conference Proceedings, vol. 2268, no. September, 2020, doi: 10.1063/5.0016809.
Y. T. Samuel, J. J. Hutapea, and B. Jonathan, “Predicting the timeliness of student graduation using decision tree c4.5 algorithm in universitas advent Indonesia,” Proceedings of 2019 International Conference on Information and Communication Technology and Systems, ICTS 2019, pp. 276–280, 2019, doi: 10.1109/ICTS.2019.8850948.
F. Rahmad, Y. Suryanto, and K. Ramli, “Performance Comparison of Anti-Spam Technology Using Confusion Matrix Classification,” IOP Conference Series: Materials Science and Engineering, vol. 879, no. 1, 2020, doi: 10.1088/1757-899X/879/1/012076.
Copyright (c) 2023 Widad Imtiyaz, Neva Satyahadewi, Hendra Perdana
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this Journal agree to the following terms:
- Author retain copyright and grant the journal right of first publication with the work simultaneously licensed under a creative commons attribution license that allow others to share the work within an acknowledgement of the work’s authorship and initial publication of this journal.
- Authors are able to enter into separate, additional contractual arrangement for the non-exclusive distribution of the journal’s published version of the work (e.g. acknowledgement of its initial publication in this journal).
- Authors are permitted and encouraged to post their work online (e.g. in institutional repositories or on their websites) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published works.