STUDY TIME CLASSIFICATION OF MATHEMATICS AND INFORMATION TECHNOLOGY DEPARTMENT OF KALIMANTAN INSTITUTE OF TECHNOLOGY USING NAÏVE BAYES ALGORITHM

  • Fatrysia Wikarya Sucipto Mathematics Study Program, Department of Mathematics and Technology Information, Institut Teknologi Kalimantan, Indonesia
  • Ramadhan Paninggalih Informatics Study Program, Department of Mathematics and Technology Information, Institut Teknologi Kalimantan, Indonesia
  • Indira Anggriani Mathematics Study Program, Department of Mathematics and Technology Information, Institut Teknologi Kalimantan, Indonesia
Keywords: Accuracy, Classification, College Quality, F1-Score, Naïve Bayes

Abstract

Institut Teknologi Kalimantan (ITK) is one of the state universities in Indonesia which has 5 majors, one of them is the Department of Mathematics and Information Technology (JMTI). JMTI has six study programs, and only three study programs have graduates, namely Mathematics, Information Systems, and Informatics. Every year the number of new students continues to grow, but this is not proportional to the number of graduates, because some students study for more than 8 semesters. Because of this, the quality of study programs being poor. In this research, a model was built that could classify student study timeliness, using the naïve Bayes algorithm. The data used is data from JMTI student graduates from the 2013 to 2019 batch. The 2013 to 2018 batch data will be training data and validation data, while the 2019 batch data will be testing data. This research compare accuracy and F1-score naïve Bayes algorithm without correlation and with correlation. The best model obtained from training data is a model with variables that have gone through a correlation test, namely 70:30, 80:20, and 90:10. The attributes selected after the correlation test, namely, IP Tahap Bersama, GPA, Final GPA, Length of Study (Semester), dan Graduation GPA (Category), yield results for accuracy and an F1-score of 1.

Downloads

Download data is not yet available.

References

W. N. Putri, "Pengaruh Media Pembelajaran Terhadap Motivasi Belajar Bahasa Arab Siswa Madrasah Tsanawiyah," Lisania, 2017.

itk.ac.id, "Tentang ITK," 2022. [Online]. Available: https://itk.ac.id/home/tentang-itk/. [Accessed 21 September 2022].

Wikipedia, "Institut Teknologi Kalimantan," 2022. [Online]. Available: https://id.wikipedia.org/wiki/Institut_Teknologi_Kalimantan. [Accessed 19 September 2022].

D. Heryana, DATA MINING UNTUK MEMPREDIKSI KELULUSAN MAHASISWA PENDIDIKAN MATEMATIKA UIN RADEN INTAN LAMPUNG MENGGUNAKAN NAIVE BAYES, Lampung: UNIVERSITAS ISLAM NEGERI RADEN INTAN, 2019.

N. M. M. Alfitri and E. Mashamy, "Klasifikasi Data Penduduk Untuk Menerima Bantuan Pangan Non Tunai Menggunakan Algoritma Naïve Bayes," Jurnal Riset Komputer, vol. 9, pp. 1035-1043, 2022.

M. Y. Putra and D. I. Putri, "PEMANFAATAN ALGORITMA NAÏVE BAYES DAN K-NEAREST NEIGHBOR UNTUK KLASIFIKASI JURUSAN SISWA KELAS XI," Jurnal Teknokompak, vol. 16, pp. 176-187, 2022.

N. Gogtay and U. Thatte, "Principles of Correlation Analysis," Journal of The Association of Physicians of India, vol. 65, pp. 78-81, 2017.

D. Edelmann, T. F. Móri and G. J. Székely, "Onrelationships between the Pearson and the distance correlation coefficients," Statistics and Probability Letters, vol. 169, 2021.

Bustami, "Penerapan Algoritma Naïve Bayes Untuk Mengklasifikasi Data Nasabah Asuransi," Jurnal Informatika, pp. 884-898, 2014.

A. Luque, A. Carrasco, . A. Martín and A. d. l. Heras, "The impact of class imbalance in classification performance metrics based on the binary confusion matrix," Pattern Recognition, vol. 91, pp. 216-231, 2019.

K. Barkved, "How To Know if Your Machine Learning Model Has Good Performance," 9 March 2022. [Online]. Available: https://www.obviously.ai/post/machine-learning-model-performance.

Harikrishnan, "Confusion Matrix, Accuracy, Precision, Recall, F1 Score," 2019. [Online]. Available: https://medium.com/analytics-vidhya/confusion-matrix-accuracy-precision-recall-f1-score-ade299cf63cd. [Accessed 24 September 2022].

developers.google.com, "Machine Learning," 2022. [Online]. Available:https://developers.google.com/machine-learning/data-prep/construct/sampling-splitting/imbalanced-data [Accessed 24 September 2022].

B. Krawczyk, "Learning from imbalanced data: open challenges and future directions," Progress in Artificial Intelligence, vol. 5, pp. 221-232, 2016.

A. Fernández, S. García, M. Galar, R. C. Prati, B. Krawczyk and F. Herrera, Learning from Imbalanced Data Sets, Springer Cham, 2018.

Published
2023-09-30
How to Cite
[1]
F. Sucipto, R. Paninggalih, and I. Anggriani, “STUDY TIME CLASSIFICATION OF MATHEMATICS AND INFORMATION TECHNOLOGY DEPARTMENT OF KALIMANTAN INSTITUTE OF TECHNOLOGY USING NAÏVE BAYES ALGORITHM”, BAREKENG: J. Math. & App., vol. 17, no. 3, pp. 1419-1428, Sep. 2023.