CREDIT CARD FRAUD DETECTION USING LINEAR DISCRIMINANT ANALYSIS (LDA), RANDOM FOREST, AND BINARY LOGISTIC REGRESSION

  • Muhammad Ahsan Departement of Statistics, Institut Teknologi Sepuluh Nopember
  • Tabita Yuni Susanto Departement of Statistics, Institut Teknologi Sepuluh Nopember
  • Tiza Ayu Virania Department of Statistics, Institut Teknologi Sepuluh Nopember
  • Andi Indra Jaya Departement of Statistics, Institut Teknologi Sepuluh Nopember
Keywords: Binary Logistics Regression, Credit Card, Fraud, Linear Discriminant Analysis, Random Forests

Abstract

The growth of electronic payment usage makes the monetary tension of credit-card deception is changing into major defiance for finance and technology companies. Therefore, pressuring them to continuously advance their fraud detection system is crucial. In this research, we describe fraud detection as a classification issue by comparing three methods. The method used is Linear Discriminant Analysis (LDA), Random Forest, and Binary Logistic Regression. The dataset used is a dataset containing transactions made by credit cards. The challenge in this analysis is that the dataset is highly unbalanced, so SMOTE must perform better on the data. The dataset contains only continuous features that are transformed into Principal Component Scores (PCs). The results show that the binary regression algorithm, the Random Forest algorithm, and the Linear Discriminant Analysis with variables that have SMOTE have AUC values greater than using the original variables. The largest AUC value was obtained by binary logistic regression with 90:10 separation data and Random Forest Algorithm with 60:40 separation data.

Downloads

Download data is not yet available.

References

M. Ahsan, A. K. Anam, E. Julian, and A. I. Jaya, “Interpretable Predictive Model of Network Intrusion Using Several Machine Learning Algorithms,” BAREKENG J. Ilmu Mat. dan Terap., vol. 16, no. 1, pp. 57–64, 2022.

A. Tharwat, T. Gaber, A. Ibrahim, and A. E. Hassanien, “Linear discriminant analysis: A detailed tutorial,” Ai Commun., vol. 30, pp. 169–190, 2017, doi: 10.3233/AIC-170729.

F. Carcillo, Y.-A. Le Borgne, O. Caelen, Y. Kessaci, F. Oblé, and G. Bontempi, “Combining unsupervised and supervised learning in credit card fraud detection,” Inf. Sci. (Ny)., vol. 557, pp. 317–331, 2021.

A. Husejinovic, “Credit card fraud detection using naive Bayesian and C4. 5 decision tree classifiers,” Husejinovic, A.(2020). Credit card Fraud Detect. using naive Bayesian C, vol. 4, pp. 1–5, 2020.

S. Dhankhad, E. Mohammed, and B. Far, “Supervised machine learning algorithms for credit card fraudulent transaction detection: a comparative study,” in 2018 IEEE International Conference on Information Reuse and Integration (IRI), 2018, pp. 122–125.

N. Malini and M. Pushpa, “Analysis on credit card fraud identification techniques based on KNN and outlier detection,” in 2017 Third International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB), 2017, pp. 255–258.

S. Patil, V. Nemade, and P. K. Soni, “Predictive modelling for credit card fraud detection using data analytics,” Procedia Comput. Sci., vol. 132, pp. 385–395, 2018.

K. Modi and R. Dayma, “Review on fraud detection methods in credit card transactions,” in 2017 International Conference on Intelligent Computing and Control (I2C2), 2017, pp. 1–5.

J. O. Awoyemi, A. O. Adetunmbi, and S. A. Oluwadare, “Credit card fraud detection using machine learning techniques: A comparative analysis,” in 2017 International Conference on Computing Networking and Informatics (ICCNI), 2017, pp. 1–9, doi: 10.1109/ICCNI.2017.8123782.

K. Fu, D. Cheng, Y. Tu, and L. Zhang, “Credit card fraud detection using convolutional neural networks,” in International conference on neural information processing, 2016, pp. 483–490.

M. Komorowski, D. Marshall, J. Salciccioli, and Y. Crutain, “Exploratory Data Analysis,” in Secondary Analysis of Electronic Health Records, 2016, pp. 185–203.

R. A. Johnson and G. K. Bhattacharyya, Statistics: principles and methods. John Wiley & Sons, 2019.

A. C. Sitepu, W. Wanayumini, and Z. Situmorang, “Determining Bullying Text Classification Using Naive Bayes Classification on Social Media,” J. Varian, vol. 4, no. 2, pp. 133–140, 2021.

I. K. P. Suniantara, G. Suwardika, and S. Soraya, “Peningkatan Akurasi Klasifikasi Ketidaktepatan Waktu Kelulusan Mahasiswa Menggunakan Metode Boosting Neural Network,” J. Varian, vol. 3, no. 2, pp. 95–102, 2020.

P. Galdi and R. Tagliaferri, “Data Mining: Accuracy and Error Measures for Classification and Prediction,” 2018.

Published
2022-12-15
How to Cite
[1]
M. Ahsan, T. Susanto, T. Virania, and A. I. Jaya, “CREDIT CARD FRAUD DETECTION USING LINEAR DISCRIMINANT ANALYSIS (LDA), RANDOM FOREST, AND BINARY LOGISTIC REGRESSION”, BAREKENG: J. Math. & App., vol. 16, no. 4, pp. 1337-1346, Dec. 2022.