The International Arab Journal of Information Technology (IAJIT)

..............................
..............................
..............................


Support Vector Machine with Information Gain Based Classification for Credit Card Fraud Detection System

In the credit card industry, fraud is one of the major issues to handle as sometimes the genuine credit card customers may get misclassified as fraudulent and vice-versa. Several detection systems have been developed but the complexity of these systems along with accuracy and precision limits its usefulness in fraud detection applications. In this paper, a new methodology Support Vector Machine with Information Gain (SVMIG) to improve the accuracy of identifying the fraudulent transactions with high true positive rate for the detection of frauds in credit card is proposed. In SVMIG, the min- max normalization is used to normalize the attributes and the feature set of the attributes are reduced by using information gain based attribute selection. Further, the Apriori algorithm is used to select the frequent attribute set and to reduce the candidate’s itemset size while detecting fraud. The experimental results suggest that the proposed algorithm achieves 94.102% higher accuracy on the standard dataset compared to the existing Bayesian and random forest based approaches for a large sample size in dealing with legal and fraudulent transactions.

[1] Adewumi A. and Akinyelu A., “A Survey of Machine-Learning and Nature-inspired based Credit Card Fraud Detection Techniques,” International Journal of System Assurance Engineering and Management, vol. 8, no. 2, pp. 937-953, 2017.

[2] Agrawal R. and Srikant R., “Fast Algorithms for Mining Association Rules,” in Proceedings of 20th International Conference on Very Large Databases, San Francisco, pp. 487-499, 1994.

[3] Behera T. and Panigrahi S., “Credit Card Fraud Detection: A Hybrid Approach Using Fuzzy Clustering and Neural Network,” in Proceedings of 2nd IEEE International Conference on Advances in Computing and Communication Engineering, Dehradun, pp. 494-499, 2015.

[4] Cai S., Hao S., Sun R., and Wu G., “Mining Recent Maximal Frequent Itemsets over Data Streams with Sliding Window,” The International Arab Journal of Information Technology, vol. 16, no. 6, pp. 961-969, 2019.

[5] Darwish S., “An Intelligent Credit Card Fraud Detection Approach based on Semantic Fusion of Two Classifiers,” Soft Computing, vol. 24, no. 2, pp. 1243-1253, 2020.

[6] Datta D., De A., Roy D., and Dutta S., “Apriori Algorithm using Hashing for Frequent Itemsets Mining,” International Journal of Education and Management Engineering, vol. 8, no. 6, pp. 46- 58, 2018.

[7] De Sá A., Pereira A., and Pappa G., “A Customized Classification Algorithm for Credit Card Fraud Detection,” Engineering Applications of Artificial Intelligence, vol. 72, pp. 21-29, 2018.

[8] Delamaire L., Abdou H., and Pointon J., “Credit Card Fraud and Detection Techniques: a Review,” Banks and Bank Systems, vol. 4, no. 2, pp. 57-68, 2009.

[9] Domingos P. and Pazzani M., “Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier,” in Proceedings 206 The International Arab Journal of Information Technology, Vol. 18, No. 2, March 2021 of the 13th International Conference on Machine Learning, Bari, pp. 105-112, 1996.

[10] Duman E. and Ozcelik M., “Detecting Credit Card Fraud by Genetic Algorithm and Scatter Search,” Expert Systems with Applications, vol. 38, no. 10, pp. 13057-13063, 2011.

[11] Excell D., “Bayesian Inference-The Future of Online Fraud Protection,” Computer Fraud and Security, vol. 2012, no. 2, pp. 8-11, 2012.

[12] Fu K., Cheng D., Tu Y., and Zhang L., “Credit Card Fraud Detection using Convolutional Neural Networks,” in Proceedings of the International Conference on Neural Information Processing, Kyoto, pp. 483-490, 2016.

[13] Jain Y., Tiwari N., Dubey S., and Jain S., “A Comparative Analysis of Various Credit Card Fraud Detection Techniques,” International Journal of Recent Technology and Engineering, vol. 7, no. 5S2, pp. 402-407, 2019.

[14] Jha S., Guillen M., and Westland J., “Employing Transaction Aggregation Strategy to Detect Credit Card Fraud,” Expert Systems with Applications, vol. 39, no. 16, pp. 12650-12657, 2012.

[15] Jiang C., Song J., Liu G., Zheng L., and Luan W., “Credit Card Fraud Detection: A Novel Approach using Aggregation Strategy and Feedback Mechanism,” IEEE Internet of Things Journal, vol. 5, no. 5, pp. 3637-3647, 2018.

[16] Khare N. and Sait S., “Credit Card Fraud Detection using Machine Learning Models and Collating Machine Learning Models,” International Journal of Pure and Applied Mathematics, vol. 118, no. 20, pp. 825-838, 2018.

[17] Maes S., Tuyls K., Vanschoenwinkel B., and Manderick B., “Credit Card Fraud Detection using Bayesian and Neural Networks,” in Proceedings of the 1st International Naiso Congress on Neuro Fuzzy Technologies, Havana, Cuba, pp. 261-270, 2002.

[18] Ogwueleka F., “Data Mining Application in Credit Card Fraud Detection System,” Journal of Engineering Science and Technology, vol. 6, no. 3, pp. 311-322, 2011.

[19] Pei J., Han J., Mortazavi-Asl B., Wang J., Pinto H., Chen Q., Dayal U., and Hsu M., “Mining Sequential Patterns by Pattern-growth: The Prefixspan Approach,” IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 11, pp. 1424-1440, 2004.

[20] Quah J. and Sriganesh M., “Real-time Credit Card Fraud Detection using Computational Intelligence,” Expert Systems with Applications, vol. 35. no. 4, pp. 1721-1732, 2008.

[21] Randhawa K., Loo C., Seera M., Lim C., and Nandi A., “Credit Card Fraud Detection using AdaBoost and Majority Voting,” IEEE Access, vol. 6, pp. 14277-14284, 2018.

[22] Razooqi T., Khurana P., Raahemifar K., and Abhari A., “Credit Card Fraud Detection using Fuzzy Logic and Neural Network,” in Proceedings of 19th Communications and Networking Symposium, Pasadena, pp. 1-5, 2016.

[23] Srivastava A., Kundu A., Sural S., and Majumdar A., “Credit Card Fraud Detection using Hidden Markov Model,” IEEE Transactions on Dependable and Secure Computing, vol. 5, no. 1, pp. 37-48, 2008.

[24] Teh B., Islam M. B., Kumar N., Islam M., and Eaganathan U., “Statistical and Spending Behavior based Fraud Detection of Card-based Payment System,” in Proceedings of the International Conference on Electrical Engineering and Informatics, Banda Aceh, pp. 78-83, 2018.

[25] Tripathi K. and Pavaskar M., “Survey on Credit Card Fraud Detection Methods,” International Journal of Emerging Technology and Advanced Engineering, vol. 2, no. 11, pp. 721-726, 2012.

[26] UCI Machine Learning Repository, default of credit card clients dataset, https://archive.ics.uci.edu/ml/datasets/default+of +credit+card+clients, Last Visited, 2020.

[27] Van Vlasselaer V., Bravo C., Caelen O., Eliassi- Rad T., Akoglu L., Snoeck M., and Baesens B., “APATE: A Novel Approach for Automated Credit Card Transaction Fraud Detection using Network-based Extensions,” Decision Support Systems, vol. 75, pp. 38-48, 2015.

[28] Wang C. and Han D., “Credit Card Fraud Forecasting Model based on Clustering Analysis and Integrated Support Vector Machine,” Cluster Computing, vol. 22, no. 6, pp. 13861-13866, 2019.

[29] Wang Y., Adams S., Beling P., Greenspan S., Rajagopalan S., Velez-Rojas M., Mankovski S., Boker S., and Brown D., “Privacy Preserving Distributed Deep Learning and its Application in Credit Card Fraud Detection,” in Proceedings of 17th IEEE International Conference on Trust, Security and Privacy, New York, pp. 1070-1078, 2018.

[30] Xuan S., Liu G., Li Z., Zheng L., Wang S., and Jiang C., “Random Forest for Credit Card Fraud Detection,” in Proceedings of 15th IEEE International Conference on Networking, Sensing and Control, Zhuhai, pp. 1-6, 2018.

[31] Zaslavsky V. and Strizhak A., “Credit Card Fraud Detection using Self-Organizing Maps,” Information and Security, vol. 18, pp. 48-63, 2006.