Enhancing Credit Score Analysis: A Novel Approach with Random Forest and Kernel SVM

International Journal of Electronics and Communication Engineering
© 2023 by SSRG - IJECE Journal
Volume 10 Issue 11
Year of Publication : 2023
Authors : Annie Chacko, D. John Aravindhar, A. Antonidoss
pdf
How to Cite?

Annie Chacko, D. John Aravindhar, A. Antonidoss, "Enhancing Credit Score Analysis: A Novel Approach with Random Forest and Kernel SVM," SSRG International Journal of Electronics and Communication Engineering, vol. 10,  no. 11, pp. 45-54, 2023. Crossref, https://doi.org/10.14445/23488549/IJECE-V10I11P105

Abstract:

Credit score analysis systematically evaluates an individual or entity’s financial history and behaviour to determine their creditworthiness. Traditional methods for credit score analysis have several challenges, such as privacy concerns, lack of flexibility, vulnerability to identity theft, limited data, and real-time analysis. To overcome these complexities, this paper proposes a novel method combining the advantages of Random Forest and kernel Support Vector Machine (SVM). The proposed method has three phases: data preprocessing, feature extraction, and classification. In the preprocessing phase, the proposed method eliminates the noise and errors from the raw data based on obtaining quality input for the analysis. In this study, Random Forest is utilized to extract the most significant features based on the domain and credit data analysis also, kernel SVM is employed for classification by analyzing the components and their impact on credit scoring. Also, the study conducted experiments on the German Credit Risk dataset. The performance evaluation of the proposed method involves analyzing its effectiveness based on evaluation metrics and comparing its performance with existing methods. The experimental results depict that the proposed method obtained better outcomes and achieved high efficiency for credit score analysis.

Keywords:

Credit score analysis, Random Forest, Kernel Support Vector Machine, German Credit Risk dataset, Feature extraction, Classification.

References:

[1] Anil Kumar, Suneel Sharma, and Mehregan Mahdavi, “Machine Learning (ML) Technologies for Digital Credit Scoring in Rural Finance: A Literature Review,” Risks, vol. 9, no. 11, pp. 1-15, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Yoichi Hayashi, “Emerging Trends in Deep Learning for Credit Scoring: A Review,” Electronics, vol. 11, no. 19, pp. 1-30, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Yiheng Li, and Weidong Chen, “A Comparative Performance Assessment of Ensemble Learning for Credit Scoring,” Mathematics, vol. 8, no. 10, pp. 1-19, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Beibei Niu, Jinzheng Ren, and Xiaotao Li, “Credit Scoring Using Machine Learning by Combing Social Network Information: Evidence from Peer-to-Peer Lending,” Information, vol. 10, no. 12, pp. 1-14, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Apostolos Ampountolas et al., “A Machine Learning Approach for Micro-Credit Scoring,” Risks, vol. 9, no. 3, pp. 1-20, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Juan Laborda, and Seyong Ryoo, “Feature Selection in a Credit Scoring Model,” Mathematics, vol. 9, no. 7, pp. 1-22, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Sunghyon Kyeong, Daehee Kim, and Jinho Shin, “Can System Log Data Enhance the Performance of Credit Scoring?-Evidence from an Internet Bank in Korea,” Sustainability, vol. 14, no. 1, pp. 1-12, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Lars Ole Hjelkrem, Petter Eilif de Lange, and Erik Nesset, “The Value of Open Banking Data for Application Credit Scoring: Case Study of a Norwegian Bank,” Journal of Risk and Financial Management, vol. 15, no. 12, pp. 1-15, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Paweł Pławiak et al., “DGHNL: A New Deep Genetic Hierarchical Network of Learners for Prediction of Credit Scoring,” Information Sciences, vol. 516, pp. 401-418, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Xiaohong Chen et al., “A Novel GSCI-Based Ensemble Approach for Credit Scoring,” IEEE Access, vol. 8, pp. 222449-222465, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Pranith Kumar Roy, and Krishnendu Shaw, “A Multicriteria Credit Scoring Model for SMEs Using Hybrid BWM and Topsis,” Financial Innovation, vol. 7, pp. 1-27, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Srđan Jelinek et al., “A Novel IBA-DE Hybrid Approach for Modeling Sovereign Credit Ratings,” Mathematics, vol. 10, no. 15, pp. 1- 21, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Mostafa Yousofi Tezerjan, Azamdokht Safi Samghabadi, and Azizollah Memariani, “ARF: A Hybrid Model for Credit Scoring in Complex Systems,” Expert Systems with Applications, vol. 185, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Diwakar Tripathi, B. Ramachandra Reddy, and Alok Kumar Shukla, “CFR: Collaborative Feature Ranking for Improving the Performance of Credit Scoring Data Classification,” Computing, vol. 104, no. 4, pp. 893-923, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Peshawa J. Muhammad Ali, and Rezhna H. Faraj, “Data Normalization and Standardization: A Technical Report,” Koya University, Ebril, Iraq, Machine Learning Technical Reports, vol. 1, no. 1, pp. 1-6, 2014.
[Google Scholar]
[16] Mohammad Savargiv, Behrooz Masoumi, and Mohammad Reza Keyvanpour, “A New Random Forest Algorithm Based on Learning Automata,” Computational Intelligence and Neuroscience, vol. 2021, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Diwakar Tripathi et al., Experimental Analysis of Machine Learning Methods for Credit Score Classification,” Progress in Artificial Intelligence, vol. 10, pp. 217-243, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Shihao Jiang, Richard Hartley, and Basura Fernando, “Kernel Support Vector Machines and Convolutional Neural Networks,” 2018 Digital Image Computing: Techniques and Applications (DICTA), pp. 1-7, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[19] German Credit Risk, Credit Classification. [Online]. Available: https://www.kaggle.com/datasets/uciml/german-credit