Development of Machine Learning-Based Customer Credit Scoring Models: Analysis of Bank Melli Iran Data for Credit Risk Prediction

Authors

Keywords:

Customer Credit Scoring, Credit Risk, Machine Learning, Default Prediction, XGBoost Algorithm

Abstract

The aim of this study was to develop and evaluate customer credit risk prediction models using machine learning algorithms based on real-world data from Bank Melli Iran. In this research, a dataset consisting of 24,860 individual customers with 42 financial, demographic, behavioral, and credit-related variables was utilized. The default rate of the dataset was 14.8%, providing an appropriate basis for evaluating classification models. After conducting preprocessing procedures, including outlier removal, missing value imputation, and data normalization, five models—Logistic Regression, Decision Tree, Random Forest, XGBoost, and Multilayer Perceptron Neural Network—were trained. Model performance was evaluated using indicators such as Accuracy, Precision, Recall, F1-Score, and Area Under the Curve (AUC). The results demonstrated that the XGBoost algorithm achieved the best performance in predicting customer default probability, with an accuracy of 86.1% and an AUC value of 0.912. Variable importance analysis also revealed that the debt-to-income ratio, number of overdue installments, history of payment delay, and average account balance were among the most influential factors in determining credit risk. The findings indicate that machine learning models, particularly gradient boosting-based methods, can significantly enhance the accuracy of banking credit scoring systems.

References

[1] A. Kanas, D. Vasiliou, and N. Eriotis, "Credit risk and profitability in banking: Evidence from international banks," Research in International Business and Finance, vol. 72, p. 102260, 2025, doi: 10.1016/j.ribaf.2024.102260.

[2] V. Mahadevan, S. Subramaniam, and V. Srivastava, "Carbon Concentration in Bank Portfolios and Efficiency: The Role of Credit Risk and Capitalization," Journal of International Financial Markets, Institutions and Money, vol. 109, p. 102327, 2026, doi: 10.1016/j.intfin.2026.102327.

[3] S. Sadegh Khani and F. Ramazani, "Credit risk and ranking of Bank Saderat Ilam customers using econometric methods and logistic regression model," in Eighth National Conference on Economics, Management, and Accounting, Shirvan, 2021. [Online]. Available: https://en.civilica.com/doc/1234991/.

[4] J. Maghsoodi, Credit Scoring of Customers at Post Bank of Iran. Tehran: Hanoon, 2021.

[5] A. Aghaei and F. Mohammadi, "Providing a credit risk prediction model for Bank Sepah customers using metaheuristic algorithms," Banking Management Quarterly, vol. 12, no. 46, pp. 67-84, 2022.

[6] A. Ghasemi Armaki, M. F. Fallah, and M. Alborzi, "Development and Explanation of a Customer Credit Scoring System for Banks Based on Hybrid Machine Learning Models: A Case Study of Bank Mellat," Financial Management Perspective, vol. 12, no. 37, pp. 69-94, 2022, doi: 10.52547/JFMP.12.37.69.

[7] M. Jalili, M. Khodaei Valehzaghar, and M. Kaneshlou, "Credit Scoring of Real Customers in the Country's Banking System," Quarterly Journal of Quantitative Studies in Management, vol. 1, no. 3, pp. 127-148, 2023 2023. [Online]. Available: https://sid.ir/paper/191947/en.

[8] J. H. Plikas and D. Kenourgios, "Renewable Energy and Bank Credit Risk: The Mediating Role of Economic Growth," Economic Modelling, p. 107576, 2026, doi: 10.1016/j.econmod.2026.107576.

[9] V. Pourshahabi, H. Khodadadi Didani, and A. Taadolkhah, "Designing a Strategic Alliace Implementation Model in Customer-Based Organizations by Exploratory Mixed Method (Case study: Saman Bank)," Strategic Management Researches, vol. 27, no. 83, pp. 41-66, 2022. [Online]. Available: https://smr.journals.iau.ir/article_690906_2ec2f7d0a4fb6ed590b6cc50e26ec6c7.pdf.

[10] X. Yan, Y. Li, F. Nie, and R. Li, "Bank Customer Segmentation and Marketing Strategies Based on Improved DBSCAN Algorithm," Applied Sciences, vol. 15, no. 6, 2025.

[11] L. Yang and Y. Zhou, "Physical environment and its impact on customer perceptions of service quality in retail banking," International Journal of Bank Marketing, vol. 43, no. 2, pp. 210-228, 2025.

[12] M. J. Taghipourian, "The role of emotional branding on brand equity in the banking sector: Is customer experience a mediator?," Dynamic Management and Business Analysis, pp. 1-17, 2026. [Online]. Available: https://www.dmbaj.com/index.php/dmba/article/view/247.

[13] M. Jamshidi, "Determining and Ranking the Credit Scoring Indicators of Sepah Bank Customers," Quarterly Journal of Islamic Financial and Banking Studies, vol. 7, no. 17, pp. 47-66, 2022 2022. [Online]. Available: https://jifb.ibi.ac.ir/article_158635.html?lang=en.

[14] S. Abolghasemi, A. Nabavi Chashmi, E. Memarian, and M. Taghipour, "Explaining the Operational and Credit Risk Model in Banks with Emphasis on Ethical Validation of Customers," Ethics in Science and Technology, vol. 19, no. 2, pp. 185-190, 2024.

[15] A. Yāvarī, H. Jabbārī, and H. Panāhiyān, "Designing a Credit Risk Management Model with a Pathological Approach to Guarantees and Collaterals of Bank Facilities," Technology in Entrepreneurship and Strategic Management Quarterly, vol. 4, no. 2, pp. 1-22, 2025, doi: 10.61838/kman.jtesm.4.2.11.

[16] L. Abdollahzadeh Ramhormozi, A. H. Azh, and M. Khaki, "Evaluation and ranking service quality in the banking system and its impact on CRM Rishi Kant from the model (Case study: Melli Bank of Iran)," (in en), Transactions on Data Analysis in Social Science, vol. 5, no. 1, pp. 11-24, 2023, doi: 10.47176/TDASS/2023.11.

Downloads

Published

2026-11-01

Issue

Section

Articles

How to Cite

Abdi, B. ., Moradi, A. ., & Fatemi, A. . (2026). Development of Machine Learning-Based Customer Credit Scoring Models: Analysis of Bank Melli Iran Data for Credit Risk Prediction. Management Strategies and Engineering Sciences, 1-11. https://msesj.com/index.php/mses/article/view/384

Similar Articles

1-10 of 110

You may also start an advanced similarity search for this article.