A machine learning and ANOVA-based approach to model financial predictors for corporate failure in imbalanced dataset: The case of Taiwan
Vol 9, Issue 1, 2025
Abstract
This study examines the key financial factors behind corporate failures in Taiwan over the decade from 1999 to 2009. It proposes a new methodology that combines ANOVA-based variable selection with tuning of the classifier's parameters so that its functional form best describes the data. Our analysis identifies the ten most important factors: return on total assets, ROA(C), before interest and depreciation, debt ratio percentage, consistent EPS across the last four seasons, Retained Earnings to Total Assets, Working Capital to Total Assets, dependency on borrowing, the ratio of Current Liability to Assets, Net Value Per Share (B), the ratio of Working Capital to Equity, and the Liability-Assets Flag. This dual approach allows a more precise identification of the variables most instrumental in leading Taiwanese firms to bankruptcy, using only financial variables and no corporate governance variables. By employing a classification methodology suited to class imbalance, we substantiate the significant influence these financial factors had on the incidence of bankruptcy among Taiwanese companies. Our methodology thus reduces the variable set from 95 to 10 critical factors, improving bankruptcy prediction accuracy and outperforming the results of Liang et al. (2016).
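The general idea of pairing ANOVA-based selection with a tuned, imbalance-aware classifier can be illustrated with a minimal sketch. It is not the authors' implementation: the synthetic data, the choice of logistic regression with `class_weight="balanced"`, the grid of `C` values, and the F1 scoring are all assumptions made for illustration. Only the reduction from 95 variables to 10 is taken from the abstract.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the real data (assumption): 95 features,
# roughly 3% of samples in the positive ("bankrupt") class.
X, y = make_classification(
    n_samples=2000, n_features=95, n_informative=10, n_redundant=5,
    weights=[0.97, 0.03], random_state=0,
)

# ANOVA F-test keeps the 10 highest-scoring features; the classifier
# reweights classes to compensate for the imbalance (assumed choice).
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("anova", SelectKBest(score_func=f_classif, k=10)),
    ("clf", LogisticRegression(class_weight="balanced", max_iter=1000)),
])

# Tune the regularisation strength with stratified folds so that each
# fold keeps the original class proportions; F1 is used because plain
# accuracy is misleading on imbalanced data.
search = GridSearchCV(
    pipeline,
    param_grid={"clf__C": [0.01, 0.1, 1.0, 10.0]},
    scoring="f1",
    cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0),
)
search.fit(X, y)

selected = search.best_estimator_.named_steps["anova"].get_support(indices=True)
print("selected feature indices:", selected)
print("best C:", search.best_params_["clf__C"])
print("cross-validated F1:", round(search.best_score_, 3))
```

Placing the selector inside the pipeline means the F-test is recomputed on the training part of each fold, so the held-out fold does not influence which variables are kept.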
References
Ahmed Sh., Alshater M., Ammari A., Hammami H., Artificial intelligence and machine learning in finance: A bibliometric review, Research in International Business and Finance, 61, 2022, https://doi.org/10.1016/j.ribaf.2022.101646.
Almamy, J., Aston, J., & Ngwa, L., An evaluation of Altman’s Z-score using cash flow ratio to predict corporate failure amid the recent financial crisis: Evidence from the UK. Journal of Corporate Finance, 36, 2016, pp.278–285. https://doi.org/10.1016/j.jcorpfin.201
Altman, E., Financial ratios, discriminant analysis and the prediction of corporate bankruptcy, Journal of Finance, 23, 4, 1968, pp. 589-609
ANOVA f-classification, https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.f_classif.html
Black F., Scholes M., The pricing of options and corporate liabilities, Journal of Political Economy, 81, 3, 1973, pp. 637-654
Bock K., Coussement K., Lessmann S., Cost-sensitive business failure prediction when misclassification costs are uncertain: A heterogeneous ensemble selection approach, European Journal of Operational Research, 285 (20), 2020, pp. 612-630, https://doi.org/10.1016/j.ejor.2020.01.052.
Borchert P., Coussement K., Caigny A., Weerdt J., Extending business failure prediction models with textual website content using deep learning, European Journal of Operational Research, 306, 2023, pp. 348-357
Bragoli D., Ferretti C., Ganugi P., Marseguerra G., Mezzogori D., Zammori F., Machine-learning models for bankruptcy prediction: do industrial variables matter?, Spatial Economic Analysis, 17 (2), 2022, pp. 156-177
Brenes R. F., Johannssen A., Chukhrova N., An intelligent bankruptcy prediction model using a multilayer perceptron, Intelligent Systems with Applications, 16, 2022, https://doi.org/10.1016/j.iswa.2022.200136.
Chatterjee S., Khan P., Byun Y., Recent advances and applications of machine learning in the variable renewable energy sector, Energy Reports, 12, 2024, pp. 5044-5065
Dasilas A., Rigani A., Machine learning techniques in bankruptcy prediction: A systematic literature review, Expert Systems with Applications, 255, Part C, 2024, https://doi.org/10.1016/j.eswa.2024.124761.
Dou W., Taylor L., Wang W., Wang W., Dissecting bankruptcy frictions, Journal of Financial Economics, 142 (3), 2021, pp. 975-1000, https://doi.org/10.1016/j.jfineco.2021.06.014.
Du X., Li W., Ruan S., Li L., CUS-heterogeneous ensemble-based financial distress prediction for imbalanced dataset with ensemble feature selection, Applied Soft Computing, 97, 2020, https://doi.org/10.1016/j.asoc.2020.106758.
Ékes, K. S., & Koloszár, L., The efficiency of bankruptcy forecast models in the Hungarian SME Sector. Journal of Competitiveness, 6(2), 2014, pp. 56–73. https://doi.org/10.7441/joc.2014.02.05
F1 score for imbalanced data, F1 Score in Machine Learning Explained | Encord
F1-score, https://scikit-learn.org/stable/modules/generated/sklearn.metrics.f1_score.html#sklearn.metrics.f1_score
Fernández-Gámez M.Á., Soria J., Santos J., Alaminos D., European country heterogeneity in financial distress prediction: An empirical analysis with macroeconomic and regulatory factors, Economic Modelling, 88, 2020, pp. 398-407
Fitzpatrick P., A comparison of the ratios of successful industrial enterprises with those of failed companies, The Certified Public Accountant, 12, 1932, pp. 727-731, 598-605, 656-662 respectively
Grunert J., Norden L., Weber M., The role of non-financial factors in internal credit ratings, J. Bank. Financ., 29 (2), 2005, pp. 509-531, https://doi.org/10.1016/j.jbankfin.2004.05.017
Hwang, R. C., Cheng, K. F., & Lee, C. F., On multiple-class prediction of issuer credit ratings. Applied Stochastic Models in Business and Industry, 25(5), 2009, pp.535–550, https://doi.org/10.1002/asmb.735
Jabeur S., Stef N., Carmona P., Bankruptcy Prediction using the XGBoost Algorithm and Variable Importance Feature Engineering, Computational Economics, 61 (2), 2023, pp. 715-741
James G., Witten D., Hastie T., Tibshirani R.: “An Introduction to Statistical Learning: with Applications in R”, Springer, Second Edition 2021, https://www.statlearning.com/
Jones, S., “A literature survey of corporate failure prediction models”, Journal of Accounting Literature, 45(2), 2023, pp. 364-405, https://doi.org/10.1108/JAL-08-2022-0086
Kalak I., Azevedo A., Hudson R., Karim M., Stock liquidity and SMEs’ likelihood of bankruptcy: Evidence from the US market, Research in International Business and Finance, 42, 2017, pp. 1383-1393, https://doi.org/10.1016/j.ribaf.2017.07.077.
Karas, M., Reznakova, M., Bartos, V., & Zinecker, M., Possibilities for the application of the Altman model within the Czech Republic, In Recent Researches in Law Science and Finances, 2023, pp. 203–207, http://www.wseas.us/e-library/conferences/2013/Chania/ICFA/ICFA-30.pdf
Ko, Y. C., Fujita, H., & Li, T., An evidential analysis of Altman Z-score for financial predictions: Case study on solar energy companies, Applied Soft Computing, 52, 2017, 748–759, https://doi.org/10.1016/j.asoc.2016.09.050
Kooptiwoot, S. & Javadi, B., Development of Decision Support System Platform for Daily Dietary Plan, Current Nutrition & Food Science, 18, 2022, https://doi.org/10.2174/1573401318666220318102124.
Kooptiwoot, S. & Kooptiwoot, S. & Javadi, B., Application of regression decision tree and machine learning algorithms to examine students’ online learning preferences during COVID-19 pandemic. International Journal of Education and Practice. 12, 2024, pp. 82-94, https://doi.org/10.18488/61.v12i1.3619. (a)
Kooptiwoot, S. & Tharasawatpipat, Ch. & Choo-In, S. & Kayee, P. & Javadi, B., AI-driven telemedicine: Optimizing daily dietary recommendations amidst the COVID-19 pandemic. Journal of Infrastructure, Policy and Development. 8, 2024, https://doi.org/10.24294/jipd.v8i11.8908. (b)
Kooptiwoot, S. & Tharasawatpipat, Ch. & Choo-in, S. & Kayee, P. & Meethongjan, K. & Sangsuwon, Ch. & Javadi, B., Deciphering the complexity of COVID-19 transmission: Unveiling precision through robust vaccination policies and advanced predictive modeling with random forest regression, Journal of Infrastructure, Policy and Development, 8, 2024, https://doi.org/10.24294/jipd.v8i8.5321. (c)
Kou G., Xu Y., Yi Peng, Shen F., Chen Y., Chang K., Kou S., Bankruptcy prediction for SMEs using transactional data and two-stage multiobjective feature selection, Decision Support Systems, 140, 2021, https://doi.org/10.1016/j.dss.2020.113429.
Letizia E., Lillo F., Corporate payments networks and credit risk rating, EPJ Data Sci., 8, 2019, pp. 8-21, https://doi.org/10.1140/epjds/s13688-019-0197-5
Liang D., Lu C., Tsai C., Shih G., Financial ratios and corporate governance indicators in bankruptcy prediction: a comprehensive study, Eur. J. Oper. Res., 252 (2), 2016, pp. 561-572, https://doi.org/10.1016/j.ejor.2016.01.012
Liang D., Tsai C., Wu H., The effect of feature selection on financial distress prediction, Knowledge-Based Systems, 73, 2015, pp. 289.
Logistic Regression in Python, sklearn.linear_model.LogisticRegression — scikit-learn 1.3.2 documentation
Lohmann Ch., Möllenhoff S., How do bankruptcy risk estimations change in time? Empirical evidence from listed US companies, Finance Research Letters, Volume 58, Part B, 2023, https://doi.org/10.1016/j.frl.2023.104389.
Mateika, H., Jia, J., Lillard, L., Cronbaugh, N., & Shin, W., Fallen angel bonds investment and bankruptcy predictions using manual models and automated machine learning, 2022, arXiv preprint arXiv:2212.03454.
Mattos E., Dennis S., Bankruptcy prediction with low-quality financial information, Expert Systems with Applications, 237, 2024, https://doi.org/10.1016/j.eswa.2023.121418.
Mohtasham, F., Pourhoseingholi, M., Hashemi Nazari, S.S. et al. Comparative analysis of feature selection techniques for COVID-19 dataset. Sci Rep 14, 2024, https://doi.org/10.1038/s41598-024-69209-6
Noga, T., & Adamowicz, K., Forecasting bankruptcy in the wood industry. European Journal of Wood Products, 79, 2021, pp. 735–743, https://doi.org/10.1007/s00107-020-01620-y
Odom M., Sharda R., A neural network model for bankruptcy prediction, Proceedings of the IJCNN International Joint Conference on Neural Networks, IEEE, 1990, pp. 163-168
Ohlson J., Financial ratios and the probabilistic prediction of bankruptcy, Journal of Accounting Research, 18 (1), 1980, pp. 109-131
Pereira J.M., Basto M., Silva A., The logistic lasso and ridge regression in predicting corporate failure, Procedia Economics and Finance, 39, 2016, pp. 634.
Qu Y., Quan P., Lei M., Shi Y., Review of bankruptcy prediction using machine learning and deep learning techniques, Procedia Computer Science, 162, 2019, pp. 895-899, https://doi.org/10.1016/j.procs.2019.12.065.
Ross B.C., Mutual information between discrete and continuous data sets, PLoS ONE, 9(2), 2014, https://doi.org/10.1371/journal.pone.0087357.
Ruxanda, G., Zamfir, C., & Muraru, A., Predicting financial distress for Romanian companies. Technological and Economic Development of Economy, 24(6), 2018, 2318–2337. https://doi.org/10.3846/tede.2018.6736
Salehi, M., & Pour, M. D., Bankruptcy prediction of listed companies on the Tehran Stock Exchange. International Journal of Law and Management, 58(5), 2016, 545–561. https://doi.org/10.1108/IJLMA-05-2015-0023
Sarkar S., Sriram R., Bayesian models for early warning of bank failures, Management Science, 47 (11), 2001, pp. 1457-1475
Shen F., Liu Y., Wang R., Zhou W., A dynamic financial distress forecast model with multiple forecast results under unbalanced data environment, Knowledge-Based Systems, 192, 2020, https://doi.org/10.1016/j.knosys.2019.105365.
SVM In Python, sklearn.svm.SVC — scikit-learn 1.3.2 documentation
Szeghalmy S, Fazekas A. A Comparative Study of the Use of Stratified Cross-Validation and Distribution-Balanced Stratified Cross-Validation in Imbalanced Learning. Sensors (Basel)., 23(4), 2023, https://doi.org/10.3390/s23042333.
Uthayakumar, J., Metawa, N., Shankar, K., & Lakshmanaprabu, S. K., Financial crisis prediction model using ant colony optimization. International Journal of Information Management, 50, 2020, pp.538-556.
Veganzones, D. and Severin, E., “Corporate failure prediction models in the twenty-first century: a review”, European Business Review, 33 (2), 2021, pp. 204-226. https://doi.org/10.1108/EBR-12-2018-0209
Voda, A. D., Dobrotă, G., Țîrcă, D. M., Dumitrașcu, D. D., & Dobrotă, D., Corporate bankruptcy and insolvency prediction model. Technological and Economic Development of Economy, 27(5), 2021, pp. 1039-1056, https://doi.org/10.3846/tede.2021.15106
Wang H, Liu X, Undersampling bankruptcy prediction: Taiwan bankruptcy data. PLoS ONE 16(7), 2021, https://doi.org/10.1371/journal.pone.0254030
Yuxia S., Congyuan Y., Zhiya L., Yanting T., Initiative for China to establish a dual model of mixed corporate governance on bankruptcy reorganization: An empirical analysis based on 93 listed companies, Heliyon, 8(12), 2022, https://doi.org/10.1016/j.heliyon.2022.e12007.
Zelenkov Y., Fedorova E., Chekrizov D., Two-step classification method based on genetic algorithm for bankruptcy forecasting, Expert Systems with Applications, 88, 2017, pp. 393.
Zhang, W., Machine Learning Approaches to Predicting Company Bankruptcy. Journal of Financial Risk Management, 6, 2017, pp. 364-374, https://doi.org/10.4236/jfrm.2017.64026.
Zhao J., Ouenniche J., Smedt J., Survey, classification and critical analysis of the literature on corporate bankruptcy and financial distress prediction, Machine Learning with Applications, 15, 2024, https://doi.org/10.1016/j.mlwa.2024.100527.
Zhou L., Lu D., Fujita H., The performance of corporate financial distress prediction models with features selection guided by domain knowledge and data mining approaches, Knowledge-Based Systems, 85, 2015, pp. 52-61.
DOI: https://doi.org/10.24294/jipd10072
Copyright (c) 2025 Borislava Toleva, Ivan Ivanov, Vincent Hooper
License URL: https://creativecommons.org/licenses/by/4.0/