Policy-driven innovations in fraud prevention: Developing an ARFLGB-XGBoost early warning model to mitigate online romance scams in telecommunication networks
Vol 8, Issue 12, 2024
Abstract
To address the escalating problem of online romance scams (ORS) within telecom fraud, we developed an Adaptive Random Forest Light Gradient Boosting (ARFLGB)-XGBoost early warning system. Our method compiles detailed ORS incident data into a categorized 24-variable dataset and analyzes feature importance with Random Forest and LightGBM models. An adaptive algorithm, ARFLGB, then optimizes these features for integration with XGBoost, enhancing early detection of ORS threats. Compared with traditional models, our model improved accuracy by 3.9%, precision by 12.5%, recall by 5%, F1 score by 5.6%, and Area Under the Curve (AUC) by 5.2%. This research highlights the essential role of advanced fraud detection in preserving the integrity of communication networks, which contributes to a stable economy and public safety, and it has implications for policymakers and industry in advancing secure communication infrastructure.
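The abstract does not spell out how the adaptive step combines the Random Forest and LightGBM importance scores, so the following is only an illustrative sketch of one plausible reading: normalise both importance maps, blend them with a weight `alpha`, keep the top `k` features, and pick the `alpha` whose subset scores best. The function names, the `alpha` grid, and the `evaluate` callback (which would stand in for something like a cross-validated XGBoost score) are all assumptions, not the authors' published algorithm.

```python
from typing import Callable, Dict, List, Sequence


def normalise(scores: Dict[str, float]) -> Dict[str, float]:
    """Scale importance scores so they sum to 1 (all-zero input stays zero)."""
    total = sum(scores.values())
    if total == 0:
        return {name: 0.0 for name in scores}
    return {name: value / total for name, value in scores.items()}


def blend_importances(
    rf: Dict[str, float], lgb: Dict[str, float], alpha: float
) -> Dict[str, float]:
    """Weighted average of two normalised importance maps.

    alpha = 1.0 uses only the first map, alpha = 0.0 only the second.
    """
    rf_n, lgb_n = normalise(rf), normalise(lgb)
    features = set(rf_n) | set(lgb_n)
    return {
        f: alpha * rf_n.get(f, 0.0) + (1.0 - alpha) * lgb_n.get(f, 0.0)
        for f in features
    }


def top_k(scores: Dict[str, float], k: int) -> List[str]:
    """Return the k highest-scoring names; ties are broken alphabetically."""
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [name for name, _ in ranked[:k]]


def select_features(
    rf: Dict[str, float],
    lgb: Dict[str, float],
    k: int,
    evaluate: Callable[[List[str]], float],
    alphas: Sequence[float] = (0.0, 0.25, 0.5, 0.75, 1.0),
) -> List[str]:
    """Try each blend weight and keep the subset with the best score.

    `evaluate` is a hypothetical hook: in practice it might train the
    downstream classifier on the subset and return a validation metric.
    """
    best_subset: List[str] = []
    best_score = float("-inf")
    for alpha in alphas:
        subset = top_k(blend_importances(rf, lgb, alpha), k)
        score = evaluate(subset)
        if score > best_score:
            best_subset, best_score = subset, score
    return best_subset
```

In use, `rf` and `lgb` would be filled from the fitted models' importance attributes, and the selected subset would be passed to the downstream classifier. The grid search over `alpha` is just the simplest way to make the blend data-dependent; other adaptive schemes are equally compatible with the abstract's description.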
DOI: https://doi.org/10.24294/jipd.v8i12.7153
Copyright (c) 2024 Guancheng Chen, Wenzhuo Du, Yichen Shan, Anqi Wang, Liping Wang
License URL: https://creativecommons.org/licenses/by/4.0/