Comparing data mining methods for predicting cost construction projects: A case study of cost management datasets from Thailand

Tanayut Chaitongrat, Kridtsada Janthachai, Wuttipong Kusonkhum, Paranee Boonchai, M. Faisi Ikhwali, Mathinee Khotdee

Article ID: 2801
Vol 8, Issue 5, 2024

VIEWS - 348 (Abstract) 160 (PDF)

Abstract


This research examines three data mining approaches employing cost management datasets from 391 Thai contractor companies to investigate the predictive modeling of construction project failure with nine parameters. Artificial neural networks, naive bayes, and decision trees with attribute selection are some of the algorithms that were explored. In comparison to artificial neural network’s (91.33%) and naive bays’ (70.01%) accuracy rates, the decision trees with attribute selection demonstrated greater classification efficiency, registering an accuracy of 98.14%. Finally, the nine parameters include: 1) planning according to the current situation; 2) the company’s cost management strategy; 3) control and coordination from employees at different levels of the organization to survive on the basis of various uncertainties; 4) the importance of labor management factors; 5) the general status of the company, which has a significant effect on the project success; 6) the cost of procurement of the field office location; 7) the operational constraints and long-term safe work procedures; 8) the implementation of the construction system system piece by piece, using prefabricated parts; 9) dealing with the COVID-19 crisis, which is crucial for preventing project failure. The results show how advanced data mining approaches can improve cost estimation and prevent project failure, as well as how computational methods can enhance sustainability in the building industry. Although the results are encouraging, they also highlight issues including data asymmetry and the potential for overfitting in the decision tree model, necessitating careful consideration.


Keywords


contractor costs; data mining; neural networks; naïve bayes; decision tree

Full Text:

PDF


References


Abdul-Samad, Z., & Kulandaisamy, P. P. (2022). Cost Management for Information and Communication Technology Projects. Journal of Engineering, Project, and Production Management, 12(2), 166–178. https://doi.org/10.32738/jeppm-2022-0015

Abu Aisheh, Y. I. (2021). Lessons Learned, Barriers, and Improvement Factors for Mega Building Construction Projects in Developing Countries: Review Study. Sustainability, 13(19), 10678. https://doi.org/10.3390/su131910678

Aghimien, D. O., Adegbembo, T. F., Aghimien, E. I., et al. (2018). Challenges of Sustainable Construction: A Study of Educational Buildings in Nigeria. International Journal of Built Environment and Sustainability, 5(1). https://doi.org/10.11113/ijbes.v5.n1.244

Ahlawat, K., Chug, A., & Singh, A. P. (2021). An Insight on the Class Imbalance Problem and Its Solutions in Big Data. In: Large-Scale Data Streaming, Processing, and Blockchain Security. IGI Global.

Antoniou, F., Aretoulis, G., Giannoulakis, D., et al. (2023). Cost and Material Quantities Prediction Models for the Construction of Underground Metro Stations. Buildings, 13(2), 382. https://doi.org/10.3390/buildings13020382

Arena, F., Collotta, M., Luca, L., et al. (2021). Predictive Maintenance in the Automotive Sector: A Literature Review. Mathematical and Computational Applications, 27(1), 2. https://doi.org/10.3390/mca27010002

Berry, M. J., Linoff, G.S. Data Mining Techniques: For Marketing, Sales, and Customer Support, 2nd ed. John Willey & Sons, Canada.

Bilal, M., Oyedele, L. O., Qadir, J., et al. (2016). Big Data in the construction industry: A review of present status, opportunities, and future trends. Advanced Engineering Informatics, 30(3), 500–521. https://doi.org/10.1016/j.aei.2016.07.001

Borovskikh, O., Evstafieva, A., & Marfina, L. (2021). Cost management of a construction company based on functional cost analysis. E3S Web of Conferences, 274, 05003. https://doi.org/10.1051/e3sconf/202127405003

Boujnouni, M. E. (2022). A study and identification of COVID-19 viruses using N-grams with Naïve Bayes, K-Nearest Neighbors, Artificial Neural Networks, Decision tree and Support Vector Machine. 2022 International Conference on Intelligent Systems and Computer Vision (ISCV). https://doi.org/10.1109/iscv54655.2022.9806081

Chaitongrat, T. (2021). Causal relationship model of problems in public sector procurement. International Journal of GEOMATE, 20(80). https://doi.org/10.21660/2021.80.6266

Chamidah, N., Santoni, M. M., & Matondang, N. (2020). The effect of oversampling on the classification of hypertension with the naive bayes algorithm, decision tree, and artificial neural network. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 4(4), 635–641. https://doi.org/10.29207/resti.v4i4.2015

Chen, F., Deng, P., Wan, J., et al. (2015). Data mining for the internet of things: literature review and challenges, International Journal of Distributed Sensor Networks, 11(8), 431047. https://doi.org/10.1155/ 2015/431047

Chen, W. T., Merrett, H. C., Lu, S. T., et al. (2019). Analysis of Key Failure Factors in Construction Partnering—A Case Study of Taiwan. Sustainability, 11(14), 3994. https://doi.org/10.3390/su11143994

Dlamini, M., & Cumberlege, R. (2021). The impact of cost overruns and delays in the construction business. IOP Conference Series: Earth and Environmental Science, 654(1), 012029. https://doi.org/10.1088/1755-1315/654/1/012029

Fan, C., Xiao, F., Li, Z., et al. (2018). Unsupervised data analytics in mining big building operational data for energy efficiency enhancement: A review. Energy and Buildings, 159, 296–308. https://doi.org/10.1016/j.enbuild.2017.11.008

Faten Albtoush, A. M., Doh, S. I., Abdul Rahman, A. R. B., & Albtoush, J. F. A. A. (2020). Factors effecting the cost management in construction projects. International Journal of Civil Engineering and Technology, 11(1).

Gündüz, M., Nielsen, Y., & Özdemir, M. (2013). Quantification of delay factors using the relative importance index method for construction projects in Turkey. Journal of management in engineering, 29(2), 133–139.

Gyadu-Asiedu, W., Ampadu-Asiamah, A., & Fokuo-Kusi, A. (2021). A framework for systemic sustainable construction industry development (SSCID). Discover Sustainability, 2(1). https://doi.org/10.1007/s43621-021-00033-y

Hansen, D. R., Mowen, M. M., & Heitger, D. L. (2021). Cost management. Cengage Learning.

Holm, L., & Schaufelberger, J. E. (2021). Construction cost estimating. Routledge.

Hoseini, E., Van Veen, P., Bosch-Rekveldt, M., & Hertogh, M. (2020). Cost performance and cost contingency during project execution: Comparing client and contractor perspectives. Journal of Management in Engineering, 36(4), 05020006.

Hu, Y.-X., Huai, L.-B., & Cui, R.-Y. (2019). Research on Teaching Evaluation Model Based on Weighted Naive Bayes. 2019 10th International Conference on Information Technology in Medicine and Education (ITME). https://doi.org/10.1109/itme.2019.00112

Hui, S.C., Jha, G. Data mining for customer service support. Information & Management, 38(1), 1–13. https://doi.org/10.1016/S0378-7206(00)00051-3

Katarya, R., & Srinivas, P. (2020). Predicting Heart Disease at Early Stages using Machine Learning: A Survey. 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC). https://doi.org/10.1109/icesc48915.2020.9155586

Kim, S., & Lee, H. (2022). Customer Churn Prediction in Influencer Commerce: An Application of Decision Trees. Procedia Computer Science, 199, 1332–1339. https://doi.org/10.1016/j.procs.2022.01.169

Kusonkhum, W., Srinavin, K., Leungbootnak, N., et al. (2022). Using a Machine Learning Approach to Predict the Thailand Underground Train’s Passenger. Journal of Advanced Transportation, 2022, 1–15. https://doi.org/10.1155/2022/8789067

Larson, E. W., Gray, C. F. (2013). Project management: the managerial process. McGraw Hill Professional.

László, K., & Ghous, H. (2020). Efficiency comparison of Python and RapidMiner. Multidiszciplináris Tudományok, 10(3), 212–220.

Liu, P., Qingqing, W., & Liu, W. (2021). Enterprise human resource management platform based on FPGA and data mining. Microprocessors and Microsystems, 80, 103330. https://doi.org/10.1016/j.micpro.2020.103330

Marzukhi, S., Awang, N., Alsagoff, S. N., et al. (2021). RapidMiner and Machine Learning Techniques for Classifying Aircraft Data. Journal of Physics: Conference Series, 1997(1), 012012. https://doi.org/10.1088/1742-6596/1997/1/012012

Mohd, H. N. N., & Shamsul, S. (2011). Critical success factors for software projects: A comparative study. Scientific Research and Essays, 6(10), 2174–2186. https://doi.org/10.5897/sre10.1171

Monteiro, F. P., Sousa, V., Meireles, I., et al. (2021). Cost Modeling from the Contractor Perspective: Application to Residential and Office Buildings. Buildings, 11(11), 529. https://doi.org/10.3390/buildings11110529

Mumali, F. (2022). Artificial neural network-based decision support systems in manufacturing processes: A systematic literature review. Computers & Industrial Engineering, 165, 107964. https://doi.org/10.1016/j.cie.2022.107964

Pertiwi, M. W., Kusmira, M., Rezkiani, R., et al. (2022). Naïve Bayes Classification Model for the Producer Price Index Prediction. SISTEMASI, 11(1), 171. https://doi.org/10.32520/stmsi.v11i1.1669

Plebankiewicz, E. (2018). Model of Predicting Cost Overrun in Construction Projects. Sustainability, 10(12), 4387. https://doi.org/10.3390/su10124387

Raj, P. V., Teja, P. S., Siddhartha, K. S., & Rama, J. K. (2021). Housing with low-cost materials and techniques for a sustainable construction in India-A review. Materials Today: Proceedings, 43, 1850–1855.

Shehadeh, A., Alshboul, O., Al Mamlook, R. E., & Hamedat, O. (2021). Machine learning models for predicting the residual value of heavy construction equipment: An evaluation of modified decision tree, LightGBM, and XGBoost regression. Automation in Construction, 129, 103827.

Sinsom, B., S. (2019). An efficiency comparison in prediction of imbalanced data classification with data mining techniques. ai Journal of Science and Technology, 8(3), 383–393.

Soibelman, L., Kim, H. Data preparation process for construction knowledge generation through knowledge discovery in databases. Journal of Computing in Civil Engineering, 16(1), 39–48. https://doi.org/10.1061/(ASCE)0887-3801(2002)16:1(39)

Srinavin, K., Kusonkhum, W., Chonpitakwong, B., et al. (2021). Readiness of Applying Big Data Technology for Construction Management in Thai Public Sector. Journal of Advances in Information Technology, 12(1), 1–5. https://doi.org/10.12720/jait.12.1.1-5

Sweis, G., Sweis, R., Abu Hammad, A., et al. (2008). Delays in construction projects: The case of Jordan. International Journal of Project Management, 26(6), 665–674. https://doi.org/10.1016/j.ijproman.2007.09.009

Trost SM, Oberlender GD. (2003). Predicting accuracy of early cost estimates using factor analysis and multivariate regression. Journal of Construction Engineering and Management, 129(2), 198–204.

Witten, I. H., Frank, E., Hall, M. A., Pal, C. J. (2016). Data Mining: Practical Machine Learning Tools and Techniques, 4th ed. Morgan Kaufmann.

Xu, H., Chen, X., Li, P., Ding, J., & Eghan, C. (2019). A Novel RFID Data Management Model Based on Quantum Cryptography. In: Proceedings of the Third International Congress on Information and Communication Technology: ICICT 2018.

You, Z., & Wu, C. (2019). A framework for data-driven informatization of the construction company. Advanced Engineering Informatics, 39, 269–277. https://doi.org/10.1016/j.aei.2019.02.002

Yu, Z., Haghighat, F., Fung, B.C.M. (2016). Advances and challenges in building engineering and data mining applications for energy-efficient communities. Sustainable Cities and Society, 25, 33–38.

Zeydalinejad, N. (2022). Artificial neural networks vis-à-vis MODFLOW in the simulation of groundwater: a review. Modeling Earth Systems and Environment, 8(3), 2911–2932. https://doi.org/10.1007/s40808-022-01365-y




DOI: https://doi.org/10.24294/jipd.v8i5.2801

Refbacks

  • There are currently no refbacks.


Copyright (c) 2024 Tanayut Chaitongrat, Kridtsada Janthachai, Wuttipong Kusonkhum, Paranee Boonchai, M. Faisi Ikhwali, Mathinee Khotdee

License URL: https://creativecommons.org/licenses/by/4.0/

This site is licensed under a Creative Commons Attribution 4.0 International License.