Comparing data mining methods for predicting cost construction projects: A case study of cost management datasets from Thailand
Vol 8, Issue 5, 2024
VIEWS - 1463 (Abstract)
Abstract
This research examines three data mining approaches employing cost management datasets from 391 Thai contractor companies to investigate the predictive modeling of construction project failure with nine parameters. Artificial neural networks, naive bayes, and decision trees with attribute selection are some of the algorithms that were explored. In comparison to artificial neural network’s (91.33%) and naive bays’ (70.01%) accuracy rates, the decision trees with attribute selection demonstrated greater classification efficiency, registering an accuracy of 98.14%. Finally, the nine parameters include: 1) planning according to the current situation; 2) the company’s cost management strategy; 3) control and coordination from employees at different levels of the organization to survive on the basis of various uncertainties; 4) the importance of labor management factors; 5) the general status of the company, which has a significant effect on the project success; 6) the cost of procurement of the field office location; 7) the operational constraints and long-term safe work procedures; 8) the implementation of the construction system system piece by piece, using prefabricated parts; 9) dealing with the COVID-19 crisis, which is crucial for preventing project failure. The results show how advanced data mining approaches can improve cost estimation and prevent project failure, as well as how computational methods can enhance sustainability in the building industry. Although the results are encouraging, they also highlight issues including data asymmetry and the potential for overfitting in the decision tree model, necessitating careful consideration.
Keywords
Full Text:
PDFReferences
- Abdul-Samad, Z., & Kulandaisamy, P. P. (2022). Cost Management for Information and Communication Technology Projects. Journal of Engineering, Project, and Production Management, 12(2), 166–178. https://doi.org/10.32738/jeppm-2022-0015
- Abu Aisheh, Y. I. (2021). Lessons Learned, Barriers, and Improvement Factors for Mega Building Construction Projects in Developing Countries: Review Study. Sustainability, 13(19), 10678. https://doi.org/10.3390/su131910678
- Aghimien, D. O., Adegbembo, T. F., Aghimien, E. I., et al. (2018). Challenges of Sustainable Construction: A Study of Educational Buildings in Nigeria. International Journal of Built Environment and Sustainability, 5(1). https://doi.org/10.11113/ijbes.v5.n1.244
- Ahlawat, K., Chug, A., & Singh, A. P. (2021). An Insight on the Class Imbalance Problem and Its Solutions in Big Data. In: Large-Scale Data Streaming, Processing, and Blockchain Security. IGI Global.
- Antoniou, F., Aretoulis, G., Giannoulakis, D., et al. (2023). Cost and Material Quantities Prediction Models for the Construction of Underground Metro Stations. Buildings, 13(2), 382. https://doi.org/10.3390/buildings13020382
- Arena, F., Collotta, M., Luca, L., et al. (2021). Predictive Maintenance in the Automotive Sector: A Literature Review. Mathematical and Computational Applications, 27(1), 2. https://doi.org/10.3390/mca27010002
- Berry, M. J., Linoff, G.S. Data Mining Techniques: For Marketing, Sales, and Customer Support, 2nd ed. John Willey & Sons, Canada.
- Bilal, M., Oyedele, L. O., Qadir, J., et al. (2016). Big Data in the construction industry: A review of present status, opportunities, and future trends. Advanced Engineering Informatics, 30(3), 500–521. https://doi.org/10.1016/j.aei.2016.07.001
- Borovskikh, O., Evstafieva, A., & Marfina, L. (2021). Cost management of a construction company based on functional cost analysis. E3S Web of Conferences, 274, 05003. https://doi.org/10.1051/e3sconf/202127405003
- Boujnouni, M. E. (2022). A study and identification of COVID-19 viruses using N-grams with Naïve Bayes, K-Nearest Neighbors, Artificial Neural Networks, Decision tree and Support Vector Machine. 2022 International Conference on Intelligent Systems and Computer Vision (ISCV). https://doi.org/10.1109/iscv54655.2022.9806081
- Chaitongrat, T. (2021). Causal relationship model of problems in public sector procurement. International Journal of GEOMATE, 20(80). https://doi.org/10.21660/2021.80.6266
- Chamidah, N., Santoni, M. M., & Matondang, N. (2020). The effect of oversampling on the classification of hypertension with the naive bayes algorithm, decision tree, and artificial neural network. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 4(4), 635–641. https://doi.org/10.29207/resti.v4i4.2015
- Chen, F., Deng, P., Wan, J., et al. (2015). Data mining for the internet of things: literature review and challenges, International Journal of Distributed Sensor Networks, 11(8), 431047. https://doi.org/10.1155/ 2015/431047
- Chen, W. T., Merrett, H. C., Lu, S. T., et al. (2019). Analysis of Key Failure Factors in Construction Partnering—A Case Study of Taiwan. Sustainability, 11(14), 3994. https://doi.org/10.3390/su11143994
- Dlamini, M., & Cumberlege, R. (2021). The impact of cost overruns and delays in the construction business. IOP Conference Series: Earth and Environmental Science, 654(1), 012029. https://doi.org/10.1088/1755-1315/654/1/012029
- Fan, C., Xiao, F., Li, Z., et al. (2018). Unsupervised data analytics in mining big building operational data for energy efficiency enhancement: A review. Energy and Buildings, 159, 296–308. https://doi.org/10.1016/j.enbuild.2017.11.008
- Faten Albtoush, A. M., Doh, S. I., Abdul Rahman, A. R. B., & Albtoush, J. F. A. A. (2020). Factors effecting the cost management in construction projects. International Journal of Civil Engineering and Technology, 11(1).
- Gündüz, M., Nielsen, Y., & Özdemir, M. (2013). Quantification of delay factors using the relative importance index method for construction projects in Turkey. Journal of management in engineering, 29(2), 133–139.
- Gyadu-Asiedu, W., Ampadu-Asiamah, A., & Fokuo-Kusi, A. (2021). A framework for systemic sustainable construction industry development (SSCID). Discover Sustainability, 2(1). https://doi.org/10.1007/s43621-021-00033-y
- Hansen, D. R., Mowen, M. M., & Heitger, D. L. (2021). Cost management. Cengage Learning.
- Holm, L., & Schaufelberger, J. E. (2021). Construction cost estimating. Routledge.
- Hoseini, E., Van Veen, P., Bosch-Rekveldt, M., & Hertogh, M. (2020). Cost performance and cost contingency during project execution: Comparing client and contractor perspectives. Journal of Management in Engineering, 36(4), 05020006.
- Hu, Y.-X., Huai, L.-B., & Cui, R.-Y. (2019). Research on Teaching Evaluation Model Based on Weighted Naive Bayes. 2019 10th International Conference on Information Technology in Medicine and Education (ITME). https://doi.org/10.1109/itme.2019.00112
- Hui, S.C., Jha, G. Data mining for customer service support. Information & Management, 38(1), 1–13. https://doi.org/10.1016/S0378-7206(00)00051-3
- Katarya, R., & Srinivas, P. (2020). Predicting Heart Disease at Early Stages using Machine Learning: A Survey. 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC). https://doi.org/10.1109/icesc48915.2020.9155586
- Kim, S., & Lee, H. (2022). Customer Churn Prediction in Influencer Commerce: An Application of Decision Trees. Procedia Computer Science, 199, 1332–1339. https://doi.org/10.1016/j.procs.2022.01.169
- Kusonkhum, W., Srinavin, K., Leungbootnak, N., et al. (2022). Using a Machine Learning Approach to Predict the Thailand Underground Train’s Passenger. Journal of Advanced Transportation, 2022, 1–15. https://doi.org/10.1155/2022/8789067
- Larson, E. W., Gray, C. F. (2013). Project management: the managerial process. McGraw Hill Professional.
- László, K., & Ghous, H. (2020). Efficiency comparison of Python and RapidMiner. Multidiszciplináris Tudományok, 10(3), 212–220.
- Liu, P., Qingqing, W., & Liu, W. (2021). Enterprise human resource management platform based on FPGA and data mining. Microprocessors and Microsystems, 80, 103330. https://doi.org/10.1016/j.micpro.2020.103330
- Marzukhi, S., Awang, N., Alsagoff, S. N., et al. (2021). RapidMiner and Machine Learning Techniques for Classifying Aircraft Data. Journal of Physics: Conference Series, 1997(1), 012012. https://doi.org/10.1088/1742-6596/1997/1/012012
- Mohd, H. N. N., & Shamsul, S. (2011). Critical success factors for software projects: A comparative study. Scientific Research and Essays, 6(10), 2174–2186. https://doi.org/10.5897/sre10.1171
- Monteiro, F. P., Sousa, V., Meireles, I., et al. (2021). Cost Modeling from the Contractor Perspective: Application to Residential and Office Buildings. Buildings, 11(11), 529. https://doi.org/10.3390/buildings11110529
- Mumali, F. (2022). Artificial neural network-based decision support systems in manufacturing processes: A systematic literature review. Computers & Industrial Engineering, 165, 107964. https://doi.org/10.1016/j.cie.2022.107964
- Pertiwi, M. W., Kusmira, M., Rezkiani, R., et al. (2022). Naïve Bayes Classification Model for the Producer Price Index Prediction. SISTEMASI, 11(1), 171. https://doi.org/10.32520/stmsi.v11i1.1669
- Plebankiewicz, E. (2018). Model of Predicting Cost Overrun in Construction Projects. Sustainability, 10(12), 4387. https://doi.org/10.3390/su10124387
- Raj, P. V., Teja, P. S., Siddhartha, K. S., & Rama, J. K. (2021). Housing with low-cost materials and techniques for a sustainable construction in India-A review. Materials Today: Proceedings, 43, 1850–1855.
- Shehadeh, A., Alshboul, O., Al Mamlook, R. E., & Hamedat, O. (2021). Machine learning models for predicting the residual value of heavy construction equipment: An evaluation of modified decision tree, LightGBM, and XGBoost regression. Automation in Construction, 129, 103827.
- Sinsom, B., S. (2019). An efficiency comparison in prediction of imbalanced data classification with data mining techniques. ai Journal of Science and Technology, 8(3), 383–393.
- Soibelman, L., Kim, H. Data preparation process for construction knowledge generation through knowledge discovery in databases. Journal of Computing in Civil Engineering, 16(1), 39–48. https://doi.org/10.1061/(ASCE)0887-3801(2002)16:1(39)
- Srinavin, K., Kusonkhum, W., Chonpitakwong, B., et al. (2021). Readiness of Applying Big Data Technology for Construction Management in Thai Public Sector. Journal of Advances in Information Technology, 12(1), 1–5. https://doi.org/10.12720/jait.12.1.1-5
- Sweis, G., Sweis, R., Abu Hammad, A., et al. (2008). Delays in construction projects: The case of Jordan. International Journal of Project Management, 26(6), 665–674. https://doi.org/10.1016/j.ijproman.2007.09.009
- Trost SM, Oberlender GD. (2003). Predicting accuracy of early cost estimates using factor analysis and multivariate regression. Journal of Construction Engineering and Management, 129(2), 198–204.
- Witten, I. H., Frank, E., Hall, M. A., Pal, C. J. (2016). Data Mining: Practical Machine Learning Tools and Techniques, 4th ed. Morgan Kaufmann.
- Xu, H., Chen, X., Li, P., Ding, J., & Eghan, C. (2019). A Novel RFID Data Management Model Based on Quantum Cryptography. In: Proceedings of the Third International Congress on Information and Communication Technology: ICICT 2018.
- You, Z., & Wu, C. (2019). A framework for data-driven informatization of the construction company. Advanced Engineering Informatics, 39, 269–277. https://doi.org/10.1016/j.aei.2019.02.002
- Yu, Z., Haghighat, F., Fung, B.C.M. (2016). Advances and challenges in building engineering and data mining applications for energy-efficient communities. Sustainable Cities and Society, 25, 33–38.
- Zeydalinejad, N. (2022). Artificial neural networks vis-à-vis MODFLOW in the simulation of groundwater: a review. Modeling Earth Systems and Environment, 8(3), 2911–2932. https://doi.org/10.1007/s40808-022-01365-y
DOI: https://doi.org/10.24294/jipd.v8i5.2801
Refbacks
- There are currently no refbacks.
Copyright (c) 2024 Tanayut Chaitongrat, Kridtsada Janthachai, Wuttipong Kusonkhum, Paranee Boonchai, M. Faisi Ikhwali, Mathinee Khotdee
License URL: https://creativecommons.org/licenses/by/4.0/
This site is licensed under a Creative Commons Attribution 4.0 International License.