Abstract
Consumers evaluate water quality mostly on the basis of its aesthetically acceptable qualities, which include flavor, odor, and appearance. The aesthetic acceptability of water is greatly impacted by the seven heavy metal characteristics: calcium, sodium, potassium, sulfate, magnesium, total hardness, and chloride. These parameters, in turn, have a profound effect on variables such as flavor and odor. The present paper attempts to enhance the level of prediction regarding water quality with the aid of machine learning algorithms. Accordingly, Extra Tree Classification (ETC) was applied to the projection of water quality parameters. In addition to this base model, the Subtraction-Average-Based Optimization and the Horned Lizard Optimization Algorithm were utilized to optimize its level of prediction accuracy. This strategic integration resulted in new hybrid model types. The ETC model, in combination with SABO, is the ETSA framework, which, with HLOA, integrates into the ETHL framework. During Testing for the Accuracy metric, the highest score was 0.986 from the ETHL model, while the ETSA model attained 0.972 and yielded the second best. In the Testing phase, concerning the Precision metric, the best performing model was, once again, the ETHL model with 0.986, while the ETC model developed the weakest performance with a value of 0.970. Future research could enhance water quality prediction through advanced hybrid models, real-time monitoring, deep learning, and validation across diverse water bodies and parameters.
Funding source: This work was supported by Hebei Province Key R&D Program Self Funded Project.
Acknowledgments
I would like to take this opportunity to acknowledge that there are no individuals or organizations that require acknowledgment for their contributions to this work.
-
Research ethics: Research involving Human Participants and Animals: The observational study conducted on medical staff needs no ethical code. Therefore, the above study was not required to acquire ethical code.
-
Informed consent: This option is not neccessary due to that the data were collected from the references.
-
Use of Large Language Models, AI and Machine Learning Tools: During the preparation of this work, the authors used Large Language Models, AI, and Machine Learning tools for tasks such as language refinement, data analysis, or figure generation, with all outputs being reviewed and validated by the authors to ensure accuracy and originality. After using these tools/services, the authors reviewed and edited the content and take full responsibility for the content of the published article.
-
Author contributions: The author contributed to the study’s conception and design. Data collection, simulation and analysis were performed by “Fengmin Cheng”. Also the first draft of the manuscript was written by Fengmin Cheng commented on previous versions of the manuscript.
-
Conflict of interest: The author declare no competing of interests.
-
Research funding: This work was supported by Hebei Province Key R&D Program Self Funded Project.
-
Data availability: The author do not have permissions to share data.
References
1. Fernández, N, Ramírez, A, Solano, F. Physico-chemical water quality indices-a comparative review. BISTUA: Revista de la Facultad de Ciencias Básicas 2004;2:19–30.Search in Google Scholar
2. Abbasi, T, Abbasi, SA. Water quality indices based on bioassessment: the biotic indices. J Water Health 2011;9:330–48. https://doi.org/10.2166/wh.2011.133.Search in Google Scholar PubMed
3. Ezeilo, FE, Oba, KM. gyauarioN OF WATER euary INDEX OF ORASHI gvg. Int J Environ 2016;12:60–75.Search in Google Scholar
4. Uddin, MG, Nash, S, Olbert, AI. A review of water quality index models and their use for assessing surface water quality. Ecol Indic 2021;122:107218. https://doi.org/10.1016/j.ecolind.2020.107218.Search in Google Scholar
5. Paepae, T, Bokoro, PN, Kyamakya, K. From fully physical to virtual sensing for water quality assessment: a comprehensive review of the relevant state-of-the-art. Sensors 2021;21:6971. https://doi.org/10.3390/s21216971.Search in Google Scholar PubMed PubMed Central
6. Thakur, A, Devi, P. A comprehensive review on water quality monitoring devices: materials advances, current status, and future perspective. Crit Rev Anal Chem 2024;54:193–218. https://doi.org/10.1080/10408347.2022.2070838.Search in Google Scholar PubMed
7. Tung, TM, Yaseen, ZM. A survey on river water quality modelling using artificial intelligence models: 2000–2020. J Hydrol (Amst) 2020;585:124670. https://doi.org/10.1016/j.jhydrol.2020.124670.Search in Google Scholar
8. Kadlec, P, Gabrys, B, Strandt, S. Data-driven soft sensors in the process industry. Comput Chem Eng 2009;33:795–814. https://doi.org/10.1016/j.compchemeng.2008.12.012.Search in Google Scholar
9. Danandeh Mehr, A, Marttila, H, Haghighi, AT, Croghan, D, Fathollahzadeh Attar, N. GTAR: a new ensemble evolutionary autoregressive approach to model dissolved organic carbon. AQUA – Water Infrastruct Ecosyst Soc 2023;72:381–94.10.2166/aqua.2023.235Search in Google Scholar
10. Huang, J, Liu, S, Hassan, SG, Xu, L, Huang, C. A hybrid model for short-term dissolved oxygen content prediction. Comput Electron Agric 2021;186:106216. https://doi.org/10.1016/j.compag.2021.106216.Search in Google Scholar
11. Jouanneau, S, Recoules, L, Durand, M, Boukabache, A, Picot, V, Primault, Y, et al.. Methods for assessing biochemical oxygen demand (BOD): a review. Water Res 2014;49:62–82. https://doi.org/10.1016/j.watres.2013.10.066.Search in Google Scholar PubMed
12. Zhi, W, Feng, D, Tsai, WP, Sterle, G, Harpold, A, Shen, C, et al.. From hydrometeorology to river water quality: can a deep learning model predict dissolved oxygen at the continental scale? Environ Sci Technol 2021;55:2357–68. https://doi.org/10.1021/acs.est.0c06783.Search in Google Scholar PubMed
13. Özdemi̇r, S, Celebi, A, Dede, G, Maghrebi, M, Danandeh Mehr, A. Impact of land use change on lake pollution dynamics: a case study of Sapanca lake, Turkey. Water (Basel) 2025;17:1–12.10.3390/w17020182Search in Google Scholar
14. Jones, AS, Stevens, DK, Horsburgh, JS, Mesner, NO. Surrogate measures for providing high frequency estimates of total suspended solids and total phosphorus concentrations 1. JAWRA J Am Water Resour Assoc 2011;47:239–53. https://doi.org/10.1111/j.1752-1688.2010.00505.x.Search in Google Scholar
15. He, M, Wu, S, Huang, B, Kang, C, Gui, F. Prediction of total nitrogen and phosphorus in surface water by deep learning methods based on multi-scale feature extraction. Water (Basel) 2022;14:1643. https://doi.org/10.3390/w14101643.Search in Google Scholar
16. Zhu, M, Wang, J, Yang, X, Zhang, Y, Zhang, L, Ren, H, et al.. A review of the application of machine learning in water quality evaluation. Eco-Environ Health 2022;1:107–16. https://doi.org/10.1016/j.eehl.2022.06.001.Search in Google Scholar PubMed PubMed Central
17. Bilali, AEL, Taleb, A, Mazigh, N, Mokhliss, M. Prediction of chemical water quality used for drinking purposes based on artificial neural networks. Moroc J Chem 2020;8.Search in Google Scholar
18. Haghiabi, AH, Nasrolahi, AH, Parsaie, A. Water quality prediction using machine learning methods. Water Qual Res J 2018;53:3–13. https://doi.org/10.2166/wqrj.2018.025.Search in Google Scholar
19. Alizamir, M, Heddam, S, Kim, S, Mehr, AD. On the implementation of a novel data-intelligence model based on extreme learning machine optimized by bat algorithm for estimating daily chlorophyll-a concentration: case studies of river and lake in USA. J Clean Prod 2021;285:124868. https://doi.org/10.1016/j.jclepro.2020.124868.Search in Google Scholar
20. Prasad, DVV, Venkataramana, LY, Kumar, PS, Prasannamedha, G, Harshana, S, Srividya, SJ, et al.. Analysis and prediction of water quality using deep learning and auto deep learning techniques. Sci Total Environ 2022;821:153311. https://doi.org/10.1016/j.scitotenv.2022.153311.Search in Google Scholar PubMed
21. Antanasijević, D, Pocajt, V, Perić-Grujić, A, Ristić, M. Modelling of dissolved oxygen in the Danube River using artificial neural networks and Monte Carlo Simulation uncertainty analysis. J Hydrol (Amst) 2014;519:1895–907.10.1016/j.jhydrol.2014.10.009Search in Google Scholar
22. Antanasijević, D, Pocajt, V, Povrenović, D, Perić-Grujić, A, Ristić, M. Modelling of dissolved oxygen content using artificial neural networks: Danube River, North Serbia, case study. Environ Sci Pollut Control Ser 2013;20:9006–13. https://doi.org/10.1007/s11356-013-1876-6.Search in Google Scholar PubMed
23. Tomić, AŠ, Antanasijević, D, Ristić, M, Perić-Grujić, A, Pocajt, V. A linear and non-linear polynomial neural network modeling of dissolved oxygen content in surface water: inter-and extrapolation performance with inputs’ significance analysis. Sci Total Environ 2018;610:1038–46.10.1016/j.scitotenv.2017.08.192Search in Google Scholar PubMed
24. Zounemat-Kermani, M, Seo, Y, Kim, S, Ghorbani, MA, Samadianfard, S, Naghshara, S, et al.. Can decomposition approaches always enhance soft computing models? Predicting the dissolved oxygen concentration in the St. Johns River, Florida. Appl Sci 2019;9:2534. https://doi.org/10.3390/app9122534.Search in Google Scholar
25. Ma, J, Ding, Y, Cheng, JCP, Jiang, F, Xu, Z. Soft detection of 5-day BOD with sparse matrix in city harbor water using deep learning techniques. Water Res 2020;170:115350. https://doi.org/10.1016/j.watres.2019.115350.Search in Google Scholar PubMed
26. Liu, M, Lu, J. Support vector machine – an alternative to artificial neuron network for water quality forecasting in an agricultural nonpoint source polluted river? Environ Sci Pollut Control Ser 2014;21:11036–53. https://doi.org/10.1007/s11356-014-3046-x.Search in Google Scholar PubMed
27. Lu, H, Ma, X. Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere 2020;249:126169. https://doi.org/10.1016/j.chemosphere.2020.126169.Search in Google Scholar PubMed
28. Geurts, P, Ernst, D, Wehenkel, L. Extremely randomized trees. Mach Learn 2006;63:3–42. https://doi.org/10.1007/s10994-006-6226-1.Search in Google Scholar
29. Sharaff, A, Gupta, H. Extra-tree classifier with metaheuristics approach for email classification. In: Advances in computer communication and computational sciences: proceedings of IC4S 2018. Springer; 2019:189–97 pp.10.1007/978-981-13-6861-5_17Search in Google Scholar
30. Peng, L, Yuan, R, Shen, L, Gao, P, Zhou, L. LPI-EnEDT: an ensemble framework with extra tree and decision tree classifiers for imbalanced lncRNA-protein interaction data classification. BioData Min 2021;14:1–22. https://doi.org/10.1186/s13040-021-00277-4.Search in Google Scholar PubMed PubMed Central
31. Baby, D, Devaraj, SJ, Hemanth, J. Leukocyte classification based on feature selection using extra trees classifier: atransfer learning approach. Turk J Electr Eng Comput Sci 2021;29:2742–57. https://doi.org/10.3906/elk-2104-183.Search in Google Scholar
32. Patel, AM, Suthar, A. Adaboosted extra trees classifier for object-based multispectral image classification of urban fringe area. Int J Image Graph 2022;22:2140006. https://doi.org/10.1142/s0219467821400064.Search in Google Scholar
33. Trojovský, P, Dehghani, M. Subtraction-average-based optimizer: a new swarm-inspired metaheuristic algorithm for solving optimization problems. Biomimetics 2023;8:149. https://doi.org/10.3390/biomimetics8020149.Search in Google Scholar PubMed PubMed Central
34. Cui, X, Zhu, J, Jia, L, Wang, J. A novel multi-time scale heat load prediction model for district heating system: hybrid subtraction average based optimizer (Sabo) and Cnn-Bilstm model with attention mechanism. Amsterdam, Netherlands: Elsevier BV; 2024.10.2139/ssrn.4846731Search in Google Scholar
35. Smaili, IH, Almalawi, DR, Shaheen, AM, Mansour, HSE. Optimizing PV sources and shunt capacitors for energy efficiency improvement in distribution systems using subtraction-average algorithm. Mathematics 2024;12:625. https://doi.org/10.3390/math12050625.Search in Google Scholar
36. Wang, Y, He, X, Liu, Q, Razmjooy, S. Economic and technical analysis of an off-grid wind/PV/fuel cells hybrid renewable energy system using an improved subtraction-average-based optimizer. Heliyon 2024;10:1–22. https://doi.org/10.1016/j.heliyon.2024.e32712.Search in Google Scholar PubMed PubMed Central
37. Mai, C, Zhang, L, Hu, X. An adaptive snake optimization algorithm incorporating Subtraction-Average-Based Optimizer for photovoltaic cell parameter identification. Heliyon 2024;10:1–30. https://doi.org/10.1016/j.heliyon.2024.e35382.Search in Google Scholar PubMed PubMed Central
38. Liu, C, Xie, Y, Liu, S, Qin, J, Wei, J, Fang, H, et al.. An enhanced Horned Lizard Optimization Algorithm for flood control operation of cascade reservoirs. Water 2024;16. https://doi.org/10.3390/w16172458.Search in Google Scholar
39. Peraza-Vázquez, H, Peña-Delgado, A, Merino-Treviño, M, Morales-Cepeda, AB, Sinha, N. A novel metaheuristic inspired by horned lizard defense tactics. Artif Intell Rev 2024;57:59. https://doi.org/10.1007/s10462-023-10653-7.Search in Google Scholar
40. Leaché, AD, McGuire, JA. Phylogenetic relationships of horned lizards (Phrynosoma) based on nuclear and mitochondrial data: evidence for a misleading mitochondrial gene tree. Mol Phylogenet Evol 2006;39:628–44. https://doi.org/10.1016/j.ympev.2005.12.016.Search in Google Scholar PubMed
41. Stevens, M, Merilaita, S. Animal camouflage: mechanisms and function. Cambridge, England: Cambridge University Press; 2011.10.1017/CBO9780511852053Search in Google Scholar
42. Ruxton, GD, Sherratt, TN, Speed, MP. The evolutionary ecology of crypsis, warning signals and mimicry. Oxford, England: Oxford University Press; 2004.Search in Google Scholar
43. Cooper, WEJr, Sherbrooke, WC. Plesiomorphic escape decisions in cryptic horned lizards (Phrynosoma) having highly derived antipredatory defenses. Ethology 2010;116:920–8. https://doi.org/10.1111/j.1439-0310.2010.01805.x.Search in Google Scholar
44. Sherbrooke, WC, Middendorf, GAIII. Blood-squirting variability in horned lizards (Phrynosoma). Copeia 2001;2001:1114–22. https://doi.org/10.1643/0045-8511(2001)001[1114:bsvihl]2.0.co;2.10.1643/0045-8511(2001)001[1114:BSVIHL]2.0.CO;2Search in Google Scholar
45. Lara-Reséndiz, RA, Arenas-Moreno, DM, Beltrán-Sánchez, E, Gramajo, W, Verdugo-Molina, J, Sherbrooke, WC, et al.. Selected body temperature of nine species of Mexican horned lizards (Phrynosoma). Rev Mex Biodivers 2015;86:275–8. https://doi.org/10.7550/rmb.48028.Search in Google Scholar
46. Grigg, JW, Buckley, LB. Conservatism of lizard thermal tolerances and body temperatures across evolutionary history and geography. Biol Lett 2013;9:20121056. https://doi.org/10.1098/rsbl.2012.1056.Search in Google Scholar PubMed PubMed Central
47. Westland, S, Ripamonti, C, Cheung, V. Computational colour science using MATLAB. Hoboken, New Jersey, USA: John Wiley & Sons; 2012.10.1002/9780470710890Search in Google Scholar
48. Niall, KK. Erwin schrödinger’s color theory. Cham, Switzerland: Springer; 2017.10.1007/978-3-319-64621-3Search in Google Scholar
49. Abbas, F, Cai, Z, Shoaib, M, Iqbal, J, Ismail, M, Arifullah, et al.. Machine learning models for water quality prediction: a comprehensive analysis and uncertainty assessment in Mirpurkhas, Sindh, Pakistan. Water (Basel) 2024;16:941. https://doi.org/10.3390/w16070941.Search in Google Scholar
50. Chen, W, Xu, D, Pan, B, Zhao, Y, Song, Y. Machine learning-based water quality classification assessment. Water (Basel) 2024;16:2951. https://doi.org/10.3390/w16202951.Search in Google Scholar
51. Shams, MY, Elshewey, AM, El-Kenawy, E-SM, Ibrahim, A, Talaat, FM, Tarek, Z. Water quality prediction using machine learning models based on grid search method. Multimed Tool Appl 2024;83:35307–34. https://doi.org/10.1007/s11042-023-16737-4.Search in Google Scholar
© 2025 Walter de Gruyter GmbH, Berlin/Boston