Optimized machine learning models for water quality prediction: Integrating support vector machines and random forest through nonlinear functional analysis


Abstract views: 6 / PDF downloads: 1

Authors

  • G. Baskar PhD Scholar, Lincoln University College, Malaysia & KPR College of Arts Science and Research Coimbatore, Tamil Nadu
  • Midhunchakkaravarthy Professor Lincoln University College, Malaysia
  • Shakir Khan Professor, University Centre for Research and Development, Chandigarh University, Mohali 140413, India and College of Computer and Information Sciences & Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia
  • Otabek Narmanov PhD, Associate Professor, Algorithms and Mathematical Modeling, Tashkent University of Information Technologies, Tashkent, Uzbekistan
  • O’tkir Qalandarov PhD, Department of Higher Mathematics, Tashkent University of Information Technologies named after Muhammad Al-Khwarizmi, Uzbekistan
  • Manzura Irisbayeva PhD, Associate Professor, Department of Psychology and Humanities, Economic and Pedagogical University, Karshi
  • Isayev Fakhriddin Termez University of Economics and Service, Uzbekistan & Scientific Research Center Scientific Foundations and Problems of the Development of the Economy of Uzbekistan under Tashkent State University of Economics, Tashkent, Uzbekistan

Keywords:

water quality prediction, nonlinear functional analysis, random forest, support vector machine, environmental monitoring, hybrid machine learning model

Abstract

It is imperative to predict water quality accurately to monitor the environment, human health, and smart water management. Conventional empirical evaluation techniques are inadequate in the description of nonlinear relationships of physicochemical parameters (pH, dissolved oxygen, turbidity, nitrate concentration, and conductivity). This paper hypothesizes a streamlined hybrid machine learning model, which combines Support Vector Machines (SVM) with Random Forest (RF) with nonlinear functional analysis to add predictive accuracy. The model uses nonlinear mapping of kernels, ranking of the importance of variables and functional decomposition to approximate interactions involving complex parameters. It also introduces a multi-stage optimization process that incorporates grid search and cross-validation with nonlinear functional transformation in order to find the best hyperparameters to use in SVM and RF models. Multiyear datasets (collected at freshwater sources) were experimented, and it was found that predictive accuracy improved significantly, with the hybrid model showing that the RMSE was 14.2% lower, and the Pearson correlation coefficient was 9.1 times higher than baseline ML models. The analysis of feature sensitivity and functional interaction demonstrates that nutrient load and dissolved oxygen have a strong nonlinear relationship, which confirms the potential of the proposed framework to represent the ecological relationships. These results indicate that nonlinear functional analysis can allow more consistent and interpretable machine learning models to predict water quality to support the environmental monitoring system in a sustainable way.

References

Abdullah, A., Chaturvedi, H., Fuladi, S., Ravuri, N. J., Deepa, N., & Nallakaruppan, M. K. (2023). Reliable and Efficient Model for Water Quality Prediction and Forecasting. International Journal of Advanced Computer Science and Applications, 14(12). https://doi.org/10.14569/ijacsa.2023.0141219

Abdullah, D. (2024). Integrating biophilic design in smart urban infrastructure for environmental and public health outcomes. Journal of Smart Infrastructure and Environmental Sustainability, 1(1), 14–22.

Akselrud, C. I. A. (2024). Random forest regression models in ecology: Accounting for messy biological data and producing predictions with uncertainty. Fisheries Research, 280, 107161. https://doi.org/10.1016/j.fishres.2024.107161

Almadani, M., & Kheimi, M. (2022). Stacking Artificial Intelligence Models for Predicting Water Quality Parameters in Rivers. Journal of Ecological Engineering, 24(2), 152. https://doi.org/10.12911/22998993/156663

Binetti, M. S., Massarelli, C., & Uricchio, V. F. (2024). Machine Learning in Geosciences: A Review of Complex Environmental Monitoring Applications [Review of Machine Learning in Geosciences: A Review of Complex Environmental Monitoring Applications]. Machine Learning and Knowledge Extraction, 6(2), 1263. Multidisciplinary Digital Publishing Institute. https://doi.org/10.3390/make6020059

Chen, W., Xu, D., Pan, B., Zhao, Y., & Song, Y. (2024). Machine Learning-Based Water Quality Classification Assessment. Water, 16(20), 2951. https://doi.org/10.3390/w16202951

Choudhary, A., Kumar, P., Pradhan, C., Sahu, S. K., Chaudhary, S. K., Joshi, P. K., Pandey, D. N., Prakash, D.,

& Mohanty, A. (2023). Evaluating air quality and criteria pollutants prediction disparities by data mining along a stretch of urban-rural agglomeration includes coal-mine belts and thermal power plants. Frontiers in Environmental Science, 11. https://doi.org/10.3389/fenvs.2023.1132159

Das, A. (2024). An innovative approach for quality assessment and its contamination on surface water for drinking purpose in Mahanadi River Basin, Odisha of India, with the integration of BA-WQI, AHP-TOPSIS, FL-DWQI, MOORA, and RF methodology. Applied Water Science, 14(12). https://doi.org/10.1007/s13201-024-02326-9

Helaly, M. A., Rady, S., Mabrouk, M., Aref, M., Villarroya, S., Cotos, J. M., & Mera, D. (2025). Advancements in water quality prediction: a practical review of machine learning and deep learning approaches. Cluster Computing, 28(9). https://doi.org/10.1007/s10586-025-05221-3

Islam, M. K. B., Newton, M. A. H., Trevathan, J., & Sattar, A. (2024). Lite approaches for long-range multi-step water quality prediction. Stochastic Environmental Research and Risk Assessment, 38(10), 3755. https://doi.org/10.1007/s00477-024-02770-8

Kumar, K. V., Kumar, Y., Godla, S. R., Ansari, M. S. A., El–Ebiary, Y. A. B., & Elangovan, M. (2024). Enhancing Water Quality Forecasting Reliability Through Optimal Parameterization of Neuro-Fuzzy Models via Tunicate Swarm Optimization. International Journal of Advanced Computer Science and Applications, 15(3). https://doi.org/10.14569/ijacsa.2024.01503110

Milà, C., Ludwig, M., Pebesma, E., Tonne, C., & Meyer, H. (2024). Random forests with spatial proxies for environmental modelling: opportunities and pitfalls. Geoscientific Model Development, 17(15), 6007. https://doi.org/10.5194/gmd-17-6007-2024

Poornimadarshini, S. (2025). Mathematical modeling of rotor dynamics in high-speed electric motors for aerospace applications. Journal of Applied Mathematical Models in Engineering, 1(1), 33–43.

Rahman, F., & Prabhakar, C. P. (2023). IoT-integrated precision irrigation systems for enhancing water use efficiency in semi-arid agro-ecosystems. National Journal of Smart Agriculture and Rural Innovation, 1(1).

Si, B., Liu, X., Guo, K., & Rastegarnia, A. (2025). Investigating factors affecting the quality of water resources by multivariate analysis and soft computing approaches. Scientific Reports, 15(1). https://doi.org/10.1038/s41598-025-13380-x

Uken, E., & Getachew, B. (2023). IoT-enabled smart aquaculture monitoring system for energy-efficient water quality management. National Journal of Smart Fisheries and Aquaculture Innovation, 1(1).

Usikalua, M. R., & Unciano, N. (2025). Mathematical modeling of epidemic dynamics: Integrating public health and data science. Bridge: Journal of Multidisciplinary Explorations, 1(1), 11–22.

Utama, W., Indriani, R. F., Hermana, M., Anjasmara, I. M., Garini, S. A., & Putra, D. P. N. (2024). Towards Improving Sustainable Water Management in Geothermal Fields: SVM and RF Land Use Monitoring. Journal of Human Earth and Future, 5(2), 216. https://doi.org/10.28991/hef-2024-05-02-06

Downloads

Published

2026-01-14

How to Cite

G. Baskar, Midhunchakkaravarthy, Shakir Khan, Otabek Narmanov, O’tkir Qalandarov, Manzura Irisbayeva, & Isayev Fakhriddin. (2026). Optimized machine learning models for water quality prediction: Integrating support vector machines and random forest through nonlinear functional analysis. Results in Nonlinear Analysis, 8(4), 24–34. Retrieved from https://www.nonlinear-analysis.com/index.php/pub/article/view/809

Most read articles by the same author(s)