GEOSPATIAL MODELING OF GROUNDWATER POTENTIAL USING XGBoost, SHAP AND REMOTE SENSING DATA: A CASE STUDY OF THE BOKEYORDA DISTRICT OF WEST KAZAKHSTAN

Authors

  • Onlassynov Zhuldyzbek Alikhanuly Akhmedsafin Institute of Hydrogeology and Environmental Geoscience, Satbayev University
  • Shagarova Lyudmila Valentinovna Omsk Scientific Center, Siberian Branch of the Russian Academy of Sciences
  • Absametov Malis Kudyssovich Akhmedsafin Institute of Hydrogeology and Environmental Geoscience, Satbayev University
  • Muratova Mira Muratovna Akhmedsafin Institute of Hydrogeology and Environmental Geoscience, Satbayev University

DOI:

https://doi.org/10.52269/SKVC2621177

Keywords:

groundwater potential mapping, machine learning, groundwater, remote sensing, agriculture

Abstract

In this study, a machine learning model for spatially predicting groundwater potential was developed using the XGBoost classifier, trained on 15 consistent geospatial predictors derived from topographic, hydrological, vegetation, and hydrogeological datasets. A total of 202 wells with measured groundwater discharge were used as training samples and converted into categorical groundwater productivity classes. The model integrates raster data from multiple sources, including elevation derived from SRTM, terrain indices (slope, TWI), hydrological distance measures, seasonal vegetation and moisture indices (NDVI, NDWI, BSI), and lithological and hydraulic properties derived from GLHYMPS. The interpretability of the model was assessed using SHAP analysis to quantify the contribution of each predictor to groundwater potential estimates. Furthermore, prediction uncertainty was quantified using the Shannon entropy of class probabilities, which enabled the identification of low-confidence zones associated with hydrogeological transitions. The results show that the model captures significant spatial patterns in groundwater potential, despite moderate prediction accuracy during spatial validation (F1 score ≈ 0.24). High uncertainty values ​​are concentrated in structurally complex and lithologically heterogeneous zones. The proposed framework demonstrates a reproducible and transferable approach to integrating machine learning and geospatial analysis for mapping groundwater potential in data-deficient regions.

Author Biographies

  • Onlassynov Zhuldyzbek Alikhanuly, Akhmedsafin Institute of Hydrogeology and Environmental Geoscience, Satbayev University

    PhD, Head of the Laboratory of GIS Technologies and Remote Sensing

  • Shagarova Lyudmila Valentinovna , Omsk Scientific Center, Siberian Branch of the Russian Academy of Sciences

    Candidate of Technical Sciences, Member of RosHydroGeo, Junior Researcher, Institute of Radiophysics and Physical Electronics

  • Absametov Malis Kudyssovich, Akhmedsafin Institute of Hydrogeology and Environmental Geoscience, Satbayev University

    Doctor of Geological and Mineralogical Sciences, Professor, Director

  • Muratova Mira Muratovna , Akhmedsafin Institute of Hydrogeology and Environmental Geoscience, Satbayev University

    – leading engineer of the laboratory of GIS technologies and remote sensing, Institute of Hydrogeology and Geoecology named after U.M. Akhmetsafina, Satbayev University

Additional Files

Published

2026-07-03