Geographically weighted machine learning for predicting the spatial distribution of groundwater nitrate nitrogen (NO3-N) concentration

  • Younghun Lee
  • , Changhyun Kim
  • , Hyemin Jeong
  • , Dongho Kim
  • , Byeongwon Lee
  • , Taeseung Park
  • , Seongyun Kim
  • , Dongjin Jeon
  • , Jongho Ahn
  • , Jai Young Lee
  • , Yoonkyung Cha
  • , Sangchul Lee

Research output: Contribution to journalArticlepeer-review

Abstract

Study region: This study was conducted on Jeju Island, South Korea, where groundwater serves as a critical water resource and is highly susceptible to contamination from both natural and anthropogenic influences. Study focus: This study aims to predict the spatial distribution of groundwater nitrate nitrogen (NO3-N) concentrations using geographically weighted random forest (GWRF). The predictive performance and robustness of the GWRF were compared against five conventional machine learning models (CMLMs). Shapley Additive Explanations (SHAP) analysis was employed to quantify the influence of key input variables, categorized into land surface, geological, and anthropogenic factors, on the model predictions. New hydrological insights for the region: The result showed the superior performance of the GWRF in predicting the spatial distribution of groundwater NO3-N on Jeju Island. Compared to CMLMs, the GWRF provided more accurate and spatially unbiased predictions due to consideration of spatial variability. The SHAP analysis indicated that key factors influencing groundwater NO3-N were average elevation, proportion of heavy clay fields and agricultural areas, and urban areas. These results demonstrated the potential of combining geographically weighted structures with machine learning models in groundwater modeling using geospatial data. Consequently, the findings from this study would help to develop targeted management strategies to mitigate groundwater NO3-N pollution.

Original languageEnglish
JournalJournal of Hydrology: Regional Studies
Volume62
DOIs
StatePublished - Dec 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities
  2. SDG 15 - Life on Land
    SDG 15 Life on Land

Keywords

  • Conventional machine learning model (CMLM)
  • Geospatially weighted machine learning model (GWMLM)
  • Groundwater quality
  • Spatial variability

Fingerprint

Dive into the research topics of 'Geographically weighted machine learning for predicting the spatial distribution of groundwater nitrate nitrogen (NO3-N) concentration'. Together they form a unique fingerprint.

Cite this