학술논문

Global streamflow modelling using process-informed machine learning
Document Type
article
Source
Journal of Hydroinformatics, Vol 25, Iss 5, Pp 1648-1666 (2023)
Subject
global hydrology
hybrid streamflow modelling
machine learning
post-processing
random forests
Information technology
T58.5-58.64
Environmental technology. Sanitary engineering
TD1-1066
Language
English
ISSN
1464-7141
1465-1734
Abstract
We present a novel hybrid framework that incorporates information from the process-based global hydrological model PCR-GLOBWB, to reduce prediction errors in streamflow simulations. In addition to catchment attributes and meteorological data, our methodology employs simulated streamflow and state variables from PCR-GLOBWB as predictors of observed river discharge. These outputs are used in a random forest, trained on a global database of streamflow measurements, to improve estimates of simulated river discharge across the globe. PCR-GLOBWB was run for the years 1979–2019 at 30 arcmin and its inputs and outputs were upscaled from daily to monthly time steps. A single random forest model was trained with these state variables, meteorological data and catchment attributes, as predictors of observed streamflow at 2,286 stations worldwide. Model performance was evaluated using Kling–Gupta efficiency (KGE). Results based on cross-validation show that the model is capable of discerning between a variety of hydroclimatic conditions and river flow dynamics, improving KGE of PCR-GLOBWB simulations at more than 80% of testing locations and increasing median KGE from −0.03 in uncalibrated runs to 0.51 after post-processing. Performance boosts are usually independent of the availability of streamflow data, making our method a potential candidate in addressing prediction in poorly gauged and ungauged basins. HIGHLIGHTS A hybrid framework for global streamflow modelling is developed, connecting PCR-GLOBWB with random forest.; The framework enables the correction of global-scale streamflow predictions with parsimonious parametrization.; Random forests improve streamflow predictions better when additionally fed with outputs from the hydrological model, as opposed to only using meteorological forcing and catchment attributes.;