Publication Details

Category Text Publication
Reference Category Journals
DOI 10.3389/fenvs.2025.1599320
Licence creative commons licence
Title (Primary) Comprehensive data aleatory uncertainty propagation in regression random forest using a Monte Carlo approach: a struggle with incomplete data provision using a case study on probabilistic soil moisture regionalization
Author Paasche, H.; Dega, S.; Schrön, M.; Dietrich, P. ORCID logo
Source Titel Frontiers in Environmental Science
Year 2025
Department MET
Volume 13
Page From art. 1599320
Language englisch
Topic T5 Future Landscapes
T8 Georesources
Keywords uncertainty propagation; Probabilistic prediction; monte carlo; regression random forest; soil moisture; Aleatory uncertainty; uncertainty quantification; Cosmic-ray neutron sensing
Abstract Data uncertainty never decreases along processing chains and should always be reported alongside processing results. In this study, we attempt to propagate aleatory data uncertainty through a multiple regression analysis to generate regionalized probabilistic soil moisture maps. We employ a nonparametric solution for multiple regression by means of random forests within a Monte Carlo framework. Our input data comprise sparse soil moisture data and spatially dense auxiliary soil and topographic maps, which serve as response and predictor variables in our regression model, respectively. While the methodology is technically straightforward, challenges arise due to incomplete communication of data uncertainty by data providers. This results in knowledge gaps that must be filled by subjective assumptions rather than data-driven insights. We highlight the issues that hinder straightforward uncertainty propagation, ultimately making our final uncertainty quantification of regionalized soil maps an optimistic estimate. Additionally, we sketch how existing uncertainty classification schemes could help data providers deliver quantified uncertainties with their data, enabling users to more accurately assess and report uncertainties in their derived data products.
Persistent UFZ Identifier https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=31095
Paasche, H., Dega, S., Schrön, M., Dietrich, P. (2025):
Comprehensive data aleatory uncertainty propagation in regression random forest using a Monte Carlo approach: a struggle with incomplete data provision using a case study on probabilistic soil moisture regionalization
Front. Environ. Sci. 13 , art. 1599320 10.3389/fenvs.2025.1599320