Details zur Publikation

Kategorie Textpublikation
Referenztyp Zeitschriften
DOI 10.1175/AIES-D-24-0114.1
Lizenz creative commons licence
Titel (primär) Evaluating the robustness of PCMCI+ for causal discovery of flood drivers
Autor Miersch, P. ORCID logo ; Günther, W.; Runge, J.; Zscheischler, J. ORCID logo
Quelle Artificial Intelligence for the Earth Systems (AIES)
Erscheinungsjahr 2025
Department CER
Band/Volume 4
Heft 4
Seite von e240114
Sprache englisch
Topic T5 Future Landscapes
Daten-/Softwarelinks https://doi.org/10.5281/zenodo.14765911
Keywords Extreme events; Statistical techniques; Time series; Flood events; Machine learning
Abstract Estimating causal drivers of high-impact extreme events such as floods from data is an aspiring pursuit. Time series causal discovery methods, such as the conditional-independence-based PC Momentary Conditional Independence (PCMCI) framework, are designed to identify causal relationships from complex multivariate observational time series. However, the application to extreme event data remains a challenge due to, by the nature of extremes, data length limitations, conditional independence testing for nonlinear relationships, and potential violations of the methods’ assumptions. So far, these challenges have mostly been explored on synthetic data with limited transferability to real-world applications. In this study, we evaluate causal discovery on real and pseudoreal data generated with a hydrological model across 45 catchments with varying flood-generating processes. Because no detailed causal ground truth exists, we focus on the robustness of output graphs. To this end, we simulate a large sample, identify discharge peaks, and investigate the robustness of the causal discovery algorithm PCMCI+ when applied to different realizations of the same setting for various sample sizes. We find that the robustness generally increases with sample size, yet a significant proportion of inferred causal edges remain inconsistent even for large datasets. Notably, while some flood drivers are reliably identified, other key hydrological mechanisms are systematically missed even for very large sample sizes, highlighting methodological limitations. Our study provides a blueprint for investigating the real-world performance of causal discovery methods and illustrates their current limitations for identifying causal drivers of floods.

Significance Statement

Revealing the causes of extremes in the Earth system, like floods, is important for climate risk assessments. Causal discovery is a modern machine learning approach aiming to find these causal drivers in ever more abundant observational data. However, the reliability of causal discovery algorithms depends on many sources of uncertainty. Here, we evaluate the robustness of causal discovery on real and pseudoreal data generated with a state-of-the-art hydrological model and find that current observational sample sizes may not be enough to reliably estimate causal drivers in such challenging settings.

dauerhafte UFZ-Verlinkung https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=31540
Miersch, P., Günther, W., Runge, J., Zscheischler, J. (2025):
Evaluating the robustness of PCMCI+ for causal discovery of flood drivers
Artificial Intelligence for the Earth Systems (AIES) 4 (4), e240114 10.1175/AIES-D-24-0114.1