Overflow forecasting in early warning systems is acknowledged as an essential task for devastating urban flood risk management. Although many machine learning models have been developed recently to forecast water levels in urban drainage systems (UDS), they usually require big and accurate data resources . Alternatively, ensemble data mining models are becoming more popular, in which time-series numerical data are turned into the categorised features that classify wet weather conditions as two classes of overflow and non-overflow conditions . However, the concept of time-series ensemble modelling i.e., blending different data mining techniques for predictions with different timesteps is still new . Furthermore, the application of more advanced models, particularly multi-blending in these types of ensemble modelling requires more investigation. This study aims to introduce a novel multi-stacking model that integrates different decision tree frameworks by developing various base weak learner data mining techniques and associated base model performance indicators in the process of time-series blending of pre-trained stacked ensemble models. The performance of this new approach is compared by several previously developed ensemble models  through confusion matrix performance criteria, including hit rate, overestimation, and underestimation. This method is demonstrated by its application to a real case study of UDS located in the northwest of London for performance assessment up to 5hr ahead (i.e., 20 timesteps with 15-min intervals). In total, 140 base-models and 20 stacked models were developed that are stored in the data warehouse to use as real-time early-warning flood overflow forecasting for this case study. These developed models were used through introduced decision three framework that specified stacking blending methodology. Results show that while base models and stacked models suffer from high miss rate, especially for forecasting more than 3hrs ahead (more than 50%), the proposed multi-stacking model could perfectly maintain the miss rate (i.e., sum of over- and under-estimations) of up to 4hr-ahead predictions less than 10%, but this rate dropped to nearly 30% for 5hr-ahead predictions. However, the rate of overflow forecasting remained acceptably near 80% whereas it is recorded to less than 58% for other benchmark models. Using different decision frameworks for determining importance of each stacked model in blending mode of multi-stacking method shows could reduce errors in forecasting rate and take advantage of each model in real-time early warning urban flood forecasting.
|Published - 25 Apr 2023
|European Geoscience Union: EGU General Assembly 2023 - Vienna, Austria
Duration: 23 Apr 2023 → 28 Apr 2023
Conference number: 23
|European Geoscience Union
|23/04/23 → 28/04/23