This study aimed to evaluate the estimation accuracy of rainfall erosivity (R-factor) in Italy and Switzerland through five Machine learning (ML) models (Decision Tree (DT), K-Nearest Neighbors (KNN), Random Forest (RF), Gradient Boosting (GB), and eXtreme Gradient Boost (XGB)) tuned with optimal hyperparameters. To build the ML model, high temporal resolution (HTR) rainfall data were collected from 297 rain-gauge stations located in the study area. To estimate the RUSLE-based R-factor through the models, the rainfall amount for each rainfall event, the rainfall duration, and the maximum 60-min intensity were used as input data. The datasets for training/validation and testing consisted of rainfall data from 287 and 10 stations, respectively. In a second phase, each ML model was trained through 10-fold cross validation based on training and validation data. For hyperparameter adjustment, the models were optimized using the Bayesian optimization algorithm (BOA). The R-factor estimation performance of each ML model through cross validation improved from 6.1% to 62.8% as hyperparameters were optimized through BOA. In particular, ensemble models such as RF, GB, and XGB were superior to other models with an accuracy performance of 0.9 or even more. And the RF showed an excellent estimation performance (R 2 = 0.965, NSE = 0.958, RMSE = 44.993 MJ mm ha−1h−1, and MAE = 13.901 MJ mm ha−1h−1) for test stations, followed by GB and XGB with similar performance. However, the R-factor for the extremely intense rainfall event estimated by the ML models showed a significant difference from the RUSLE-based R-factor. This result implies that although the ML model built in this study can reasonably estimate the R-factor in the general rainfall event, additional training and validation through securing various rainfall event data is required to improve estimation accuracy on an extreme rainfall event.
Lee, S., Bae, J.H., Hong, J., Yang, D., Panagos, P., Borrelli, P., et al. (2022). Estimation of rainfall erosivity factor in Italy and Switzerland using Bayesian optimization based machine learning models. CATENA, 211, 105957 [10.1016/j.catena.2021.105957].
Estimation of rainfall erosivity factor in Italy and Switzerland using Bayesian optimization based machine learning models
Borrelli P.Membro del Collaboration Group
;
2022-01-01
Abstract
This study aimed to evaluate the estimation accuracy of rainfall erosivity (R-factor) in Italy and Switzerland through five Machine learning (ML) models (Decision Tree (DT), K-Nearest Neighbors (KNN), Random Forest (RF), Gradient Boosting (GB), and eXtreme Gradient Boost (XGB)) tuned with optimal hyperparameters. To build the ML model, high temporal resolution (HTR) rainfall data were collected from 297 rain-gauge stations located in the study area. To estimate the RUSLE-based R-factor through the models, the rainfall amount for each rainfall event, the rainfall duration, and the maximum 60-min intensity were used as input data. The datasets for training/validation and testing consisted of rainfall data from 287 and 10 stations, respectively. In a second phase, each ML model was trained through 10-fold cross validation based on training and validation data. For hyperparameter adjustment, the models were optimized using the Bayesian optimization algorithm (BOA). The R-factor estimation performance of each ML model through cross validation improved from 6.1% to 62.8% as hyperparameters were optimized through BOA. In particular, ensemble models such as RF, GB, and XGB were superior to other models with an accuracy performance of 0.9 or even more. And the RF showed an excellent estimation performance (R 2 = 0.965, NSE = 0.958, RMSE = 44.993 MJ mm ha−1h−1, and MAE = 13.901 MJ mm ha−1h−1) for test stations, followed by GB and XGB with similar performance. However, the R-factor for the extremely intense rainfall event estimated by the ML models showed a significant difference from the RUSLE-based R-factor. This result implies that although the ML model built in this study can reasonably estimate the R-factor in the general rainfall event, additional training and validation through securing various rainfall event data is required to improve estimation accuracy on an extreme rainfall event.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.