This study investigates the performance of four boosting machine learning models, AdaBoost, XGBoost, CatBoost, and LightGBM, for forecasting maximal (Tmax) and minimal (Tmin) air temperatures at six lead times: the same day and 1, 7, 15, 21, and 30 days ahead. Daily temperature data from the USGS 02187010 weather station (South Carolina, USA) were used for model training and validation. To address the challenges posed by the non-linearity and complexity of climate data, the models were integrated with explainable artificial intelligence (XAI) techniques, specifically SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME), which provide insights into the role of input features in shaping predictions. Results indicate that forecasting accuracy declines with increasing lead time. Among the tested models, CatBoost1 consistently exhibited the best performance. For Tmax forecasting on the validation set, CatBoost1 yielded a correlation coefficient (R) of 0.900, Nash–Sutcliffe efficiency (NSE) of 0.810, root mean squared error (RMSE) of 3.447 °C, mean absolute error (MAE) of 2.571 °C, Willmott’s Index (WI) of 0.947, Legates and McCabe Index (LM) of 0.615, explained variance score (EVS) of 0.810, and absolute percentage bias (APB) of 15.415%. For Tmin, CatBoost1 achieved R = 0.941, NSE = 0.885, RMSE = 2.618 °C, MAE = 1.952 °C, WI = 0.969, LM = 0.709, EVS = 0.885, and APB = 54.360%. These findings demonstrate that boosting models, when combined with explainable AI techniques, offer a robust and transparent framework for temperature forecasting, supporting their application in climate risk management, agriculture, and energy planning.

Forecasting maximal and minimal air temperatures using explainable machine learning: Shapley additive explanation versus local interpretable model-agnostic explanations

Di Nunno F.;Granata F.;
2025-01-01

Abstract

This study investigates the performance of four boosting machine learning models, AdaBoost, XGBoost, CatBoost, and LightGBM, for forecasting maximal (Tmax) and minimal (Tmin) air temperatures at six lead times: the same day and 1, 7, 15, 21, and 30 days ahead. Daily temperature data from the USGS 02187010 weather station (South Carolina, USA) were used for model training and validation. To address the challenges posed by the non-linearity and complexity of climate data, the models were integrated with explainable artificial intelligence (XAI) techniques, specifically SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME), which provide insights into the role of input features in shaping predictions. Results indicate that forecasting accuracy declines with increasing lead time. Among the tested models, CatBoost1 consistently exhibited the best performance. For Tmax forecasting on the validation set, CatBoost1 yielded a correlation coefficient (R) of 0.900, Nash–Sutcliffe efficiency (NSE) of 0.810, root mean squared error (RMSE) of 3.447 °C, mean absolute error (MAE) of 2.571 °C, Willmott’s Index (WI) of 0.947, Legates and McCabe Index (LM) of 0.615, explained variance score (EVS) of 0.810, and absolute percentage bias (APB) of 15.415%. For Tmin, CatBoost1 achieved R = 0.941, NSE = 0.885, RMSE = 2.618 °C, MAE = 1.952 °C, WI = 0.969, LM = 0.709, EVS = 0.885, and APB = 54.360%. These findings demonstrate that boosting models, when combined with explainable AI techniques, offer a robust and transparent framework for temperature forecasting, supporting their application in climate risk management, agriculture, and energy planning.
File in questo prodotto:
File Dimensione Formato  
Forecasting maximal and minimal air temperatures using explainable_compressed.pdf

non disponibili

Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 2.09 MB
Formato Adobe PDF
2.09 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11580/115125
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
social impact