Data-driven forecasting of photovoltaic performance with explainable artificial intelligence: Experimental validation using environmental and thermal predictors


Karakoyun Y., KATİPOĞLU O. M., DOĞAN A., Canbaz A.

Energy Conversion and Management, cilt.364, 2026 (SCI-Expanded, Scopus) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 364
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1016/j.enconman.2026.121734
  • Dergi Adı: Energy Conversion and Management
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Applied Science & Technology Source, Compendex, Environment Index, INSPEC, Academic Search Ultimate (EBSCO), Engineering Source (EBSCO)
  • Anahtar Kelimeler: Machine Learning, Regression Analysis, SHAP, Solar Energy Forecasting, Sustainable Energy
  • Van Yüzüncü Yıl Üniversitesi Adresli: Evet

Özet

PV systems meet energy demand, but environmental and operational factors affect their performance. This study used a three-scenario ablation framework to predict PV power from baseline meteorological and operational variables to under-panel temperature and finally to the full thermal configuration, including water temperature (a newly studied variable). Scenario 1 included time, air temperature, panel angle, relative humidity, and average wind speed; Scenario 2 added under-panel temperature; and Scenario 3 added water temperature. Direct thermal predictors were evaluated separately from meteorological inputs using this design. Four sophisticated ML algorithms, Random Forest (RF), CatBoost, Support Vector Machine (SVM), and one-dimensional Convolutional Neural Network (1D CNN), were refined by Randomized Search (RS) and evaluated using metrics. According to the first group of power prediction results under the full Scenario 3 configuration, the CatBoost-RS model demonstrated superior performance with RMSE = 10.61, MAE = 8.41, AIC = 222.22, KGE = 0.957, and R2 = 0.99. For the second group, the SVM-RS model outperformed others, achieving RMSE = 8.02, MAE = 5.50, AIC = 200.36, KGE = 0.994, and R2 = 0.992. Comparative analysis showed that CatBoost-RS was the most robust predictor, while RF exhibited proficiency in managing nonlinearity in this experimental setting. One-way ANOVA, paired t -test, Wilcoxon signed-rank test, bootstrap confidence intervals, and effect size (Cohen’s d) analyses across both groups show that optimised ensemble and kernel-based models outperform the convolutional approach in predictive performance. Experimental validation and high predictive accuracy improve PV performance prediction in the proposed method.