Skip to main navigation Skip to search Skip to main content

A Comparative Study of Statistical and Machine Learning Methods for Solar Irradiance Forecasting Using the Folsom PLC Dataset

  • Polytechnic University of Valencia

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

The increasing penetration of photovoltaic solar energy has intensified the need for accurate production forecasting to ensure efficient grid operation. This study presents a critical comparison of traditional statistical methods and machine learning approaches for forecasting solar irradiance using the benchmark Folsom PLC dataset. Two primary research questions are addressed: whether machine learning models outperform traditional techniques, and whether time series modelling improves prediction accuracy. The analysis includes an evaluation of a range of models, including statistical regressions (OLS, LASSO, ridge), regression trees, neural networks, LSTM, and random forests, which are applied to physical modelling and time series approaches. The results reveal that although machine learning methods can outperform statistical models, particularly with the inclusion of exogenous weather features, they are not universally superior across all forecasting horizons. Furthermore, pure time series approach models yield lower performance. However, a hybrid approach in which physical models are integrated with machine learning demonstrates significantly improved accuracy. These findings highlight the value of hybrid models for photovoltaic forecasting and suggest strategic directions for operational implementation.

Original languageEnglish
Article number4122
JournalEnergies
Volume18
Issue number15
DOIs
StatePublished - Aug 2025
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • PV
  • energy
  • forecasting
  • machine learning
  • management
  • solar
  • time series

Fingerprint

Dive into the research topics of 'A Comparative Study of Statistical and Machine Learning Methods for Solar Irradiance Forecasting Using the Folsom PLC Dataset'. Together they form a unique fingerprint.

Cite this