Electricity price forecasting (EPF) is a branch of
energy forecasting Energy forecasting includes forecasting demand ( load) and price of electricity, fossil fuels (natural gas, oil, coal) and renewable energy sources (RES; hydro, wind, solar). Forecasting can be both expected price value and probabilistic forecast ...
which focuses on predicting the
spot
Spot or SPOT may refer to:
Places
* Spot, North Carolina, a community in the United States
* The Spot, New South Wales, a locality in Sydney, Australia
* South Pole Traverse, sometimes called the South Pole Overland Traverse
People
* Spot (produ ...
and
forward price The forward price (or sometimes forward rate) is the agreed upon price of an asset in a forward contract. Using the rational pricing assumption, for a forward contract on an underlying asset that is tradeable, the forward price can be expressed i ...
s in wholesale
electricity market
In a broad sense, an electricity market is a system that facilitates the exchange of electricity-related goods and services. During more than a century of evolution of the electric power industry, the economics of the electricity markets had u ...
s. Over the last 15 years
electricity price forecasts have become a fundamental input to energy companies’ decision-making mechanisms at the corporate level.
Since the early 1990s, the process of
deregulation
Deregulation is the process of removing or reducing state regulations, typically in the economic sphere. It is the repeal of governmental regulation of the economy. It became common in advanced industrial economies in the 1970s and 1980s, as a r ...
and the introduction of
competitive electricity markets have been reshaping the landscape of the traditionally monopolistic and government-controlled power sectors. Throughout Europe, North America and Australia, electricity is now traded under market rules using
spot
Spot or SPOT may refer to:
Places
* Spot, North Carolina, a community in the United States
* The Spot, New South Wales, a locality in Sydney, Australia
* South Pole Traverse, sometimes called the South Pole Overland Traverse
People
* Spot (produ ...
and
derivative contracts.
However, electricity is a very special commodity: it is economically non-storable and power system stability requires a constant balance between production and consumption. At the same time, electricity demand depends on weather (temperature, wind speed, precipitation, etc.) and the intensity of business and everyday activities (
on-peak vs. off-peak hours, weekdays vs. weekends, holidays, etc.). These unique characteristics lead to price dynamics not observed in any other market, exhibiting daily, weekly and often annual
seasonality
In time series data, seasonality is the presence of variations that occur at specific regular intervals less than a year, such as weekly, monthly, or quarterly. Seasonality may be caused by various factors, such as weather, vacation, and holidays a ...
and abrupt, short-lived and generally unanticipated
price spikes.
Extreme
price volatility, which can be up to two orders of magnitude higher than that of any other commodity or financial asset, has forced market participants to hedge not only volume but also price risk. Price forecasts from a few hours to a few months ahead have become of particular interest to power portfolio managers. A power market company able to forecast the volatile wholesale prices with a reasonable level of accuracy can adjust its bidding strategy and its own production or consumption schedule in order to reduce the risk or maximize the profits in day-ahead trading.
A ballpark estimate of savings from a 1% reduction in the
mean absolute percentage error
The mean absolute percentage error (MAPE), also known as mean absolute percentage deviation (MAPD), is a measure of prediction accuracy of a forecasting method in statistics. It usually expresses the accuracy as a ratio defined by the formula:
: ...
(MAPE) of short-term price forecasts is $300,000 per year for a
utility
As a topic of economics, utility is used to model worth or value. Its usage has evolved significantly over time. The term was introduced initially as a measure of pleasure or happiness as part of the theory of utilitarianism by moral philosoph ...
with 1GW
peak load
In electrical engineering, a load profile is a graph of the variation in the electrical load versus time. A load profile will vary according to customer type (typical examples include residential, commercial and industrial), temperature and hol ...
.
Electricity price forecasting is the process of using
mathematical model
A mathematical model is a description of a system using mathematical concepts and language. The process of developing a mathematical model is termed mathematical modeling. Mathematical models are used in the natural sciences (such as physics, ...
s to predict what electricity prices will be in the future.
Forecasting methodology
The simplest model for day ahead forecasting is to ask each generation source to bid on blocks of generation and choose the cheapest bids. If not enough bids are submitted, the price is increased. If too many bids are submitted the price can reach zero or become negative. The offer price includes the generation cost as well as the transmission cost, along with any profit. Power can be sold or purchased from adjoining
power pool Power pooling is used to balance electrical load over a larger network (electrical grid) than a single utility. It is a mechanism for interchange of power between two and more utilities which provide or generate electricity For exchange of power be ...
s.
The concept of independent system operators (ISOs) fosters competition for generation among wholesale market participants by unbundling the operation of transmission and generation. ISOs use bid-based markets to determine economic dispatch.
Wind and solar power are non-
dispatchable. Such power is normally sold before any other bids, at a predetermined rate for each supplier. Any excess is sold to another grid operator, or stored, using
pumped-storage hydroelectricity
Pumped-storage hydroelectricity (PSH), or pumped hydroelectric energy storage (PHES), is a type of hydroelectric energy storage used by electric power systems for load balancing. The method stores energy in the form of gravitational potent ...
, or in the worst case, curtailed. Curtailment could potentially significantly impact solar power's economic and environmental benefits at greater PV penetration levels.
Allocation is done by bidding.
The effect of the recent introduction of smart grids and integrating distributed renewable generation has been increased uncertainty of future supply, demand and prices. This uncertainty has driven much research into the topic of forecasting.
Driving factors
Electricity cannot be stored as easily as gas, it is produced at the exact moment of demand. All of the factors of supply and demand will, therefore, have an immediate impact on the price of electricity on the spot market. In addition to production
costs
In production, research, retail, and accounting, a cost is the value of money that has been used up to produce something or deliver a service, and hence is not available for use anymore. In business, the cost may be one of acquisition, in whic ...
, electricity prices are set by supply and demand. However, some fundamental drivers are the most likely to be considered.
Short-term prices are impacted the most by the weather. Demand due to heating in the winter and cooling in the summer are the main drivers for seasonal price spikes. Additional natural-gas fired capacity is driving down the price of electricity and increasing demand.
A country's natural resource endowment, as well as its regulations in place greatly influence tariffs from the supply side. The supply side of the electricity supply is most influenced by fuel prices, and CO
2 allowance prices. The EU carbon prices have doubled since 2017, making it a significant driving factor of price.
Weather
Studies show that demand for
electricity
Electricity is the set of physical phenomena associated with the presence and motion of matter that has a property of electric charge. Electricity is related to magnetism, both being part of the phenomenon of electromagnetism, as describ ...
is driven largely by
temperature
Temperature is a physical quantity that expresses quantitatively the perceptions of hotness and coldness. Temperature is measured with a thermometer.
Thermometers are calibrated in various temperature scales that historically have relied on ...
.
Heating
A central heating system provides warmth to a number of spaces within a building from one main source of heat. It is a component of heating, ventilation, and air conditioning (short: HVAC) systems, which can both cool and warm interior spaces.
...
demand in the
winter
Winter is the coldest season of the year in Polar regions of Earth, polar and temperate climates. It occurs after autumn and before spring (season), spring. The tilt of Axial tilt#Earth, Earth's axis causes seasons; winter occurs when a Hemi ...
and cooling demand (
air conditioner
Air conditioning, often abbreviated as A/C or AC, is the process of removing heat from an enclosed space to achieve a more comfortable interior environment (sometimes referred to as 'comfort cooling') and in some cases also strictly controlling ...
s) in the
summer
Summer is the hottest of the four temperate seasons, occurring after spring and before autumn. At or centred on the summer solstice, the earliest sunrise and latest sunset occurs, daylight hours are longest and dark hours are shortest, ...
are what primarily drive the seasonal peaks in most regions.
Heating degree day
Heating degree day (HDD) is a measurement designed to quantify the demand for energy needed to heat a building. HDD is derived from measurements of outside air temperature. The heating requirements for a given building at a specific location are ...
s and
cooling degree day
Heating degree day (HDD) is a measurement designed to quantify the demand for energy needed to heat a building. HDD is derived from measurements of outside air temperature. The heating requirements for a given building at a specific location are ...
s help measure energy consumption by referencing the outdoor temperature above and below 65
degrees Fahrenheit
The Fahrenheit scale () is a temperature scale based on one proposed in 1724 by the physicist Daniel Gabriel Fahrenheit (1686–1736). It uses the degree Fahrenheit (symbol: °F) as the unit. Several accounts of how he originally defined h ...
, a commonly accepted baseline.
In terms of renewable sources like solar and wind, weather impacts supply. California's duck curve shows the difference between electricity demand and the amount of solar energy available throughout the day. On a sunny day, solar power floods the electricity generation market and then drops during the evening, when electricity demand peaks.
Hydropower availability
Snowpack
Snowpack forms from layers of snow that accumulate in geographic regions and high elevations where the climate includes cold weather for extended periods during the year. Snowpacks are an important water resource that feed streams and rivers as th ...
,
streamflow
Streamflow, or channel runoff, is the flow of water in streams and other channels, and is a major element of the water cycle. It is one component of the movement of water from the land to waterbodies, the other component being surface runoff. W ...
s, seasonality,
salmon
Salmon () is the common name
In biology, a common name of a taxon or organism (also known as a vernacular name, English name, colloquial name, country name, popular name, or farmer's name) is a name that is based on the normal language of ...
, etc. all affect the amount of water that can flow through a
dam at any given time. Forecasting these variables predicts the available potential energy for a dam for a given period. Some regions such as Pakistan, Egypt,
China and the
Pacific Northwest
The Pacific Northwest (sometimes Cascadia, or simply abbreviated as PNW) is a geographic region in western North America bounded by its coastal waters of the Pacific Ocean to the west and, loosely, by the Rocky Mountains to the east. Though ...
get significant generation from
hydroelectric dams. In 2015,
SAIDI
A Ṣa‘īdī (, Coptic: ⲣⲉⲙⲣⲏⲥ ''Remris'') is a person from Upper Egypt (, Coptic: ⲙⲁⲣⲏⲥ ''Maris'').
Etymology
The word literally means "from Ṣa‘īd" (i.e. Upper Egypt), and can also refer to a form of music or ...
and
SAIFI
The Muslim Saifi, or sometimes pronounced Barhai are Muslim community, found in North India. They are also known as Saifi which denotes the Muslim sub-caste of blacksmiths and carpenters. A small number are also found in the Terai region of Nep ...
more than doubled from the previous year in Zambia due to low water reserves in their hydroelectric dams caused by insufficient rainfall.
[J. Arlet. 2017. “Electricity sector constraints for firms across economies : a comparative analysis.” ''Doing business research notes;'' no.1. Washington, D.C. : World Bank Group. http://documents.worldbank.org/curated/en/409771499690745091/Electricity-sector-constraints-for-firms-across-economies-a-comparative-analysis p.10]
Power plant and transmission outages
Whether planned or unplanned, outages affect the total amount of power that is available to the grid. Outages undermine electricity supply, which in turn affects the price.
Economic health
During times of economic hardship, many factories cut back production due to a reduction of consumer demand and therefore reduce production-related electrical demand.
Government regulation
Governments may choose to make electricity tariffs affordable for their population through subsidies to producers and consumers. Most countries characterized as having low energy access have electric power utilities that do not recover any of their capital and operating costs, due to high subsidy levels.
Taxonomy of modeling approaches

A variety of methods and ideas have been tried for EPF over the last 15 years, with varying degrees of success. They can be broadly classified into six groups.
Multi-agent models
''Multi-agent'' (
''multi-agent simulation'''', equilibrium,
game theoretic'') models simulate the operation of a system of heterogeneous agents (generating units, companies) interacting with each other, and build the price process by matching the demand and supply in the market. This class includes ''cost-based models'' (or ''production-cost models'', PCM), ''equilibrium'' or ''game theoretic'' approaches (like the Nash-Cournot framework, supply function equilibrium - SFE, strategic production-cost models - SPCM) and
agent-based model
An agent-based model (ABM) is a computational model for simulating the actions and interactions of autonomous agents (both individual or collective entities such as organizations or groups) in order to understand the behavior of a system and what ...
s.
Multi-agent models generally focus on qualitative issues rather than quantitative results. They may provide insights as to whether or not prices will be above marginal costs, and how this might influence the players’ outcomes. However, they pose problems if more quantitative conclusions have to be drawn, particularly if electricity prices have to be predicted with a high level of precision.
Fundamental models
''Fundamental'' (''structural'') methods try to capture the basic physical and economic relationships which are present in the production and trading of electricity.
The functional associations between fundamental drivers (loads, weather conditions, system parameters, etc.) are postulated, and the fundamental inputs are modeled and predicted independently, often via statistical, reduced-form or
computational intelligence
The expression computational intelligence (CI) usually refers to the ability of a computer to learn a specific task from data or experimental observation. Even though it is commonly considered a synonym of soft computing, there is still no ...
techniques. In general, two subclasses of fundamental models can be identified: ''parameter rich models''
and ''parsimonious structural models'' of supply and demand.
Two major challenges arise in the practical implementation of fundamental models: data availability and incorporation of stochastic fluctuations of the fundamental drivers. In building the model, we make specific assumptions about physical and economic relationships in the marketplace, and therefore the price projections generated by the models are very sensitive to violations of these assumptions.
Reduced-form models
''Reduced-form'' (''quantitative,
stochastic
Stochastic (, ) refers to the property of being well described by a random probability distribution. Although stochasticity and randomness are distinct in that the former refers to a modeling approach and the latter refers to phenomena themselve ...
'') models characterize the statistical properties of electricity prices over time, with the ultimate objective of
derivatives valuation and
risk management.
Their main intention is not to provide accurate hourly price forecasts, but rather to replicate the main characteristics of daily electricity prices, like
marginal distribution
In probability theory and statistics, the marginal distribution of a subset of a collection of random variables is the probability distribution of the variables contained in the subset. It gives the probabilities of various values of the variables ...
s at future time points, price dynamics, and correlations between commodity prices. If the price process chosen is not appropriate for capturing the main properties of electricity prices, the results from the model are likely to be unreliable. However, if the model is too complex, the computational burden will prevent its use on-line in trading departments. Depending on the type of market under consideration, reduced-form models can be classified as:
* ''Spot price models'', which provide a parsimonious representation of the dynamics of spot prices. Their main drawback is the problem of pricing derivatives, i.e., the identification of the risk premium linking spot and forward prices. The two most popular subclasses include
jump-diffusion and
Markov regime-switching models.
* ''Forward price models'' allow for the pricing of derivatives in a straightforward manner (but only of those written on the forward price of electricity). However, they too have their limitations; most importantly, the lack of data that can be used for calibration and the inability to derive the properties of spot prices from the analysis of forward curves.
Statistical models
''Statistical'' (''
econometric
Econometrics is the application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics," '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8� ...
,
technical analysis
In finance, technical analysis is an analysis methodology for analysing and forecasting the direction of prices through the study of past market data, primarily price and volume. Behavioral economics and quantitative analysis use many of the sa ...
'') methods forecast the current price by using a mathematical combination of the previous prices and/or previous or current values of
exogenous factors, typically consumption and production figures, or weather variables.
The two most important categories are ''additive'' and ''multiplicative'' models. They differ in whether the predicted price is the sum (additive) of a number of components or the product (multiplicative) of a number of factors. The former are far more popular, but the two are closely related - a multiplicative model for prices can be transformed into an additive model for log-prices. Statistical models are attractive because some physical interpretation may be attached to their components, thus allowing engineers and system operators to understand their behavior. They are often criticized for their limited ability to model the (usually) nonlinear behavior of electricity prices and related fundamental variables. However, in practical applications, their performances are not worse than those of the non-linear
computational intelligence
The expression computational intelligence (CI) usually refers to the ability of a computer to learn a specific task from data or experimental observation. Even though it is commonly considered a synonym of soft computing, there is still no ...
methods (see below). For instance, in the ''load forecasting track'' of the
Global Energy Forecasting Competition (GEFCom2012) attracting hundreds of participants worldwide, the top four winning entries used regression-type models.

Statistical models constitute a very rich class which includes:
* Similar-day and
exponential smoothing
Exponential smoothing is a rule of thumb technique for smoothing time series data using the exponential window function. Whereas in the simple moving average the past observations are weighted equally, exponential functions are used to assign e ...
methods.
*
Regression model
In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable, or a 'label' in machine learning parlance) and one o ...
s.
* Time series models without (
AR,
ARMA,
ARIMA
Arima, officially The Royal Chartered Borough of Arima is the easternmost and second largest in area of the three boroughs of Trinidad and Tobago. It is geographically adjacent to Sangre Grande and Arouca at the south central foothills of ...
,
Fractional ARIMA - FARIMA, Seasonal ARIMA - SARIMA, Threshold AR - TAR) and with
exogenous variables (ARX, ARMAX, ARIMAX, SARIMAX, TARX).
*
Heteroskedastic
In statistics, a sequence (or a vector) of random variables is homoscedastic () if all its random variables have the same finite variance. This is also known as homogeneity of variance. The complementary notion is called heteroscedasticity. The s ...
time series models (
GARCH
In econometrics, the autoregressive conditional heteroskedasticity (ARCH) model is a statistical model for time series data that describes the variance of the current error term or innovation as a function of the actual sizes of the previous time ...
, AR-GARCH).
Computational intelligence models
''
Computational intelligence
The expression computational intelligence (CI) usually refers to the ability of a computer to learn a specific task from data or experimental observation. Even though it is commonly considered a synonym of soft computing, there is still no ...
'' (''
artificial intelligence-based,
machine learning
Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence.
Machine ...
, non-parametric, non-linear statistical'') techniques combine elements of learning, evolution and fuzziness to create approaches that are capable of adapting to complex dynamic systems, and may be regarded as "intelligent" in this sense.
Artificial neural network
Artificial neural networks (ANNs), usually simply called neural networks (NNs) or neural nets, are computing systems inspired by the biological neural networks that constitute animal brains.
An ANN is based on a collection of connected units ...
s,
fuzzy systems
A fuzzy control system is a control system based on fuzzy logic—a mathematical system that analyzes analog input values in terms of logical variables that take on continuous values between 0 and 1, in contrast to classical or digital logic, ...
and
support vector machine
In machine learning, support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laboratories ...
s (SVM) are unquestionably the main classes of computational intelligence techniques in EPF. Their major strength is the ability to handle complexity and non-linearity. In general, computational intelligence methods are better at modeling these features of electricity prices than the statistical techniques (see above). At the same time, this flexibility is also their major weakness. The ability to adapt to nonlinear, spiky behaviors will not necessarily result in better point or probabilistic forecasts.
Hybrid models
Many of the modeling and price forecasting approaches considered in the literature are ''hybrid'' solutions, combining techniques from two or more of the groups listed above. Their classification is non-trivial, if possible at all.
As an example of hybrid model AleaModel (AleaSoft) combines Neural Networks and Box Jenkins models.
Forecasting horizons
It is customary to talk about short-, medium- and long-term forecasting,
but there is no consensus in the literature as to what the thresholds should actually be:
* ''Short-term forecasting'' generally involves horizons from a few minutes up to a few days ahead, and is of prime importance in day-to-day market operations.
* ''Medium-term'' ''forecasting'', from a few days to a few months ahead, is generally preferred for
balance sheet
In financial accounting, a balance sheet (also known as statement of financial position or statement of financial condition) is a summary of the financial balances of an individual or organization, whether it be a sole proprietorship, a busine ...
calculations,
risk management and
derivatives pricing
In finance, a derivative is a contract that ''derives'' its value from the performance of an underlying entity. This underlying entity can be an asset, index, or interest rate, and is often simply called the "underlying". Derivatives can be us ...
. In many cases, especially in electricity price forecasting, evaluation is based not on the actual point forecasts, but on the distributions of prices over certain future time periods. As this type of modeling has a long-standing tradition in
finance, an inflow of "finance solutions" is observed.
* ''Long-term'' ''forecasting'', with lead times measured in months, quarters or even years, concentrates on
investment profitability analysis and planning, such as determining the future sites or fuel sources of power plants.
Future of electricity price forecasting
In his extensive review paper, Weron
looks ahead and speculates on the directions EPF will or should take over the next decade or so:
Fundamental price drivers and input variables
Seasonality
A key point in electricity spot price modeling and forecasting is the appropriate treatment of seasonality.
The electricity price exhibits seasonality at three levels: the daily and weekly, and to some extent - the annual. In ''short-term forecasting'', the annual or long-term seasonality is usually ignored, but the daily and weekly patterns (including a separate treatment of holidays) are of prime importance. This, however, may not be the right approach. As Nowotarski and Weron have recently shown, decomposing a series of electricity prices into a long-term seasonal and a stochastic component, modeling them independently and combining their forecasts can bring - contrary to a common belief - an accuracy gain compared to an approach in which a given model is calibrated to the prices themselves.
In ''mid-term forecasting'', the daily patterns become less relevant and most EPF models work with average daily prices. However, the long-term trend-cycle component plays a crucial role. Its misspecification can introduce bias, which may lead to a bad estimate of the mean reversion level or of the price spike intensity and severity, and consequently, to underestimating the risk. Finally, in the ''long term'', when the time horizon is measured in years, the daily, weekly and even annual seasonality may be ignored, and long-term trends dominate. Adequate treatment - both in-sample and
out-of-sample
In machine learning, a common task is the study and construction of algorithms that can learn from and make predictions on data. Such algorithms function by making data-driven predictions or decisions, through building a mathematical model from ...
- of seasonality has not been given enough attention in the literature so far.
Variable selection
Another crucial issue in electricity price forecasting is the appropriate choice of explanatory variables.
Apart from historical electricity prices, the current spot price is dependent on a large set of fundamental drivers, including system loads, weather variables, fuel costs, the
reserve margin (i.e., available generation minus/over predicted demand) and information about scheduled maintenance and forced
outages. Although "pure price" models are sometimes used for EPF, in the most common day-ahead forecasting scenario most authors select a combination of these fundamental drivers, based on the heuristics and experience of the forecaster.
Very rarely has an automated selection or
shrinkage procedure been carried out in EPF, especially for a large set of initial explanatory variables.
However, the
machine learning
Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence.
Machine ...
literature provides viable tools that can be broadly classified into two categories:
*
Feature or subset selection, which involves identifying a subset of predictors that we believe to be influential, then fitting a model on the reduced set of variables.
*
Shrinkage (also known as
regularization
Regularization may refer to:
* Regularization (linguistics)
* Regularization (mathematics)
* Regularization (physics)
* Regularization (solid modeling)
* Regularization Law, an Israeli law intended to retroactively legalize settlements
See also ...
), that fits the full model with all predictors using an algorithm that shrinks the estimated coefficients towards zero, which can significantly reduce their variance. Depending on what type of shrinkage is performed, some of the coefficients may be shrunk to zero itself. As such, some
shrinkage methods - like the
lasso
A lasso ( or ), also called lariat, riata, or reata (all from Castilian, la reata 're-tied rope'), is a loop of rope designed as a restraint to be thrown around a target and tightened when pulled. It is a well-known tool of the Spanish a ...
- ''de facto'' perform
variable selection.
Some of these techniques have been utilized in the context of EPF:
*
stepwise regression
In statistics, stepwise regression is a method of fitting regression models in which the choice of predictive variables is carried out by an automatic procedure. In each step, a variable is considered for addition to or subtraction from the set o ...
, including single step elimination,
*
Ridge regression
Ridge regression is a method of estimating the coefficients of multiple- regression models in scenarios where the independent variables are highly correlated. It has been used in many fields including econometrics, chemistry, and engineering. Also ...
,
*
lasso
A lasso ( or ), also called lariat, riata, or reata (all from Castilian, la reata 're-tied rope'), is a loop of rope designed as a restraint to be thrown around a target and tightened when pulled. It is a well-known tool of the Spanish a ...
,
* and
elastic net
Elastic maps provide a tool for nonlinear dimensionality reduction. By their construction, they are a system of elastic springs embedded in the data
space. This system approximates a low-dimensional manifold. The elastic coefficients of this sy ...
s,
but their use is not common. Further development and employment of methods for selecting the most effective input variables - from among the past electricity prices, as well as the past and predicted values of the fundamental drivers - is needed.
Spike forecasting and the reserve margin
When predicting spike occurrences or spot price volatility, one of the most influential fundamental variables is the
reserve margin, also called
surplus generation. It relates the available capacity (generation, supply),
, to the demand (load),
, at a given moment in time
. The traditional engineering notion of the reserve margin defines it as the difference between the two, i.e.,
, but many authors prefer to work with dimensionless ratios
,
or the so-called capacity utilization
.
Its rare application in EPF can be justified only by the difficulty of obtaining good quality reserve margin data. Given that more and more system operators (see e.g. http://www.elexon.co.uk) are disclosing such information nowadays, reserve margin data should be playing a significant role in EPF in the near future.
Probabilistic forecasts
The use of
prediction interval
In statistical inference, specifically predictive inference, a prediction interval is an estimate of an interval in which a future observation will fall, with a certain probability, given what has already been observed. Prediction intervals are ...
s (PI) and densities, or
probabilistic forecasting Probabilistic forecasting summarizes what is known about, or opinions about, future events. In contrast to single-valued forecasts (such as forecasting that the maximum temperature at a given site on a given day will be 23 degrees Celsius, or that t ...
, has become much more common over the past three decades, as practitioners have come to understand the limitations of point forecasts. Despite the bold move by the organizers of the
Global Energy Forecasting Competition 2014 to require the participants to submit forecasts of the 99
percentile
In statistics, a ''k''-th percentile (percentile score or centile) is a score ''below which'' a given percentage ''k'' of scores in its frequency distribution falls (exclusive definition) or a score ''at or below which'' a given percentage falls ...
s of the predictive distribution (day-ahead in the price track) and not the point forecasts as in the 2012 edition, this does not seem to be a common case in EPF as yet.
If PIs are computed at all, they usually are distribution-based (and approximated by the standard deviation of the model residuals
) or empirical. A new forecast combination (see below) technique has been introduced recently in the context of EPF.
Quantile Regression Averaging (QRA) involves applying
quantile regression
Quantile regression is a type of regression analysis used in statistics and econometrics. Whereas the method of least squares estimates the conditional ''mean'' of the response variable across values of the predictor variables, quantile regress ...
to the point forecasts of a small number of individual forecasting models or experts, hence allows to leverage existing development of point forecasting.
Combining forecasts
Consensus forecast
Used in a number of sciences, ranging from econometrics to meteorology, consensus forecasts are predictions of the future that are created by combining together several separate forecasts which have often been created using different methodologies ...
s, also known as ''combining forecasts'', ''forecast averaging'' or ''model averaging'' (in
econometrics
Econometrics is the application of statistical methods to economic data in order to give empirical content to economic relationships.M. Hashem Pesaran (1987). "Econometrics," '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8� ...
and
statistics) and ''
committee machine A committee machine is a type of artificial neural network using a divide and conquer strategy in which the responses of multiple neural networks (experts) are combined into a single response.HAYKIN, S. Neural Networks - A Comprehensive Foundation. ...
s'', ''
ensemble averaging
In machine learning, particularly in the creation of artificial neural networks, ensemble averaging is the process of creating multiple models and combining them to produce a desired output, as opposed to creating just one model. Frequently an ens ...
'' or ''expert aggregation'' (in
machine learning
Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence.
Machine ...
), are predictions of the future that are created by combining several separate forecasts which have often been created using different methodologies. Despite their popularity in econometrics, averaged forecasts have not been used extensively in the context of
electricity market
In a broad sense, an electricity market is a system that facilitates the exchange of electricity-related goods and services. During more than a century of evolution of the electric power industry, the economics of the electricity markets had u ...
s to date. There is some limited evidence on the adequacy of combining forecasts of electricity demand, but it was only very recently that combining was used in EPF and only for point forecasts.
Combining probabilistic (i.e., interval and density) forecasts is much less popular, even in econometrics in general, mainly because of the increased complexity of the problem. Since
Quantile Regression Averaging (QRA) allows to leverage existing development of point forecasting,
it is particularly attractive from a practical point of view and may become a popular tool in EPF in the near future.
Multivariate factor models
The literature on forecasting daily electricity prices has concentrated largely on models that use only information at the aggregated (i.e., daily) level. On the other hand, the very rich body of literature on forecasting intra-day prices has used disaggregated data (i.e., hourly or half-hourly), but generally has not explored the complex dependence structure of the multivariate price series.
If we want to explore the structure of intra-day electricity prices, we need to use dimension reduction methods; for instance, factor models with factors estimated as
principal components
Principal component analysis (PCA) is a popular technique for analyzing large datasets containing a high number of dimensions/features per observation, increasing the interpretability of data while preserving the maximum amount of information, and ...
(PC). Empirical evidence indicates that there are forecast improvements from incorporating disaggregated (i.e., hourly or zonal) data for predicting daily system prices, especially when the forecast horizon exceeds one week.
With the increase of computational power, the real-time calibration of these complex models will become feasible and we may expect to see more EPF applications of the multivariate framework in the coming years.
A universal test ground
All major review publications conclude that there are problems with comparing the methods developed and used in the EPF literature.
This is due mainly to the use of different datasets, different software implementations of the forecasting models and different error measures, but also to the lack of statistical rigor in many studies. This calls for a comprehensive, thorough study involving (i) the same datasets, (ii) the same robust error evaluation procedures, and (iii) statistical testing of the significance of one model's outperformance of another. To some extent, the
Global Energy Forecasting Competition 2014 has addressed these issues. Yet more has to be done. A selection of the better-performing measures (weighted-MAE, seasonal MASE or RMSSE) should be used either exclusively or in conjunction with the more popular ones (MAPE, RMSE). The empirical results should be further tested for the significance of the differences in forecasting accuracies of the models.
See also
*
Energy forecasting Energy forecasting includes forecasting demand ( load) and price of electricity, fossil fuels (natural gas, oil, coal) and renewable energy sources (RES; hydro, wind, solar). Forecasting can be both expected price value and probabilistic forecast ...
*
Global Energy Forecasting Competition The Global Energy Forecasting Competition (GEFCom) is a competition conducted by a team led by Dr. Tao Hong that invites submissions around the world for forecasting energy demand. GEFCom was first held in 2012 on Kaggle, and the second GEFCom was h ...
s
References
{{reflist
Economic forecasting
Regression with time series structure
Artificial neural networks
Energy economics
Electricity markets