Electricity Price Forecasting
   HOME

TheInfoList



OR:

Electricity price forecasting (EPF) is a branch of
energy forecasting Energy forecasting includes forecasting demand ( load) and price of electricity, fossil fuels (natural gas, oil, coal) and renewable energy sources (RES; hydro, wind, solar). Forecasting can be both expected price value and probabilistic forecasti ...
which focuses on using
mathematical Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
,
statistical Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
and
machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
models to predict electricity prices in the future. Over the last 30 years electricity price forecasts have become a fundamental input to energy companies’
decision-making In psychology, decision-making (also spelled decision making and decisionmaking) is regarded as the Cognition, cognitive process resulting in the selection of a belief or a course of action among several possible alternative options. It could be ...
mechanisms at the corporate level. Since the early 1990s, the process of
deregulation Deregulation is the process of removing or reducing state regulations, typically in the economic sphere. It is the repeal of governmental regulation of the economy. It became common in advanced industrial economies in the 1970s and 1980s, as a ...
and the introduction of competitive electricity markets have been reshaping the landscape of the traditionally monopolistic and government-controlled power sectors. Throughout Europe, North America, Australia and Asia, electricity is now traded under market rules using
spot Spot or SPOT may refer to: Places * Spot, North Carolina, a community in the United States * The Spot, New South Wales, a locality in Sydney, Australia * South Pole Traverse, sometimes called the South Pole Overland Traverse People * Spot Coll ...
and derivative contracts. However, electricity is a very special commodity: it is economically non-storable and power system stability requires a constant balance between production and consumption. At the same time, electricity demand depends on weather (temperature, wind speed, precipitation, etc.) and the intensity of business and everyday activities ( on-peak vs. off-peak hours, weekdays vs. weekends, holidays, etc.). These unique characteristics lead to price dynamics not observed in any other market, exhibiting daily, weekly and often annual
seasonality In time series data, seasonality refers to the trends that occur at specific regular intervals less than a year, such as weekly, monthly, or quarterly. Seasonality may be caused by various factors, such as weather, vacation, and holidays and consi ...
and abrupt, short-lived and generally unanticipated price spikes. Extreme price volatility, which can be up to two orders of magnitude higher than that of any other commodity or financial asset, has forced market participants to hedge not only volume but also price risk. Price forecasts from a few hours to a few months ahead have become of particular interest to power portfolio managers. A power market company able to forecast the volatile wholesale prices with a reasonable level of accuracy can adjust its bidding strategy and its own production or consumption schedule in order to reduce the risk or maximize the profits in day-ahead trading. A ballpark estimate of savings from a 1% reduction in the mean absolute percentage error (MAPE) of short-term price forecasts is $300,000 per year for a
utility In economics, utility is a measure of a certain person's satisfaction from a certain state of the world. Over time, the term has been used with at least two meanings. * In a normative context, utility refers to a goal or objective that we wish ...
with 1GW
peak load Peak demand on an electrical grid is the highest electrical power demand that has occurred over a specified time period (Gönen 2008). Peak demand is typically characterized as annual, daily or seasonal and has the unit of power. Peak demand, pe ...
. With the additional price forecasts, the savings double.


Forecasting methodology

The simplest model for day ahead forecasting is to ask each generation source to bid on blocks of generation and choose the cheapest bids. If not enough bids are submitted, the price is increased. If too many bids are submitted the price can reach zero or become negative. The offer price includes the generation cost as well as the transmission cost, along with any profit. Power can be sold or purchased from adjoining power pools. The concept of independent system operators (ISOs) fosters competition for generation among wholesale market participants by unbundling the operation of transmission and generation. ISOs use bid-based markets to determine economic dispatch. Wind and solar power are non- dispatchable. Such power is normally sold before any other bids, at a predetermined rate for each supplier. Any excess is sold to another grid operator, or stored, using
pumped-storage hydroelectricity Pumped-storage hydroelectricity (PSH), or pumped hydroelectric energy storage (PHES), is a type of hydroelectric energy storage used by electric power systems for load balancing (electrical power), load balancing. A PSH system stores energy i ...
, or in the worst case, curtailed. Curtailment could potentially significantly impact solar power's economic and environmental benefits at greater PV penetration levels. Allocation is done by bidding. The effect of the recent introduction of smart grids and integrating distributed renewable generation has been increased uncertainty of future supply, demand and prices. This uncertainty has driven much research into the topic of forecasting.


Driving factors

Electricity cannot be stored as easily as gas, it is produced at the exact moment of demand. All of the factors of supply and demand will, therefore, have an immediate impact on the price of electricity on the spot market. In addition to production
costs Cost is the value of money that has been used up to produce something or deliver a service, and hence is not available for use anymore. In business, the cost may be one of acquisition, in which case the amount of money expended to acquire it is ...
, electricity prices are set by supply and demand. However, some fundamental drivers are the most likely to be considered. Short-term prices are impacted the most by the weather. Demand due to heating in the winter and cooling in the summer are the main drivers for seasonal price spikes. Additional natural-gas fired capacity is driving down the price of electricity and increasing demand. A country's natural resource endowment, as well as its regulations in place greatly influence tariffs from the supply side. The supply side of the electricity supply is most influenced by fuel prices, and CO2 allowance prices. The EU carbon prices have doubled since 2017, making it a significant driving factor of price.


Weather

Studies show that demand for
electricity Electricity is the set of physical phenomena associated with the presence and motion of matter possessing an electric charge. Electricity is related to magnetism, both being part of the phenomenon of electromagnetism, as described by Maxwel ...
is driven largely by
temperature Temperature is a physical quantity that quantitatively expresses the attribute of hotness or coldness. Temperature is measurement, measured with a thermometer. It reflects the average kinetic energy of the vibrating and colliding atoms making ...
.
Heating In thermodynamics, heat is energy in transfer between a thermodynamic system and its surroundings by such mechanisms as thermal conduction, electromagnetic radiation, and friction, which are microscopic in nature, involving sub-atomic, atom ...
demand in the
winter Winter is the coldest and darkest season of the year in temperate and polar climates. It occurs after autumn and before spring. The tilt of Earth's axis causes seasons; winter occurs when a hemisphere is oriented away from the Sun. Dif ...
and cooling demand (
air conditioner Air conditioning, often abbreviated as A/C (US) or air con (UK), is the process of removing heat from an enclosed space to achieve a more comfortable interior temperature, and in some cases, also controlling the humidity of internal air. Air c ...
s) in the
summer Summer or summertime is the hottest and brightest of the four temperate seasons, occurring after spring and before autumn. At or centred on the summer solstice, daylight hours are the longest and darkness hours are the shortest, with day ...
are what primarily drive the seasonal peaks in most regions.
Heating degree day Heating degree day (HDD) is a measurement designed to quantify the demand for energy needed to heat a building. HDD is derived from measurements of outside air temperature. The estimated average heating energy requirements for a given building at ...
s and cooling degree days help measure energy consumption by referencing the outdoor temperature above and below 65 degrees
Fahrenheit The Fahrenheit scale () is a scale of temperature, temperature scale based on one proposed in 1724 by the German-Polish physicist Daniel Gabriel Fahrenheit (1686–1736). It uses the degree Fahrenheit (symbol: °F) as the unit. Several accou ...
, a commonly accepted baseline. In terms of renewable sources like solar and wind, weather impacts supply. California's duck curve shows the difference between electricity demand and the amount of solar energy available throughout the day. On a sunny day, solar power floods the
electricity generation Electricity generation is the process of generating electric power from sources of primary energy. For electric utility, utilities in the electric power industry, it is the stage prior to its Electricity delivery, delivery (Electric power transm ...
market and then drops during the evening, when electricity demand peaks. Forecasting for wind and solar renewable energy is becoming more important as the amount of energy generated from these sources increases. Meteorological forecasts can improve the accuracy of electricity price forecasting models. While day-ahead forecasts can take advantage of autoregressive effects, forecasts featuring meteorological data are more accurate for 2-4 day-ahead horizons. In some cases, renewable energy generation forecasts published by Transmission System Operators (TSOs) can be improved with simple prediction models and used provide more accurate electricity price predictions.


Hydropower availability

Snowpack Snowpack is an accumulation of snow that compresses with time and melts seasonally, often at high elevation or high latitude. Snowpacks are an important water resource that feed streams and rivers as they melt, sometimes leading to flooding. Snow ...
,
streamflow Streamflow, or channel runoff, is the flow of water in streams and other channels, and is a major element of the water cycle. It is one runoff component, the movement of water from the land to waterbodies, the other component being ''surface runo ...
s, seasonality,
salmon Salmon (; : salmon) are any of several list of commercially important fish species, commercially important species of euryhaline ray-finned fish from the genera ''Salmo'' and ''Oncorhynchus'' of the family (biology), family Salmonidae, native ...
, etc. all affect the amount of water that can flow through a
dam A dam is a barrier that stops or restricts the flow of surface water or underground streams. Reservoirs created by dams not only suppress floods but also provide water for activities such as irrigation, human consumption, industrial use, aqua ...
at any given time. Forecasting these variables predicts the available potential energy for a dam for a given period. Some regions such as Pakistan, Egypt,
China China, officially the People's Republic of China (PRC), is a country in East Asia. With population of China, a population exceeding 1.4 billion, it is the list of countries by population (United Nations), second-most populous country after ...
and the
Pacific Northwest The Pacific Northwest (PNW; ) is a geographic region in Western North America bounded by its coastal waters of the Pacific Ocean to the west and, loosely, by the Rocky Mountains to the east. Though no official boundary exists, the most common ...
get significant generation from
hydroelectric Hydroelectricity, or hydroelectric power, is Electricity generation, electricity generated from hydropower (water power). Hydropower supplies 15% of the world's electricity, almost 4,210 TWh in 2023, which is more than all other Renewable energ ...
dams. In 2015,
SAIDI A Ṣa‘īdī (, Coptic language, Coptic: ⲣⲉⲙⲣⲏⲥ ''Remris'') is a person from Upper Egypt (, Coptic language, Coptic: ⲙⲁⲣⲏⲥ ''Maris''). Etymology The word literally means "from Ṣa‘īd" (i.e. Upper Egypt), and can al ...
and
SAIFI The System Average Interruption Frequency Index (SAIFI) is commonly used as a reliability index by electric power utilities. This index measures the average number of times that a system customer experiences an outage during the year or during a g ...
more than doubled from the previous year in Zambia due to low water reserves in their hydroelectric dams caused by insufficient rainfall.J. Arlet. 2017. “Electricity sector constraints for firms across economies : a comparative analysis.” ''Doing business research notes;'' no.1. Washington, D.C. : World Bank Group. http://documents.worldbank.org/curated/en/409771499690745091/Electricity-sector-constraints-for-firms-across-economies-a-comparative-analysis p.10


Power plant and transmission outages

Whether planned or unplanned, outages affect the total amount of power that is available to the grid. Outages undermine electricity supply, which in turn affects the price.


Economic health

During times of economic hardship, many factories cut back production due to a reduction of consumer demand and therefore reduce production-related electrical demand.


Government regulation

Governments may choose to make electricity tariffs affordable for their population through subsidies to producers and consumers. Most countries characterized as having low energy access have electric power utilities that do not recover any of their capital and operating costs, due to high subsidy levels.


Taxonomy of modeling approaches

A variety of methods and ideas have been tried for electricity price forecasting (EPF), with varying degrees of success. They can be broadly classified into six groups.


Multi-agent models

''Multi-agent'' ( ''multi-agent simulation'''', equilibrium, game theoretic'') models simulate the operation of a system of heterogeneous agents (generating units, companies) interacting with each other, and build the price process by matching the demand and supply in the market. This class includes ''cost-based models'' (or ''production-cost models'', PCM), ''equilibrium'' or ''game theoretic'' approaches (like the Nash-Cournot framework, supply function equilibrium - SFE, strategic production-cost models - SPCM) and
agent-based model An agent-based model (ABM) is a computational model for simulating the actions and interactions of autonomous agents (both individual or collective entities such as organizations or groups) in order to understand the behavior of a system and ...
s. Multi-agent models generally focus on qualitative issues rather than quantitative results. They may provide insights as to whether or not prices will be above marginal costs, and how this might influence the players’ outcomes. However, they pose problems if more quantitative conclusions have to be drawn, particularly if electricity prices have to be predicted with a high level of precision.


Fundamental models

''Fundamental'' (''structural'') methods try to capture the basic physical and economic relationships which are present in the production and trading of electricity. The functional associations between fundamental drivers (loads, weather conditions, system parameters, etc.) are postulated, and the fundamental inputs are modeled and predicted independently, often via statistical, reduced-form or
computational intelligence In computer science, computational intelligence (CI) refers to concepts, paradigms, algorithms and implementations of systems that are designed to show " intelligent" behavior in complex and changing environments. These systems are aimed at m ...
techniques. In general, two subclasses of fundamental models can be identified: ''parameter rich models'' and ''parsimonious structural models'' of supply and demand. Two major challenges arise in the practical implementation of fundamental models: data availability and incorporation of stochastic fluctuations of the fundamental drivers. In building the model, we make specific assumptions about physical and economic relationships in the marketplace, and therefore the price projections generated by the models are very sensitive to violations of these assumptions.


Reduced-form models

''Reduced-form'' (''quantitative,
stochastic Stochastic (; ) is the property of being well-described by a random probability distribution. ''Stochasticity'' and ''randomness'' are technically distinct concepts: the former refers to a modeling approach, while the latter describes phenomena; i ...
'') models characterize the statistical properties of electricity prices over time, with the ultimate objective of derivatives valuation and
risk management Risk management is the identification, evaluation, and prioritization of risks, followed by the minimization, monitoring, and control of the impact or probability of those risks occurring. Risks can come from various sources (i.e, Threat (sec ...
. Their main intention is not to provide accurate hourly price forecasts, but rather to replicate the main characteristics of daily electricity prices, like
marginal distribution In probability theory and statistics, the marginal distribution of a subset of a collection of random variables is the probability distribution of the variables contained in the subset. It gives the probabilities of various values of the variable ...
s at future time points, price dynamics, and correlations between commodity prices. If the price process chosen is not appropriate for capturing the main properties of electricity prices, the results from the model are likely to be unreliable. However, if the model is too complex, the computational burden will prevent its use on-line in trading departments. Depending on the type of market under consideration, reduced-form models can be classified as: * ''Spot price models'', which provide a parsimonious representation of the dynamics of spot prices. Their main drawback is the problem of pricing derivatives, i.e., the identification of the risk premium linking spot and forward prices. The two most popular subclasses include jump-diffusion and Markov regime-switching models. * ''Forward price models'' allow for the pricing of derivatives in a straightforward manner (but only of those written on the forward price of electricity). However, they too have their limitations; most importantly, the lack of data that can be used for calibration and the inability to derive the properties of spot prices from the analysis of forward curves.


Statistical models

''Statistical'' (such as ''
econometric Econometrics is an application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics", '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8 ...
'') methods forecast the current price by using a mathematical combination of the previous prices and/or previous or current values of exogenous factors, typically consumption and production figures, or weather variables. The two most important categories are ''additive'' and ''multiplicative'' models. They differ in whether the predicted price is the sum (additive) of a number of components or the product (multiplicative) of a number of factors. The former are far more popular, but the two are closely related - a multiplicative model for prices can be transformed into an additive model for log-prices. Statistical models are attractive because some physical interpretation may be attached to their components, thus allowing engineers and system operators to understand their behavior. They are often criticized for their limited ability to model the (usually) nonlinear behavior of electricity prices and related fundamental variables. However, in practical applications, their performances are not worse than those of the non-linear
computational intelligence In computer science, computational intelligence (CI) refers to concepts, paradigms, algorithms and implementations of systems that are designed to show " intelligent" behavior in complex and changing environments. These systems are aimed at m ...
methods (see below). For instance, in the ''load forecasting track'' of the Global Energy Forecasting Competition (GEFCom2012) attracting hundreds of participants worldwide, the top four winning entries used regression-type models. Statistical models constitute a very rich class which includes: * Similar-day and
exponential smoothing Exponential smoothing or exponential moving average (EMA) is a rule of thumb technique for smoothing time series data using the exponential window function. Whereas in the simple moving average the past observations are weighted equally, exponen ...
methods. * Time series regression models models without ( AR,
ARMA Arma, ARMA or variants, may refer to: Places * Arma, Kansas, United States * Arma, Nepal * Arma District, Peru * Arma District, Yemen * Arma Mountains, Afghanistan People * Arma people, an ethnic group of the middle Niger River valley * Arma lan ...
,
ARIMA Arima, officially The Royal Chartered Borough of Arima is the easternmost and second largest in area of the three boroughs of Trinidad and Tobago. It is geographically adjacent to Sangre Grande and Arouca at the south central foothills of the ...
, Fractional ARIMA - FARIMA, Seasonal ARIMA - SARIMA, Threshold AR - TAR) and with exogenous variables (ARX, ARMAX, ARIMAX, SARIMAX, TARX). * Heteroskedastic time series models (
GARCH In econometrics, the autoregressive conditional heteroskedasticity (ARCH) model is a statistical model for time series data that describes the variance of the current error term or innovation as a function of the actual sizes of the previous time ...
, AR-GARCH, SV). *
Factor Factor (Latin, ) may refer to: Commerce * Factor (agent), a person who acts for, notably a mercantile and colonial agent * Factor (Scotland), a person or firm managing a Scottish estate * Factors of production, such a factor is a resource used ...
models. *
Functional data analysis Functional data analysis (FDA) is a branch of statistics that analyses data providing information about curves, surfaces or anything else varying over a continuum. In its most general form, under an FDA framework, each sample element of functional ...
models.


Computational intelligence models

''
Computational intelligence In computer science, computational intelligence (CI) refers to concepts, paradigms, algorithms and implementations of systems that are designed to show " intelligent" behavior in complex and changing environments. These systems are aimed at m ...
'' ('' artificial intelligence-based,
machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
, non-parametric, non-linear statistical'') techniques combine elements of learning, evolution and fuzziness to create approaches that are capable of adapting to complex dynamic systems, and may be regarded as "intelligent" in this sense.
Artificial neural network In machine learning, a neural network (also artificial neural network or neural net, abbreviated ANN or NN) is a computational model inspired by the structure and functions of biological neural networks. A neural network consists of connected ...
s, including
deep neural networks Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation learning. The field takes inspiration from biological neuroscience a ...
, explainable AI models and distributional neural networks, as well as
fuzzy systems A fuzzy control system is a control system based on fuzzy logic – a mathematics, mathematical system that analyzes analog signal, analog input values in terms of mathematical logic, logical variables that take on continuous values between 0 ...
and
support vector machine In machine learning, support vector machines (SVMs, also support vector networks) are supervised max-margin models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laborato ...
s (SVM) are unquestionably the main classes of computational intelligence techniques in EPF. Their major strength is the ability to handle complexity and non-linearity. In general, computational intelligence methods are better at modeling these features of electricity prices than the statistical techniques (see above). At the same time, this flexibility is also their major weakness. The ability to adapt to nonlinear, spiky behavior does not necessarily lead to better point or probabilistic predictions, and a lot of effort is required to find the right hyper-parameters.


Hybrid models

Many of the modeling and price forecasting approaches considered in the literature are ''hybrid'' solutions, combining techniques from two or more of the groups listed above. Their classification is non-trivial, if possible at all. As an example of hybrid model AleaModel (AleaSoft) combines Neural Networks and Box Jenkins models.


Forecasting horizons

It is customary to talk about short-, medium- and long-term forecasting, but there is no consensus in the literature as to what the thresholds should actually be: * ''Short-term forecasting'' generally involves horizons from a few minutes up to a few days ahead, and is of prime importance in day-to-day market operations. * ''Medium-term'' ''forecasting'', from a few days to a few months ahead, is generally preferred for
balance sheet In financial accounting, a balance sheet (also known as statement of financial position or statement of financial condition) is a summary of the financial balances of an individual or organization, whether it be a sole proprietorship, a business ...
calculations,
risk management Risk management is the identification, evaluation, and prioritization of risks, followed by the minimization, monitoring, and control of the impact or probability of those risks occurring. Risks can come from various sources (i.e, Threat (sec ...
and
derivatives pricing In finance, a derivative is a contract between a buyer and a seller. The derivative can take various forms, depending on the transaction, but every derivative has the following four elements: # an item (the "underlier") that can or must be bou ...
. In many cases, especially in electricity price forecasting, evaluation is based not on the actual point forecasts, but on the distributions of prices over certain future time periods. As this type of modeling has a long-standing tradition in
finance Finance refers to monetary resources and to the study and Academic discipline, discipline of money, currency, assets and Liability (financial accounting), liabilities. As a subject of study, is a field of Business administration, Business Admin ...
, an inflow of "finance solutions" is observed. * ''Long-term'' ''forecasting'', with lead times measured in months, quarters or even years, concentrates on investment profitability analysis and planning, such as determining the future sites or fuel sources of power plants.


Future of electricity price forecasting

In his extensive review paper, Weron looks ahead and speculates on the directions EPF will or should take over the next decade or so:


Fundamental price drivers and input variables


Seasonality

A key point in electricity spot price modeling and forecasting is the appropriate treatment of seasonality. The electricity price exhibits seasonality at three levels: the daily and weekly, and to some extent - the annual. In ''short-term forecasting'', the annual or long-term seasonality is usually ignored, but the daily and weekly patterns (including a separate treatment of holidays) are of prime importance. This, however, may not be the right approach. As Nowotarski and Weron have recently shown, decomposing a series of electricity prices into a long-term seasonal and a stochastic component, modeling them independently and combining their forecasts can bring - contrary to a common belief - an accuracy gain compared to an approach in which a given model is calibrated to the prices themselves. In ''mid-term forecasting'', the daily patterns become less relevant and most EPF models work with average daily prices. However, the long-term trend-cycle component plays a crucial role. Its misspecification can introduce bias, which may lead to a bad estimate of the mean reversion level or of the price spike intensity and severity, and consequently, to underestimating the risk. Finally, in the ''long term'', when the time horizon is measured in years, the daily, weekly and even annual seasonality may be ignored, and long-term trends dominate. Adequate treatment - both in-sample and out-of-sample - of seasonality has not been given enough attention in the literature so far.


Variable selection

Another crucial issue in electricity price forecasting is the appropriate choice of explanatory variables. Apart from historical electricity prices, the current spot price is dependent on a large set of fundamental drivers, including system loads, weather variables, fuel costs, the reserve margin (i.e., available generation minus/over predicted demand) and information about scheduled maintenance and forced outages. Although "pure price" models are sometimes used for EPF, in the most common day-ahead forecasting scenario most authors select a combination of these fundamental drivers, based on the heuristics and experience of the forecaster. Very rarely has an automated selection or shrinkage procedure been carried out in EPF, especially for a large set of initial explanatory variables. However, the
machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
literature provides viable tools that can be broadly classified into two categories: * Feature or subset selection, which involves identifying a subset of predictors that we believe to be influential, then fitting a model on the reduced set of variables. * Shrinkage (also known as
regularization Regularization may refer to: * Regularization (linguistics) * Regularization (mathematics) * Regularization (physics) * Regularization (solid modeling) * Regularization Law, an Israeli law intended to retroactively legalize settlements See also ...
), that fits the full model with all predictors using an algorithm that shrinks the estimated coefficients towards zero, which can significantly reduce their variance. Depending on what type of shrinkage is performed, some of the coefficients may be shrunk to zero itself. As such, some shrinkage methods - like the
lasso A lasso or lazo ( or ), also called reata or la reata in Mexico, and in the United States riata or lariat (from Mexican Spanish lasso for roping cattle), is a loop of rope designed as a restraint to be thrown around a target and tightened when ...
- ''de facto'' perform variable selection. Some of these techniques have been utilized in the context of EPF: *
stepwise regression In statistics, stepwise regression is a method of fitting regression models in which the choice of predictive variables is carried out by an automatic procedure. In each step, a variable is considered for addition to or subtraction from the set of ...
, including single step elimination, *
Ridge regression Ridge regression (also known as Tikhonov regularization, named for Andrey Tikhonov) is a method of estimating the coefficients of multiple- regression models in scenarios where the independent variables are highly correlated. It has been used in m ...
, *
lasso A lasso or lazo ( or ), also called reata or la reata in Mexico, and in the United States riata or lariat (from Mexican Spanish lasso for roping cattle), is a loop of rope designed as a restraint to be thrown around a target and tightened when ...
, * and elastic nets, but their use is not common. Further development and employment of methods for selecting the most effective input variables - from among the past electricity prices, as well as the past and predicted values of the fundamental drivers - is needed.


Spike forecasting and the reserve margin

When predicting spike occurrences or spot price volatility, one of the most influential fundamental variables is the reserve margin, also called surplus generation. It relates the available capacity (generation, supply), C_t, to the demand (load), D_t, at a given moment in time t. The traditional engineering notion of the reserve margin defines it as the difference between the two, i.e., RM = C_t - D_t, but many authors prefer to work with dimensionless ratios \rho_t=D_t/C_t, R_t=C_t/D_t -1 or the so-called capacity utilization CU_t=1 - D_t/C_t. Its rare application in EPF can be justified only by the difficulty of obtaining good quality reserve margin data. Given that more and more system operators (see e.g. http://www.elexon.co.uk) are disclosing such information nowadays, reserve margin data should be playing a significant role in EPF in the near future.


Probabilistic forecasts

The use of
prediction interval In statistical inference, specifically predictive inference, a prediction interval is an estimate of an interval (statistics), interval in which a future observation will fall, with a certain probability, given what has already been observed. Pr ...
s (PI) and densities, or
probabilistic forecasting Probabilistic forecasting summarizes what is known about, or opinions about, future events. In contrast to single-valued forecasts (such as forecasting that the maximum temperature at a given site on a given day will be 23 degrees Celsius, or that t ...
, has become much more common over the past three decades, as practitioners have come to understand the limitations of point forecasts. Despite the bold move by the organizers of the Global Energy Forecasting Competition 2014 to require the participants to submit forecasts of the 99
percentile In statistics, a ''k''-th percentile, also known as percentile score or centile, is a score (e.g., a data point) a given percentage ''k'' of all scores in its frequency distribution exists ("exclusive" definition) or a score a given percentage ...
s of the predictive distribution (day-ahead in the price track) and not the point forecasts as in the 2012 edition, this does not seem to be a common case in EPF as yet. If PIs are computed at all, they usually are distribution-based (and approximated by the standard deviation of the model residuals) or empirical. A new forecast combination (see below) technique has been introduced recently in the context of EPF. Quantile Regression Averaging (QRA) involves applying
quantile regression Quantile regression is a type of regression analysis used in statistics and econometrics. Whereas the method of least squares estimates the conditional ''mean'' of the response variable across values of the predictor variables, quantile regress ...
to the point forecasts of a small number of individual forecasting models or experts, hence allows to leverage existing development of point forecasting.


Combining forecasts

Consensus forecast A consensus forecast is a prediction of the future created by combining several separate forecasts which have often been created using different methodologies. They are used in a number of sciences, ranging from econometrics to meteorology, and a ...
s, also known as ''combining forecasts'', ''forecast averaging'' or ''model averaging'' (in
econometrics Econometrics is an application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics", '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8 ...
and
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
) and '' committee machines'', '' ensemble averaging'' or ''expert aggregation'' (in
machine learning Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
), are predictions of the future that are created by combining several separate forecasts which have often been created using different methodologies. Despite their popularity in econometrics, averaged forecasts have not been used extensively in the context of
electricity market An electricity market is a system that enables the exchange of electrical energy, through an electrical grid. Historically, electricity has been primarily sold by companies that operate electric generators, and purchased by consumers or electr ...
s to date. There is some limited evidence on the adequacy of combining forecasts of electricity demand, but it was only very recently that combining was used in EPF and only for point forecasts. Combining probabilistic (i.e., interval and density) forecasts is much less popular, even in econometrics in general, mainly because of the increased complexity of the problem. Since Quantile Regression Averaging (QRA) allows to leverage existing development of point forecasting, it is particularly attractive from a practical point of view and may become a popular tool in EPF in the near future.


Multivariate factor models

The literature on forecasting daily electricity prices has concentrated largely on models that use only information at the aggregated (i.e., daily) level. On the other hand, the very rich body of literature on forecasting intra-day prices has used disaggregated data (i.e., hourly or half-hourly), but generally has not explored the complex dependence structure of the multivariate price series. If we want to explore the structure of intra-day electricity prices, we need to use dimension reduction methods; for instance, factor models with factors estimated as principal components (PC). Empirical evidence indicates that there are forecast improvements from incorporating disaggregated (i.e., hourly or zonal) data for predicting daily system prices, especially when the forecast horizon exceeds one week. With the increase of computational power, the real-time calibration of these complex models will become feasible and we may expect to see more EPF applications of the multivariate framework in the coming years.


A universal test ground

All major review publications conclude that there are problems with comparing the methods developed and used in the EPF literature. This is due mainly to the use of different datasets, different software implementations of the forecasting models and different error measures, but also to the lack of statistical rigor in many studies. This calls for a comprehensive, thorough study involving (i) the same datasets, (ii) the same robust error evaluation procedures, and (iii) statistical testing of the significance of one model's outperformance of another. To some extent, the Global Energy Forecasting Competition 2014 has addressed these issues. Yet more has to be done. A selection of the better-performing measures (weighted-MAE, seasonal MASE or RMSSE) should be used either exclusively or in conjunction with the more popular ones (MAPE, RMSE). The empirical results should be further tested for the significance of the differences in forecasting accuracies of the models.


See also

*
Energy forecasting Energy forecasting includes forecasting demand ( load) and price of electricity, fossil fuels (natural gas, oil, coal) and renewable energy sources (RES; hydro, wind, solar). Forecasting can be both expected price value and probabilistic forecasti ...
* Global Energy Forecasting Competitions


References

{{reflist Economic forecasting Regression with time series structure Artificial neural networks Energy economics Electricity markets