In statistics, the mean signed difference (MSD), also known as mean signed deviation and mean signed error, is a sample

statistic A statistic (singular) or sample statistic is any quantity computed from values in a sample which is considered for a statistical purpose. Statistical purposes include estimating a population parameter, describing a sample, or evaluating a hy ...

that summarises how well a set of estimates

\hat_i

match the quantities

\theta_i

that they are supposed to estimate. It is one of a number of statistics that can be used to assess an estimation procedure, and it would often be used in conjunction with a sample version of the mean square error. For example, suppose a

linear regression In statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is ...

model has been estimated over a sample of data, and is then used to extrapolate predictions of the

dependent variable Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or dema ...

out of sample after the out-of-sample data points have become available. Then

\theta_i

would be the ''i''-th out-of-sample value of the dependent variable, and

\hat_i

would be its predicted value. The mean signed deviation is the average value of

\hat_i-\theta_i.

Definition

The mean signed difference is derived from a set of ''n'' pairs,

( \hat_i,\theta_i)

, where

\hat_i

is an estimate of the parameter

\theta

in a case where it is known that

\theta=\theta_i

. In many applications, all the quantities

\theta_i

will share a common value. When applied to

forecasting Forecasting is the process of making predictions based on past and present data. Later these can be compared (resolved) against what happens. For example, a company might estimate their revenue in the next year, then compare it against the actual ...

in a

time series analysis In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...

context, a forecasting procedure might be evaluated using the mean signed difference, with

\hat_i

being the predicted value of a series at a given

lead time A lead time is the latency between the initiation and completion of a process. For example, the lead time between the placement of an order and delivery of new cars by a given manufacturer might be between 2 weeks and 6 months, depending on vari ...

and

\theta_i

being the value of the series eventually observed for that time-point. The mean signed difference is defined to be :

\operatorname(\hat) = \frac\sum^_ \hat - \theta_ .

Definition

See also