In
statistical quality control
Statistical process control (SPC) or statistical quality control (SQC) is the application of statistical methods to monitor and control the quality of a production process. This helps to ensure that the process operates efficiently, producing ...
, the CUsUM (or cumulative sum
control chart
Control charts is a graph used in production control to determine whether quality and manufacturing processes are being controlled under stable conditions. (ISO 7870-1)
The hourly status is arranged on the graph, and the occurrence of abnormalit ...
) is a
sequential analysis
In statistics, sequential analysis or sequential hypothesis testing is statistical analysis where the sample size is not fixed in advance. Instead data are evaluated as they are collected, and further sampling is stopped in accordance with a p ...
technique developed by E. S. Page of the
University of Cambridge
, mottoeng = Literal: From here, light and sacred draughts.
Non literal: From this place, we gain enlightenment and precious knowledge.
, established =
, other_name = The Chancellor, Masters and Schola ...
. It is typically used for monitoring
change detection
In statistical analysis, change detection or change point detection tries to identify times when the probability distribution of a stochastic process or time series changes. In general the problem concerns both detecting whether or not a chang ...
.
CUSUM was announced in
Biometrika
''Biometrika'' is a peer-reviewed scientific journal published by Oxford University Press for thBiometrika Trust The editor-in-chief is Paul Fearnhead ( Lancaster University). The principal focus of this journal is theoretical statistics. It was ...
, in 1954, a few years after the publication of
Wald's
sequential probability ratio test
The sequential probability ratio test (SPRT) is a specific sequential hypothesis test, developed by Abraham Wald and later proven to be optimal by Wald and Jacob Wolfowitz. Neyman and Pearson's 1933 result inspired Wald to reformulate it as a se ...
(SPRT).
E. S. Page referred to a "quality number"
, by which he meant a parameter of the
probability distribution
In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomeno ...
; for example, the
mean
There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value ( magnitude and sign) of a given data set.
For a data set, the '' ari ...
. He devised CUSUM as a method to determine changes in it, and proposed a criterion for deciding when to take corrective action. When the CUSUM method is applied to changes in mean, it can be used for
step detection of a
time series
In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. E ...
.
A few years later,
George Alfred Barnard
George Alfred Barnard (23 September 1915 – 9 August 2002) was a British statistician known particularly for his work on the foundations of statistics and on quality control.
Biography
George Barnard was born in Walthamstow, Lond ...
developed a visualization method, the V-mask chart, to detect both increases and decreases in
.
Method
As its name implies, CUSUM involves the calculation of a cumulative sum (which is what makes it "sequential"). Samples from a process
are assigned weights
, and summed as follows:
:
:
When the value of ''S'' exceeds a certain threshold value, a change in value has been found. The above formula only detects changes in the positive direction. When negative changes need to be found as well, the min operation should be used instead of the max operation, and this time a change has been found when the value of ''S'' is ''below'' the (negative) value of the threshold value.
Page did not explicitly say that
represents the
likelihood function
The likelihood function (often simply called the likelihood) represents the probability of random variable realizations conditional on particular values of the statistical parameters. Thus, when evaluated on a given sample, the likelihood funct ...
, but this is common usage.
Note that this differs from SPRT by always using zero function as the lower "holding barrier" rather than a lower "holding barrier".
[ Also, CUSUM does not require the use of the likelihood function.
As a means of assessing CUSUM's performance, Page defined the ''average run length'' (A.R.L.) ]metric
Metric or metrical may refer to:
* Metric system, an internationally adopted decimal system of measurement
* An adjective indicating relation to measurement in general, or a noun describing a specific type of measurement
Mathematics
In mathem ...
; "the expected number of articles sampled before action is taken." He further wrote:[
]
When the quality of the output is satisfactory the A.R.L. is a measure of the expense incurred by the scheme when it gives false alarms, i.e., Type I error
In statistical hypothesis testing, a type I error is the mistaken rejection of an actually true null hypothesis (also known as a "false positive" finding or conclusion; example: "an innocent person is convicted"), while a type II error is the f ...
s ( Neyman & Pearson, 1936). On the other hand, for constant poor quality the A.R.L. measures the delay and thus the amount of scrap produced before the rectifying action is taken, i.e., Type II error
In statistical hypothesis testing, a type I error is the mistaken rejection of an actually true null hypothesis (also known as a "false positive" finding or conclusion; example: "an innocent person is convicted"), while a type II error is the f ...
s.
Example
The following example shows 20 observations of a process with a mean of 0 and a standard deviation of 0.5.
From the column, it can be seen that never deviates by 3 standard deviations (), so simply alerting on a high deviation will not detect a failure, whereas CUSUM shows that the value exceeds 4 at the 17th observation.
where is a critical level parameter (tunable, same as threshold T) that's used to adjust the sensitivity of change detection: larger makes CUSUM less sensitive to the change and vice versa.
Variants
Cumulative observed-minus-expected plots are a related method.
References
Further reading
*
* Mishra, S., Vanli, O. A., & Park, C (2015)
"A Multivariate Cumulative Sum Method for Continuous Damage Monitoring with Lamb-wave Sensors"
''International Journal of Prognostics and Health Management'',
External links
{{DEFAULTSORT:Cusum
Statistical charts and diagrams
Quality control tools
Sequential methods