A Bland–Altman plot (difference plot) in

analytical chemistry Analytical chemistry studies and uses instruments and methods to separate, identify, and quantify matter. In practice, separation, identification or quantification may constitute the entire analysis or be combined with another method. Separati ...

or biomedicine is a method of data plotting used in analyzing the agreement between two different

assay An assay is an investigative (analytic) procedure in laboratory medicine, mining, pharmacology, environmental biology and molecular biology for qualitatively assessing or quantitatively measuring the presence, amount, or functional activity of ...

s. It is identical to a Tukey mean-difference plot, the name by which it is known in other fields, but was popularised in

medical statistics Medical statistics deals with applications of statistics to medicine and the health sciences, including epidemiology, public health, forensic medicine, and clinical research. Medical statistics has been a recognized branch of statistics in the ...

J. Martin Bland John Martin Bland (born 6 March 1947), known as Martin Bland, is a British statistician. He has been professor of health statistics at the University of York since 2003.

and Douglas G. Altman.

Agreement versus correlation

Bland and Altman drive the point that any two methods that are designed to measure the same parameter (or property) should have good

correlation In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statisti ...

when a set of samples are chosen such that the property to be determined varies considerably. A high correlation for any two methods designed to measure the same property could thus in itself just be a sign that one has chosen a widespread sample. A high correlation does not necessarily imply that there is good agreement between the two methods.

Construction

Consider a sample consisting of

n

observations (for example, objects of unknown volume). Both assays (for example, different methods of volume measurement) are performed on each sample, resulting in

2n

data points. Each of the

n

samples is then represented on the graph by assigning the

mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value ( magnitude and sign) of a given data set. For a data set, the '' ari ...

of the two measurements as the

x

-value, and the difference between the two values as the

y

-value. The

Cartesian coordinates A Cartesian coordinate system (, ) in a plane is a coordinate system that specifies each point uniquely by a pair of numerical coordinates, which are the signed distances to the point from two fixed perpendicular oriented lines, measured i ...

of a given sample

S

with values of

S_1

and

S_2

determined by the two assays is :

S(x,y)=\left( \frac, S_1-S_2 \right).

For comparing the dissimilarities between the two sets of samples independently from their mean values, it is more appropriate to look at the ratio of the pairs of measurements. Log transformation (base 2) of the measurements before the analysis will enable the standard approach to be used; so the plot will be given by the following equation: :

S(x,y)=\left( \frac, \log_2 S_1 -\log_2 S_2 \right).

This version of the plot is used in MA plot.

Application

One primary application of the Bland–Altman plot is to compare two clinical measurements each of which produced some error in their measures. It can also be used to compare a new measurement technique or method with a

gold standard A gold standard is a Backed currency, monetary system in which the standard economics, economic unit of account is based on a fixed quantity of gold. The gold standard was the basis for the international monetary system from the 1870s to the ...

, as even a gold standard does not—and should not—imply it to be without error. See

Analyse-it Analyse-it is a statistical analysis add-in for Microsoft Excel. Analyse-it is the successor to Astute, developed in 1992 for Excel 4 and the first statistical analysis add-in for Microsoft Excel. Analyse-it provides a range of standard parametric ...

, MedCalc, NCSS, GraphPad Prism, R, or

StatsDirect StatsDirect is a statistical software package designed for biomedical, public health, and general health science uses. The second generation of the software was reviewed in general medical and public health journals. Features and use StatsDire ...

for software providing Bland–Altman plots. Bland–Altman plots are extensively used to evaluate the agreement among two different instruments or two measurements techniques. Bland–Altman plots allow identification of any systematic difference between the measurements (i.e., fixed bias) or possible

outlier In statistics, an outlier is a data point that differs significantly from other observations. An outlier may be due to a variability in the measurement, an indication of novel data, or it may be the result of experimental error; the latter are ...

s. The mean difference is the estimated bias, and the SD of the differences measures the random fluctuations around this mean. If the mean value of the difference differs significantly from 0 on the basis of a 1-sample

t-test A ''t''-test is any statistical hypothesis test in which the test statistic follows a Student's ''t''-distribution under the null hypothesis. It is most commonly applied when the test statistic would follow a normal distribution if the value of a ...

, this indicates the presence of fixed bias. If there is a consistent bias, it can be adjusted for by subtracting the mean difference from the new method. It is common to compute 95%

limits of agreement In statistics, inter-rater reliability (also called by various similar names, such as inter-rater agreement, inter-rater concordance, inter-observer reliability, inter-coder reliability, and so on) is the degree of agreement among independent obse ...

for each comparison (average difference ± 1.96 standard deviation of the difference), which tells us how far apart measurements by two methods were more likely to be for most individuals. If the differences within mean ± 1.96 SD are not clinically important, the two methods may be used interchangeably. The 95% limits of agreement can be unreliable estimates of the population parameters especially for small sample sizes so, when comparing methods or assessing repeatability, it is important to calculate confidence intervals for 95% limits of agreement. This can be done by Bland and Altman's approximate method or by more precise methods. Bland–Altman plots were also used to investigate any possible relationship of the discrepancies between the measurements and the true value (i.e., proportional bias). The existence of proportional bias indicates that the methods do not agree equally through the range of measurements (i.e., the limits of agreement will depend on the actual measurement). To evaluate this relationship formally, the difference between the methods should be regressed on the average of the 2 methods. When a relationship between the differences and the true value was identified (i.e., a significant slope of the regression line), regression-based 95% limits of agreement should be provided.

Notes

A similar method was proposed in 1981 by Eksborg.Eksborg S (1981) Evaluation of method-comparison data. Clin Chem 27:1311–1312 This method was based on Deming regression—a method introduced by Adcock in 1878. Bland and Altman's Lancet paper was number 29 in a list of the top 100 most-cited papers of all time with over 23,000 citations.

References

{{DEFAULTSORT:Bland-Altman plot Analytical chemistry Statistical charts and diagrams Medical statistics

Agreement versus correlation

Construction

Application

See also

Notes

References