HOME

TheInfoList



OR:

In
applied statistics Statistics (from German: ''Statistik'', "description of a state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, industria ...
, a partial regression plot attempts to show the effect of adding another variable to a model that already has one or more independent variables. Partial regression plots are also referred to as added variable plots, adjusted variable plots, and individual coefficient plots. When performing a
linear regression In statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is ...
with a single
independent variable Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or deman ...
, a
scatter plot A scatter plot (also called a scatterplot, scatter graph, scatter chart, scattergram, or scatter diagram) is a type of plot or mathematical diagram using Cartesian coordinates to display values for typically two variables for a set of data. ...
of the response variable against the independent variable provides a good indication of the nature of the relationship. If there is more than one independent variable, things become more complicated. Although it can still be useful to generate scatter plots of the response variable against each of the independent variables, this does not take into account the effect of the other independent variables in the model.


Calculation

Partial regression plots are formed by: #Computing the residuals of regressing the response variable against the independent variables but omitting ''X''i #Computing the residuals from regressing ''X''i against the remaining independent variables #Plotting the residuals from (1) against the residuals from (2). Velleman and Welsch express this mathematically as: : Y_\mathrmX_ where :''Y''/sub> = residuals from regressing Y (the response variable) against all the independent variables except Xi :''X''i• /sub> = residuals from regressing ''X''i against the remaining independent variables.


Properties

Velleman and Welsch list the following useful properties for this plot: #The least squares linear fit to this plot has the slope \beta_ and intercept zero. #The residuals from the least squares linear fit to this plot are identical to the residuals from the least squares fit of the original model (Y against all the independent variables including Xi). #The influences of individual data values on the estimation of a coefficient are easy to see in this plot. #It is easy to see many kinds of failures of the model or violations of the underlying assumptions (nonlinearity,
heteroscedasticity In statistics, a sequence (or a vector) of random variables is homoscedastic () if all its random variables have the same finite variance. This is also known as homogeneity of variance. The complementary notion is called heteroscedasticity. The s ...
, unusual patterns). . Partial regression plots are related to, but distinct from, partial residual plots. Partial regression plots are most commonly used to identify data points with high leverage and influential data points that might not have high leverage. Partial residual plots are most commonly used to identify the nature of the relationship between ''Y'' and ''X''i (given the effect of the other independent variables in the model). Note that since the simple correlation between the two sets of residuals plotted is equal to the
partial correlation In probability theory and statistics, partial correlation measures the degree of association between two random variables, with the effect of a set of controlling random variables removed. When determining the numerical relationship between two ...
between the response variable and ''X''i, partial regression plots will show the correct strength of the linear relationship between the response variable and ''X''i. This is not true for partial residual plots. On the other hand, for the partial regression plot, the x-axis is not ''X''i. This limits its usefulness in determining the need for a transformation (which is the primary purpose of the partial residual plot).


See also

* Partial residual plot * Partial leverage plot * Variance inflation factor for a multi-linear fit.


References


Further reading

* * * * *


External links


Partial Regression Plot
{{NIST-PD Statistical charts and diagrams Regression diagnostics