''Do''-calculus is a set of mathematical rules devised by
Judea Pearl in 1995 to determine whether
causal effects can be identified from observational data under specific assumptions encoded in a
causal graph. It provides a systematic method for transforming expressions involving the ''do''-operator (representing interventions) into expressions involving only observable probabilities, enabling the identification of causal relationships.
Definition and purpose
Causal queries involving interventions (e.g.,
) are considered ''identifiable'' if they can be expressed using observational data alone, independent of unmeasured parameters. The ''do''-calculus achieves this by leveraging graphical criteria from
directed acyclic graphs (DAGs) to remove ''do''-operators through algebraic manipulations.
The three rules of ''Do''-calculus
The rules apply to a causal graph
and assume the
Markov condition holds:
Rule 1: Insertion/deletion of observations
:
This rule allows the removal of irrelevant observations (
) if they are ''d''-separated from
given
and
in the graph where incoming edges to
are removed.
Rule 2: Action/observation exchange
:
This rule permits replacing an intervention (
) with an observation (
) if
and
are *d*-separated in the graph where outgoing edges from
are removed.
Rule 3: Insertion/deletion of interventions
:
This rule removes irrelevant interventions (
) if
and
are ''d''-separated in a graph modified to block paths through
.
Applications
''Do''-calculus can be applied to various domains within
causal inference
Causal inference is the process of determining the independent, actual effect of a particular phenomenon that is a component of a larger system. The main difference between causal inference and inference of association is that causal inference an ...
such as
mediation analysis in decomposing direct and indirect effects.
It can be used for meta-synthesis to combine the results from heterogeneous studies.
Completeness
The ''do''-calculus is considered
complete: if repeated application of the rules cannot eliminate the ''do''-operator, the causal effect is not identifiable. This result was formalized in 2006 by Huang, Valtorta, Shpitser, and Pearl.
Criticism
Critics have pointed out that other frameworks, such as
structural equation modeling (SEM) or
Bayesian networks
A Bayesian network (also known as a Bayes network, Bayes net, belief network, or decision network) is a probabilistic graphical model that represents a set of variables and their Conditional dependence, conditional dependencies via a directed a ...
, may offer more intuitive approaches to causal inference for certain applications. These methods often emphasize parameter estimation rather than identifiability, which can be more relevant for applied research.
References
{{Authority control
Statistics
Inference
Causal inference