Potential Outcomes Framework: Causality

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

Causality is linked to a modification that is applied to a unit.

True (A)

A unit can only be an individual person in the potential outcomes framework.

False (B)

A unit can be exposed to multiple alternative actions simultaneously in the potential outcomes framework.

False (B)

The potential part in 'potential outcomes' conveys the idea that multiple outcomes will be realized after the intervention.

False (B) Signup and view all the answers

Referring to Schrödinger's cat, a person simultaneously has and does not have Medicaid coverage.

False (B) Signup and view all the answers

The 'observed' outcome for a unit `i` is denoted as $D_i$ in causal inference.

False (B) Signup and view all the answers

If a unit i receives treatment, identified as $D_i = 1$, then $Y_i^0$ represents the observed outcome.

False (B) Signup and view all the answers

The causal effect of a treatment is evaluated without comparing potential outcomes.

False (B) Signup and view all the answers

The definition of causal effects depends only on the actual treatment.

False (B) Signup and view all the answers

The fundamental problem of causal inference is that we only observe all potential outcomes for each unit.

False (B) Signup and view all the answers

The counterfactual outcome refers to the outcome that is observed under treatment.

False (B) Signup and view all the answers

The same individual receiving treatment at two different times is considered the same unit in causal inference.

False (B) Signup and view all the answers

In the notation for potential outcomes, $Y_i$ represents the potential outcome for the same unit i under both treatment and control conditions.

False (B) Signup and view all the answers

Solving the causal inference problem with multiple units involves making predictions without using information from other units.

False (B) Signup and view all the answers

To predict the counterfactual, we need to compare dissimilar units and situations.

False (B) Signup and view all the answers

The average observed outcomes represent a causal effect without considering assignment bias.

False (B) Signup and view all the answers

If the outcome for the treated when they are not treated is the same as the outcome for the control units when not treated, then a simple comparison provides an estimate of average treatment effects.

True (A) Signup and view all the answers

Treatment assignment does not need to be independent of potential outcomes for identifying causal effects.

False (B) Signup and view all the answers

If each group provides a counterfactual of the other, it matters which group received the treatment when assessing causal effects.

False (B) Signup and view all the answers

Conditional independence is statistically unrelated to regression models.

False (B) Signup and view all the answers

Treating patients is conditional independence does not require understanding causality.

False (B) Signup and view all the answers

In simple randomization, treatment assignment is conditional on the value of one or more covariates.

False (B) Signup and view all the answers

In observational studies, understanding why some units ended up receiving treatment is not important.

False (B) Signup and view all the answers

Simple randomization means that both forms of assignment bias are always present.

False (B) Signup and view all the answers

When the control group provides an unbiased prediction as to what would have happened to the treated group if not treated, it is a poor prediction.

False (B) Signup and view all the answers

With conditional randomization, it is never necessary to condition on the variables that influenced treatment assignment.

False (B) Signup and view all the answers

Causal inference problems are solved without multiple units.

False (B) Signup and view all the answers

It is possible to estimate individual effects without further assumptions.

False (B) Signup and view all the answers

Under randomization, ATE equals ATET equals ATEC because the selection bias is always present.

False (B) Signup and view all the answers

Statistical adjustment is a way obtain causal effect.

True (A) Signup and view all the answers

The naive difference in expected values can be rewritten as $E[Y_i^0|D_i = 1] – E[Y_i^0|D_i = 1] + E[Y_i^1|D_i = 1] – E[Y_i^0|D_i = 0]$

False (B) Signup and view all the answers

$E[Y_i^1|D_i = 1] – E[Y_i^0|D_i = 1]$ is the definition of the average treatment effect on the treated (ATET)

True (A) Signup and view all the answers

If we know and observe covariates that affect selection intro treatment and outcome (confounders), we can extend this condition to be $E[Y_{0i}|X_i, D_i = 1] = E[Y_i|X_i, D_i = 0]$

True (A) Signup and view all the answers

The interacted model is never related to the estimates from the stratified analysis.

False (B) Signup and view all the answers

Regression adjustment does not need another assumption.

False (B) Signup and view all the answers

The potential outcomes for any unit can vary with the treatment assigned to other units

False (B) Signup and view all the answers

The definition of statistical effects does rely on theory

False (B) Signup and view all the answers

External validity refers to the extent to which a study can provide (identify) causal effects

False (B) Signup and view all the answers

Clinical trials have a strong internal validity but a poor external validity

True (A) Signup and view all the answers

Propensity score is equal to $P(D_i = 1|X_i)$

True (A) Signup and view all the answers

Flashcards

Potential outcome

The idea that only one outcome is observed after an intervention; the other is only potential.

Causal effect

A contrast between potential outcomes for a unit when treated versus not treated.

Fundamental problem of causal inference

The challenge of not observing both potential outcomes simultaneously.

Counterfactual outcome

An outcome that is not observed or realized.

Signup and view all the flashcards

Multiple units

The process of using multiple units to predict outcomes and solve causal inference problems.

Signup and view all the flashcards

Assignment bias

This occurs when the treatment assignment is not independent of potential outcomes, leading to biased causal effect estimates.

Signup and view all the flashcards

Association is causation

When treatment assignment is independent of potential outcomes.

Signup and view all the flashcards

Conditional independence

When treatment assignment is independent from potential outcomes given observed covariates.

Signup and view all the flashcards

Simple randomization

Units are randomly divided into groups, making assignment bias as close to zero as possible

Signup and view all the flashcards

Average Treatment Effect on the Treated (ATET)

Most common treatment effect; The effect of treatment on those who received the treatment

Signup and view all the flashcards

Conditional randomization

This occurs when treatment assignment is conditional on the value of one or more covariates.

Signup and view all the flashcards

Heterogeneous effects

Treatment impacts some individuals positively and some negatively.

Signup and view all the flashcards

Heterogeneous treatment effects

treatment effects using a model above adds a structure

Signup and view all the flashcards

Statistical adjustment

Causal inference model with observational data using regression.

Signup and view all the flashcards

Fundamental assumption for statistical adjustment

The assumption that the assignment of units to experimental groups is independent of potential outcomes when conditioning on Xn.

Signup and view all the flashcards

Confounder

A variable correlating with both dependent and independent variables in a model.

Signup and view all the flashcards

Moderator Variable

A variable/factor that could change the treatment effect

Signup and view all the flashcards

Mediator Variable

A variable/factor that explains a treatment effect

Signup and view all the flashcards

Overlap

Ensuring units exist in both treatment/control groups

Signup and view all the flashcards

Stable Unit Treatment Value Assumption (SUTVA)

Potential outcomes for any unit don't vary with treatments.

Signup and view all the flashcards

Internal validity

The extent to which a study can provide causal effects

Signup and view all the flashcards

External validity

The extent to which study results can be generalized.

Signup and view all the flashcards

Study Notes

The Potential Outcomes Framework

Examines potential outcomes and counterfactuals to define unit-specific causal effects.
Addresses the fundamental problem of causal inference across multiple units.
Explores solutions to the fundamental problem through simple and conditional randomization.
Covers ignorability of treatment assignment assumption, observational data, and strong ignorability.
Discusses the Stable Unit Treatment Value Assumption (SUTVA).
Focuses estimating treatment effects: (ATE), average treatment effect on the treated (ATET), and ATE conditional (ATEC).
Presents the importance of conceptual frameworks in causality studies.

Big Picture of Framework for Understanding Causality

Aims to review the basic framework for understanding causality, with the goal of helping create frameworks that mathematical notation can be applied to.
It aids in understanding when a measure of association can lead to a causal effect.
Allows the separation of research design from estimation.
It will cover ways to estimating causal effects that follow from the definition.

Basic Concepts in Framework

Causality is linked to a manipulation, such as a treatment, intervention, action, or strategy applied to a unit.
A unit can be a person, firm, hospital, nursing home, country, county, classroom, etc.
It refers to the entity that receives the action or is manipulated.
For simplicity, scenarios often involve only two possibilities: receiving or not receiving treatment (treatment vs. control unit).
The ideas extend to multilevel treatments, but the notation becomes more complex.
A unit’s treatment status is linked to a potential outcome.

Potential Outcomes for Units

The concept refers to the idea that only one outcome is realized after an intervention.
Dictionary definition: Potential refers to having or showing the capacity to become or develop into something in the future.
Before the intervention, there are two potential outcomes.
For example, a person may or may not receive Medicaid coverage, with the outcome being income or health status one year after the program, which can be binary.
Each person will either get medicaid or not.

Notation Used in Framework

The observed or realized outcome for a unit i is Y;
The treatment received by a unit i is D,, which can be 0 or 1, so D, ∈ {0,1}
There are two potential outcomes: Y¹ if the unit received treatment (D; = 1) and Y if not (D; = 0)
The index i helps keep track of the data structure.
Other common notations include W for treatment, used also in the Stata manual, and tother alternative notations for potential outcomes such as Yoi or Y,(0) which Stata uses yo etc.

Definition of Causal Effects

The effect of receiving treatment for a unit i involves a comparison of potential outcomes.
This can be done by comparing Y¹ - Yº or measuring the effect as a relative measure, such as Y¹/Yº.
The definition of causal effects does not depend on action taken.
Causal effect of receiving Medicare coverage for person A can be defined as Y¹ - Y°.

Further Discussion of Causal Effects

Comparison can take many forms depending on how the outcome is measured.
The outcome can be defined as the probability of being depression-free, with the causal effect being Pr(Y¹) – Pr(Yº), which is a marginal effect.
The odds-ratio method can be used, where you dividethe probability of an event occurring by the probability of it not occurring: Pr(Y) / 1-Pr(Y)
The way we make comparisons is related to how some statistical models measure those comparisons.

Real-World Examples of Causal Effects

Thinking about causal effects matches everyday thinking.
The causal effect of studying hard for class is the comparison of two potential outcomes: grade if studying hard versus grade if not.
Many single time-travel movies utilize the concept of one action causing an event to happen, and using this to "fix" the cause and future outcomes.

The Fundamental Problem of Causal Inference

It is that we do not observe both potential outcomes.
This is one of the biggest issues in program evaluation.
The non-observable or not-realized outcome is called the counterfactual outcome.
The understanding of causal effects is now called the Rubin causal model or the Neyman-Rubin causal model of causality.

Deeper Look in Causal Inference

The main issue is there is no way to know the counterfactual.
The same unit receiving a treatment at a different time is effectively a different unit.
A conceptual framework and story is also needed to find causal effects in situations like allergy testing.

Counterfactuals

Going back to our notation, we can write Y₁ = Y° + (Y – Y)D;
Is D, = 1 then the observed outcome is Y₁ = Y¹.
If D, = 0 then the observed outcome is Y₁ = Y
In this case, it is understood that a unit either receives the treatment or not, never both.
We can now re-frame the causal inference problem as a prediction problem.

Using Multiple Units for Causal Inference

Causal inference problems can be solved with predicting outcomes with multiple units.
Doing this in everyday situations involves using other units.
It can also involve the same individual changing the outcome at different times to figure out methods that work.
To estimate the counterfactual, we must ensure something: comparing similar units and situations.

Comparisons and Assignment Bias

With multiple units, some receive a binary treatment (D₁ = 1 if treated) and a continuous outcome is observed.
The average observed outcomes can then be compared through calculating conditional expectation of: δₙₐᵢᵥₑ = E[Y¡|D₁ = 1] – E[Y¡|D₁ = 0]
Bias needs to be identified in order to find causal effects.
Understanding assignment mechanism is key figuring out casual effects.

Association versus Causation

To interpret comparisons (δₙₐᵢᵥₑ = E[Y¡|D₁ = 1] – E[Y¡|D₁ = 0]), the bias must be zero in order to have a causal effect.
If the outcome for the treated when they are not treated [counterfactual] is the same as the observed outcome for the control units when they are not treated, then a simple comparison of average outcomes can provide an estimate of average treatment effects.
To identify causal effects, the method of treatment assignment must be independent of potential outcomes.

Conditioning & Flip Side of Assignment Bias

One form of assignment bias is: δₙₐᵢᵥₑ = δₐₜ𝒸 + E[Y;|D₁ = 1] – E[Y;|D;}\D₁ = 0]
With both forms of assignment bias it follows that: δₙₐᵢᵥₑ = δₐₜ𝒸 = δₐₜₜ = δₐₜ𝒸
In other words, each group provides a counterfactual.

Independence

This is a refresher on statistical independence.
Two random variables are independent if and only if: fx.y(x,y) = fx(x)fy(y).
For discrete random variables: P(X = x, Y = y) = P(X = x)P(Y = y)
In terms of events: P(A∩B) = P(A)P(B) (their joint probability is the same as the multiplication of the individual probabilities).
Conditional probability is: P(A\B) = P(A∩B) / P(B) = P(A)P(B) / P(B) = P(A)

Conditional Independence

Conditional independence is important regression models and conditional independence assumption.
Events A and B are conditionally independent if P(A∩B|Z) = P(A|Z)P(B|Z)
In other words, knowing B does not tell us about P(A) given Z.

For Causality, Key Understanding is Conditional Independence

If you randomly assign patients to treatment based on severity of illness.
and people with severe illness are more likely to receive treatment, and age A is related to the illness.
Therefore, D and A are not independent but D and A are conditionally independent given Z where P(D\Z, A) = P(D|Z)
once we know the severity of the illness, Z, age A doesn't tell anything about D.

Two Experimental Models

In simple randomization units are randomly divided into groups.
Then in conditional randomization treatment assignment is conditional.
Aim is to the study being as conditional randomization as possible, and understanding what influences treatment.
This involves designing observational studies that approximate conditional randomization as closely as possible.
Emphasizes that the key is understanding why some units ended up receiving treatment.

Further Elaborating Ways to Solve Causal Inference

a) Simple randomization: Control groups are best when they have no assignment bias.
After the manipulation in one group, the other provides the counterfactual.
Or precisely the potential outcomes independence from particular group assignments.
Groups are "exchangeable" so which group receives treatment doesn't matter.

Randomization and Independence

The observed can come form the counterfactual through: E[Y;|D₁ = 1] – E[Y;|D₁ = 0] = E[Y₁i|D₁ = 1] – E[Yoi|D₁ = 0]
If causal inference is a prediction problem, randomization solves it.

Even More Info on Solving Causal Inference

With conditional randomization "condition" on the variables that influence treatment assignment.
So these need to be true: E[Y;|D; = 0, Z = 1] = E[Y°|D; = 1, Z = 1] and E[Y; D; = 0, Z = 0] = E[Y°|D; = 1, Z = 0]

Average Treatment Versus Individual Effects

Only average treatment effects are obtained from the causal with multiple units.
Must assume that the effects are average to estimate individual effects, and that they aren't heterogeneous.

Treatment Effects Described: ATE, ATET, LATE

The equation is: E[Y;|D; = 1] – E[Y;|D; = 0] = E[Y1;|D; = 1] – E[Yo¡|D; = 1] + E[Yo;|D; = 1] – E[Yo¡|D; = 0]
Observed/naive average treatment effect = Average treatment effect on the treated + selection bias 1
Observed/naive average treatment effect = Average treatment effect on the control + selection bias 2
Under randomization, ATE (average treatment effect) = ATET (average treatment effect on the treated) = ATEC (average treatment effect, conditional)

More Ways Of Solving Causal Inference

b) Statistical adjustment: The provides intuition with (some form of) regression or other forms of adjustment.
We can experiment with randomization through severty classified patients 80% rate of treatment for severe, 20% rate of treatment if severity is low.
So a severity score can used with regression.

Conditional Randomization with Regression

The selection bias is not zero when the treated have different expected outcomes with E[Yo¡|D; = 1] ≠ E[Y;|D; = 0]
Model = Y; = βο + β₁ D; + €¡ so severity Z is part of e.
A simple fix is to run model like Y; = Yo + Y1 D¡ + Y3Z; + n; and run the other versions assuming the model is correct.

Fundamental Assumption for Statistical Adjustment

In order to justify statistical adjustment.
Observational variables must be present for the assignment of units to experimental groups to be independent.
This also can be looked at with ignorability: with the inability to control confounders meaning correlation doesn't mean causality, with the assumption that no unmeasured confounders are present.

Confounders

A variable in a statistical model that correlates with dependent variable and independent variable.
Another way: a confounder predicts both a covariate and outcome
Though antidepressants can be, caution must also be taken as all confounders aren't created equal.

Definitions for Moderator, Mediator, Confounder

Moderator: A variable/factor that could change the treatment effect.
Mediator: A variable factor that explains effects and the mechanism relating to the variable's connection between the two variables. Also called a "pacifier"
Without a framework of mediator, it is hard to truly discern relationships.

Assumptions

To determine biases, comparing what variables are affected and knowing their relationship to treatment and outcome is needed.
More nuance is also needed especially in instances when covariates are used.

Heterogeneous Treatment Effect

The model for treatment effect: Y; = βο + β₁ D; + β2Z; + €¡
Stratified models are needed to compare severty as 1 for different treatment, zero in other cases.
Another model for this is Y¡ = Yo + Y1 D; + Y2Z; + ¥3D¡ × Z¡ + Ni
We need this extended definition to better understand what variables cause the treatment.

Regression Adjustment Requires Another Assumption: Overlap

The group has been actually need another assumption that, has oversight, which also helps with causal effects by reading with observational data
If the data only includes people of either extreme, it will have an affect.
For the assumption, the overlap should have no restrictions. All are required, also the covariate needs to a defined definition from its proper city area. This includes a certain score.

Stable Unit Treatment Value Assumption (SUTVA)

This assumes that the outcome of the treatment will have no interference.
This is needed when it is shown to be related more.
It is important to consider that all assumptions can have exclusions.

Other Important Concepts

Internal validy helps determine how well its effects can be.
This could include some sort of instrumental design.
So the pop up, might be hard to hard with difficulty but always with the valid assessment.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Potential Outcomes Framework: Causality

Choose a study mode

Podcast

Questions and Answers

Causality is linked to a modification that is applied to a unit.

A unit can only be an individual person in the potential outcomes framework.

A unit can be exposed to multiple alternative actions simultaneously in the potential outcomes framework.

The potential part in 'potential outcomes' conveys the idea that multiple outcomes will be realized after the intervention.

Referring to Schrödinger's cat, a person simultaneously has and does not have Medicaid coverage.

The 'observed' outcome for a unit i is denoted as $D_i$ in causal inference.

If a unit i receives treatment, identified as $D_i = 1$, then $Y_i^0$ represents the observed outcome.

The causal effect of a treatment is evaluated without comparing potential outcomes.

The definition of causal effects depends only on the actual treatment.

The fundamental problem of causal inference is that we only observe all potential outcomes for each unit.

The counterfactual outcome refers to the outcome that is observed under treatment.

The same individual receiving treatment at two different times is considered the same unit in causal inference.

In the notation for potential outcomes, $Y_i$ represents the potential outcome for the same unit i under both treatment and control conditions.

Solving the causal inference problem with multiple units involves making predictions without using information from other units.

To predict the counterfactual, we need to compare dissimilar units and situations.

The average observed outcomes represent a causal effect without considering assignment bias.

If the outcome for the treated when they are not treated is the same as the outcome for the control units when not treated, then a simple comparison provides an estimate of average treatment effects.

Treatment assignment does not need to be independent of potential outcomes for identifying causal effects.

If each group provides a counterfactual of the other, it matters which group received the treatment when assessing causal effects.

Conditional independence is statistically unrelated to regression models.

Treating patients is conditional independence does not require understanding causality.

In simple randomization, treatment assignment is conditional on the value of one or more covariates.

In observational studies, understanding why some units ended up receiving treatment is not important.

Simple randomization means that both forms of assignment bias are always present.

When the control group provides an unbiased prediction as to what would have happened to the treated group if not treated, it is a poor prediction.

With conditional randomization, it is never necessary to condition on the variables that influenced treatment assignment.

Causal inference problems are solved without multiple units.

It is possible to estimate individual effects without further assumptions.

Under randomization, ATE equals ATET equals ATEC because the selection bias is always present.

Statistical adjustment is a way obtain causal effect.

The naive difference in expected values can be rewritten as $E[Y_i^0|D_i = 1] – E[Y_i^0|D_i = 1] + E[Y_i^1|D_i = 1] – E[Y_i^0|D_i = 0]$

$E[Y_i^1|D_i = 1] – E[Y_i^0|D_i = 1]$ is the definition of the average treatment effect on the treated (ATET)

If we know and observe covariates that affect selection intro treatment and outcome (confounders), we can extend this condition to be $E[Y_{0i}|X_i, D_i = 1] = E[Y_i|X_i, D_i = 0]$

The interacted model is never related to the estimates from the stratified analysis.

Regression adjustment does not need another assumption.

The potential outcomes for any unit can vary with the treatment assigned to other units

The definition of statistical effects does rely on theory

External validity refers to the extent to which a study can provide (identify) causal effects

Clinical trials have a strong internal validity but a poor external validity

Propensity score is equal to $P(D_i = 1|X_i)$

Flashcards

Potential outcome

Causal effect

Fundamental problem of causal inference

Counterfactual outcome

Multiple units

Assignment bias

Association is causation

Conditional independence

Simple randomization

Average Treatment Effect on the Treated (ATET)

Conditional randomization

Heterogeneous effects

Heterogeneous treatment effects

Statistical adjustment

Fundamental assumption for statistical adjustment

Confounder

Moderator Variable

Mediator Variable

Overlap

Stable Unit Treatment Value Assumption (SUTVA)

Internal validity

External validity

Study Notes

The Potential Outcomes Framework

Big Picture of Framework for Understanding Causality

Basic Concepts in Framework

Potential Outcomes for Units

Notation Used in Framework

Definition of Causal Effects

Further Discussion of Causal Effects

Real-World Examples of Causal Effects

The Fundamental Problem of Causal Inference

Deeper Look in Causal Inference

Counterfactuals

Using Multiple Units for Causal Inference

The 'observed' outcome for a unit `i` is denoted as $D_i$ in causal inference.