Lecture 1: Causal Inference: Potential Outcomes

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

Which of the following scenarios best illustrates the intuitive definition of causality, where an action's absence would prevent a particular effect?

A car accident occurs during a rainstorm.
A plant dies because it was not watered. (correct)
A company's profits increase after launching a new marketing campaign.
A student studies diligently and receives a good grade on the exam.

In the context of causal relations, what does the notation $D_i = 1$ typically represent?

Unit _i_ is not exposed to the treatment.
Unit _i_ has a potential outcome of 1, irrespective of treatment.
Unit _i_'s outcome variable is equal to 1.
Unit _i_ is exposed to the treatment. (correct)

What is the primary significance of the potential outcome model (counterfactual framework) in causal inference?

It acknowledges that each unit has two potential outcomes under treatment and no treatment, only one of which is observed. (correct)
It focuses solely on the observed outcome of a unit, ignoring the unobserved.
It allows us to observe both treated and untreated outcomes for the same unit simultaneously.
It enables the estimation of treatment effects by comparing outcomes of different units.

For an individual i, $Y_{1i}$ represents the potential labour market outcome if the person participated in a job search program, and $Y_{0i}$ represents the potential labour market outcome if the person did not participate. Which statement accurately describes the observed reality in the potential outcome framework?

Only $Y_{1i}$ is observed if the person participated in the program; otherwise, $Y_{0i}$ is observed. (B) Signup and view all the answers

In the potential outcome framework, what term is used to describe the potential outcome that is not observed for a unit?

Counterfactual outcome. (B) Signup and view all the answers

Given the definition of a causal effect at the unit level as $\Delta_i = Y_{1i} - Y_{0i}$, which of the following represents the most accurate interpretation of $\Delta_i$?

The individual causal effect of the treatment for unit i. (A) Signup and view all the answers

A researcher observes that unemployed workers who participated in a job search program have, on average, worse employment outcomes than those who did not participate. Why might concluding that the program causes worse employment outcomes be a flawed interpretation?

Those who chose to participate may have unobserved characteristics that made them less employable to begin with. (B) Signup and view all the answers

What is the fundamental problem of causal inference?

Only one of the potential outcomes, $Y_{1i}$ or $Y_{0i}$, can be observed for an individual. (B) Signup and view all the answers

Which of the following assumptions is part of the 'scientific solution' to the counterfactual problem?

Temporal stability: The value of the outcome $y_i$ does not depend on when the treatment takes place. (B) Signup and view all the answers

Why is the 'scientific solution' to the counterfactual problem often unsuitable for social sciences?

Social science environments are rarely perfectly controllable, unlike in a lab setting. (B) Signup and view all the answers

What does the Average Treatment Effect (ATE) represent?

The average benefit an individual receives from the treatment across the entire population. (D) Signup and view all the answers

In the context of causal inference, what is a 'potential outcome'?

The outcome that would occur if a subject received the treatment ($Y_{1i}$) or did not receive the treatment ($Y_{0i}$). (A) Signup and view all the answers

What distinguishes the Average Treatment Effect on the Treated (ATET) from the Average Treatment Effect (ATE)?

ATET focuses on the treatment effect specifically for those who received the treatment, while ATE considers the entire population. (A) Signup and view all the answers

Which expression represents the unobserved counterfactual needed to calculate the Average Treatment Effect on the Treated (ATET)?

$E[Y_{0i} | D_i = 1]$ (C) Signup and view all the answers

Given the potential outcomes $Y_{1i}$ (employment outcome if participating) and $Y_{0i}$ (employment outcome if not participating), if one observes that workers in a program have worse labor market prospects, what critical consideration must be addressed to establish causality?

Accounting for the fact that those who joined the program may have had worse labor market prospects to begin with. (A) Signup and view all the answers

Suppose a researcher aims to estimate the Average Treatment Effect (ATE) of a job training program but can only observe participants' post-training employment outcomes. What is the most significant obstacle to obtaining an unbiased ATE estimate, and what assumptions must be made to address it?

The inability to observe the counterfactual outcome for program participants, requiring assumptions about the similarity between participants and non-participants. (A) Signup and view all the answers

In the context of treatment assignment, what does 'cream-skimming' primarily imply?

Choosing individuals expected to show the most positive outcomes if treated. (D) Signup and view all the answers

What is the central challenge in evaluating the true effect of a treatment or intervention?

The difficulty in observing individuals' outcomes both with and without the treatment simultaneously. (A) Signup and view all the answers

How does randomization specifically address the 'selection problem' in treatment evaluation?

By making the assignment to treatment independent of individuals' potential outcomes. (B) Signup and view all the answers

Given that treatment assignment is randomized, what does the equation E[Y1i | Di = 1] = E[Y1i | Di = 0] = E[Y1i] imply?

The expected outcome under treatment is the same regardless of whether an individual was actually treated in the experiment or not, and is equal to the overall expected outcome under treatment. (C) Signup and view all the answers

While randomization aims to solve the selection problem, what critical assumption must hold for the conclusions drawn from a randomized experiment to be valid concerning the treatment's effect?

The treatment must be implemented consistently and as intended for all individuals assigned to the treatment group. (A) Signup and view all the answers

What is the primary issue with estimating the Average Treatment Effect (ATE) using the simple difference in means between treated and untreated groups, $E[Y_{1i} | D_i = 1] - E[Y_{0i} | D_i = 0]$?

It fails to account for pre-existing differences between the treated and untreated groups that may influence outcomes. (B) Signup and view all the answers

In the equation $E[Y_{1i} | D_i = 1] - E[Y_{0i} | D_i = 0] = E[Y_{1i} - Y_{0i} | D_i = 1] + (E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0])$, which term represents the 'bias term' arising from self-selection?

$E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ (B) Signup and view all the answers

Consider the job search example. If individuals who are more motivated to find work are more likely to participate in a job training program (treatment), what is the likely direction of the bias term $E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ when estimating the effect of the training program on employment?

Positive, because motivated individuals would have higher $Y_{0i}$ even without the training. (C) Signup and view all the answers

In the model 'I am in it, if it is worth it': $D = 1$ if $Y_{1i} - Y_{0i} > c$, what does 'c' represent?

The cost (monetary and mental) of participating in the treatment. (A) Signup and view all the answers

According to the principle of self-selection based on 'worth it', if individuals participate in treatment when $Y_{1i} - Y_{0i} > c$, what can we generally infer about the relationship between $E[Y_{0i} | D_i = 1]$ and $E[Y_{0i} | D_i = 0]$?

$E[Y_{0i} | D_i = 1] eq E[Y_{0i} | D_i = 0]$ because the groups are likely to differ systematically in their potential outcomes even without treatment. (B) Signup and view all the answers

Which of the following best describes 'comparative advantage' as a source of selection bias in treatment evaluation?

Individuals choose to participate in treatments where they expect to gain the most, leading to systematic differences in potential outcomes. (B) Signup and view all the answers

In the context of selection bias, if treatment participants generally have smaller $Y_{0i}$ but larger potential gains ($Y_{1i} - Y_{0i}$), what is the likely direction of the selection bias when naively estimating the ATE?

Bias is likely to be positive, overestimating the true ATE. (D) Signup and view all the answers

Besides self-selection, what are other potential sources of selection bias mentioned in the text?

Administrative rules and selection by treatment providers. (D) Signup and view all the answers

Assume that participation in a voluntary training program is determined by the rule $D = 1$ if $Y_{1i} - Y_{0i} > c$. If the cost 'c' is negatively correlated with $Y_{0i}$ (i.e., individuals with lower potential earnings without training face lower costs to participate), how would this affect the selection bias, and in which direction would it likely lean?

Bias would be negative and increased because lower $Y_{0i}$ individuals are more likely to participate. (A) Signup and view all the answers

Consider a scenario where a highly selective job training program only admits individuals deemed 'most likely to succeed' by the program administrators. How would this selection process most likely influence the bias term $E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ and the naive estimate of the program's effectiveness?

The bias term would likely be positive, potentially overestimating the true program effect. (C) Signup and view all the answers

What question does the Average Treatment Effect (ATE) aim to answer in the context of a job search program?

How much would employment increase on average if all workers participated in the job search program? (D) Signup and view all the answers

What is the primary focus of the Average Treatment Effect on the Treated (ATET)?

The effect of the treatment specifically on those who chose to receive it. (A) Signup and view all the answers

What is the fundamental problem in estimating both ATE and ATET?

The need to compare observed outcomes to counterfactual outcomes, which are inherently unobservable. (B) Signup and view all the answers

What critical assumption is required to simply compare the average outcomes of treated and non-treated individuals to estimate the average treatment effect?

That the average potential outcome as non-treated for the treated is the same as for the non-treated. (D) Signup and view all the answers

In the context of the job search assistance program, why does self-selection pose a problem for estimating treatment effects?

Because individuals who volunteer for the program may be inherently different from those who do not. (D) Signup and view all the answers

What does the notation $E(Y_{0i} | D_i = 1) \neq E(Y_{0i} | D_i = 0)$ imply in the context of self-selection?

The potential outcomes as untreated, for the treated ones, are not the same as the potential outcomes as untreated for those who were actually not treated. (B) Signup and view all the answers

With self-selection, what is implied by $E(Y_{1i} \mid D_i = 1) \neq E(Y_{1i} \mid D_i = 0)$?

The potential outcomes as treated, for the treated ones, are not the same as the potential outcomes as treated for those who were not treated. (A) Signup and view all the answers

How does self-selection typically bias the estimation of the impact of a job search program on employment rates?

It leads to an overestimation of the program's effectiveness because more motivated individuals participate. (B) Signup and view all the answers

Assuming that a job search program significantly increases employment for participants, which statement best represents the likely relationship between $E(Y_{0i} | D_i = 1)$ and $E(Y_{0i} | D_i = 0)$ if participation is driven by motivation?

$E(Y_{0i} | D_i = 1)$ would be greater than $E(Y_{0i} | D_i = 0)$, indicating that, even without the program, participants were more likely to find employment. (B) Signup and view all the answers

Consider a scenario where a job search program boasts a high success rate. However, participation is voluntary, and only individuals with extensive prior work experience opt to enroll. How would this self-selection bias likely affect the interpretation of the program's ATE and ATET?

Both ATE and ATET would likely overestimate the program's true effectiveness in the broader unemployed population due to the pre-existing advantages of participants. (A) Signup and view all the answers

Flashcards

Causality (intuitive definition)

An action causes an effect if that effect would not have occurred without the action.

Y1i (potential outcome with treatment)

A unit's outcome if exposed to a treatment (Di = 1).

Y0i (potential outcome without treatment)

A unit's outcome if NOT exposed to a treatment (Di = 0).

Potential Outcomes

Each unit has two potential outcomes: Y1i (with treatment) and Y0i (without treatment), whether or not the unit was actually treated.