Lecture 1: Causal Inference: Potential Outcomes
41 Questions
4 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

Which of the following scenarios best illustrates the intuitive definition of causality, where an action's absence would prevent a particular effect?

  • A car accident occurs during a rainstorm.
  • A plant dies because it was not watered. (correct)
  • A company's profits increase after launching a new marketing campaign.
  • A student studies diligently and receives a good grade on the exam.

In the context of causal relations, what does the notation $D_i = 1$ typically represent?

  • Unit _i_ is not exposed to the treatment.
  • Unit _i_ has a potential outcome of 1, irrespective of treatment.
  • Unit _i_'s outcome variable is equal to 1.
  • Unit _i_ is exposed to the treatment. (correct)

What is the primary significance of the potential outcome model (counterfactual framework) in causal inference?

  • It acknowledges that each unit has two potential outcomes under treatment and no treatment, only one of which is observed. (correct)
  • It focuses solely on the observed outcome of a unit, ignoring the unobserved.
  • It allows us to observe both treated and untreated outcomes for the same unit simultaneously.
  • It enables the estimation of treatment effects by comparing outcomes of different units.

For an individual i, $Y_{1i}$ represents the potential labour market outcome if the person participated in a job search program, and $Y_{0i}$ represents the potential labour market outcome if the person did not participate. Which statement accurately describes the observed reality in the potential outcome framework?

<p>Only $Y_{1i}$ is observed if the person participated in the program; otherwise, $Y_{0i}$ is observed. (B)</p> Signup and view all the answers

In the potential outcome framework, what term is used to describe the potential outcome that is not observed for a unit?

<p>Counterfactual outcome. (B)</p> Signup and view all the answers

Given the definition of a causal effect at the unit level as $\Delta_i = Y_{1i} - Y_{0i}$, which of the following represents the most accurate interpretation of $\Delta_i$?

<p>The individual causal effect of the treatment for unit <em>i</em>. (A)</p> Signup and view all the answers

A researcher observes that unemployed workers who participated in a job search program have, on average, worse employment outcomes than those who did not participate. Why might concluding that the program causes worse employment outcomes be a flawed interpretation?

<p>Those who chose to participate may have unobserved characteristics that made them less employable to begin with. (B)</p> Signup and view all the answers

What is the fundamental problem of causal inference?

<p>Only one of the potential outcomes, $Y_{1i}$ or $Y_{0i}$, can be observed for an individual. (B)</p> Signup and view all the answers

Which of the following assumptions is part of the 'scientific solution' to the counterfactual problem?

<p>Temporal stability: The value of the outcome $y_i$ does not depend on when the treatment takes place. (B)</p> Signup and view all the answers

Why is the 'scientific solution' to the counterfactual problem often unsuitable for social sciences?

<p>Social science environments are rarely perfectly controllable, unlike in a lab setting. (B)</p> Signup and view all the answers

What does the Average Treatment Effect (ATE) represent?

<p>The average benefit an individual receives from the treatment across the entire population. (D)</p> Signup and view all the answers

In the context of causal inference, what is a 'potential outcome'?

<p>The outcome that would occur if a subject received the treatment ($Y_{1i}$) or did not receive the treatment ($Y_{0i}$). (A)</p> Signup and view all the answers

What distinguishes the Average Treatment Effect on the Treated (ATET) from the Average Treatment Effect (ATE)?

<p>ATET focuses on the treatment effect specifically for those who received the treatment, while ATE considers the entire population. (A)</p> Signup and view all the answers

Which expression represents the unobserved counterfactual needed to calculate the Average Treatment Effect on the Treated (ATET)?

<p>$E[Y_{0i} | D_i = 1]$ (C)</p> Signup and view all the answers

Given the potential outcomes $Y_{1i}$ (employment outcome if participating) and $Y_{0i}$ (employment outcome if not participating), if one observes that workers in a program have worse labor market prospects, what critical consideration must be addressed to establish causality?

<p>Accounting for the fact that those who joined the program may have had worse labor market prospects to begin with. (A)</p> Signup and view all the answers

Suppose a researcher aims to estimate the Average Treatment Effect (ATE) of a job training program but can only observe participants' post-training employment outcomes. What is the most significant obstacle to obtaining an unbiased ATE estimate, and what assumptions must be made to address it?

<p>The inability to observe the counterfactual outcome for program participants, requiring assumptions about the similarity between participants and non-participants. (A)</p> Signup and view all the answers

In the context of treatment assignment, what does 'cream-skimming' primarily imply?

<p>Choosing individuals expected to show the most positive outcomes if treated. (D)</p> Signup and view all the answers

What is the central challenge in evaluating the true effect of a treatment or intervention?

<p>The difficulty in observing individuals' outcomes both with and without the treatment simultaneously. (A)</p> Signup and view all the answers

How does randomization specifically address the 'selection problem' in treatment evaluation?

<p>By making the assignment to treatment independent of individuals' potential outcomes. (B)</p> Signup and view all the answers

Given that treatment assignment is randomized, what does the equation E[Y1i | Di = 1] = E[Y1i | Di = 0] = E[Y1i] imply?

<p>The expected outcome under treatment is the same regardless of whether an individual was actually treated in the experiment or not, and is equal to the overall expected outcome under treatment. (C)</p> Signup and view all the answers

While randomization aims to solve the selection problem, what critical assumption must hold for the conclusions drawn from a randomized experiment to be valid concerning the treatment's effect?

<p>The treatment must be implemented consistently and as intended for all individuals assigned to the treatment group. (A)</p> Signup and view all the answers

What is the primary issue with estimating the Average Treatment Effect (ATE) using the simple difference in means between treated and untreated groups, $E[Y_{1i} | D_i = 1] - E[Y_{0i} | D_i = 0]$?

<p>It fails to account for pre-existing differences between the treated and untreated groups that may influence outcomes. (B)</p> Signup and view all the answers

In the equation $E[Y_{1i} | D_i = 1] - E[Y_{0i} | D_i = 0] = E[Y_{1i} - Y_{0i} | D_i = 1] + (E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0])$, which term represents the 'bias term' arising from self-selection?

<p>$E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ (B)</p> Signup and view all the answers

Consider the job search example. If individuals who are more motivated to find work are more likely to participate in a job training program (treatment), what is the likely direction of the bias term $E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ when estimating the effect of the training program on employment?

<p>Positive, because motivated individuals would have higher $Y_{0i}$ even without the training. (C)</p> Signup and view all the answers

In the model 'I am in it, if it is worth it': $D = 1$ if $Y_{1i} - Y_{0i} > c$, what does 'c' represent?

<p>The cost (monetary and mental) of participating in the treatment. (A)</p> Signup and view all the answers

According to the principle of self-selection based on 'worth it', if individuals participate in treatment when $Y_{1i} - Y_{0i} > c$, what can we generally infer about the relationship between $E[Y_{0i} | D_i = 1]$ and $E[Y_{0i} | D_i = 0]$?

<p>$E[Y_{0i} | D_i = 1] eq E[Y_{0i} | D_i = 0]$ because the groups are likely to differ systematically in their potential outcomes even without treatment. (B)</p> Signup and view all the answers

Which of the following best describes 'comparative advantage' as a source of selection bias in treatment evaluation?

<p>Individuals choose to participate in treatments where they expect to gain the most, leading to systematic differences in potential outcomes. (B)</p> Signup and view all the answers

In the context of selection bias, if treatment participants generally have smaller $Y_{0i}$ but larger potential gains ($Y_{1i} - Y_{0i}$), what is the likely direction of the selection bias when naively estimating the ATE?

<p>Bias is likely to be positive, overestimating the true ATE. (D)</p> Signup and view all the answers

Besides self-selection, what are other potential sources of selection bias mentioned in the text?

<p>Administrative rules and selection by treatment providers. (D)</p> Signup and view all the answers

Assume that participation in a voluntary training program is determined by the rule $D = 1$ if $Y_{1i} - Y_{0i} > c$. If the cost 'c' is negatively correlated with $Y_{0i}$ (i.e., individuals with lower potential earnings without training face lower costs to participate), how would this affect the selection bias, and in which direction would it likely lean?

<p>Bias would be negative and increased because lower $Y_{0i}$ individuals are more likely to participate. (A)</p> Signup and view all the answers

Consider a scenario where a highly selective job training program only admits individuals deemed 'most likely to succeed' by the program administrators. How would this selection process most likely influence the bias term $E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ and the naive estimate of the program's effectiveness?

<p>The bias term would likely be positive, potentially overestimating the true program effect. (C)</p> Signup and view all the answers

What question does the Average Treatment Effect (ATE) aim to answer in the context of a job search program?

<p>How much would employment increase on average if all workers participated in the job search program? (D)</p> Signup and view all the answers

What is the primary focus of the Average Treatment Effect on the Treated (ATET)?

<p>The effect of the treatment specifically on those who chose to receive it. (A)</p> Signup and view all the answers

What is the fundamental problem in estimating both ATE and ATET?

<p>The need to compare observed outcomes to counterfactual outcomes, which are inherently unobservable. (B)</p> Signup and view all the answers

What critical assumption is required to simply compare the average outcomes of treated and non-treated individuals to estimate the average treatment effect?

<p>That the average potential outcome as non-treated for the treated is the same as for the non-treated. (D)</p> Signup and view all the answers

In the context of the job search assistance program, why does self-selection pose a problem for estimating treatment effects?

<p>Because individuals who volunteer for the program may be inherently different from those who do not. (D)</p> Signup and view all the answers

What does the notation $E(Y_{0i} | D_i = 1) \neq E(Y_{0i} | D_i = 0)$ imply in the context of self-selection?

<p>The potential outcomes as untreated, for the treated ones, are not the same as the potential outcomes as untreated for those who were actually not treated. (B)</p> Signup and view all the answers

With self-selection, what is implied by $E(Y_{1i} \mid D_i = 1) \neq E(Y_{1i} \mid D_i = 0)$?

<p>The potential outcomes as treated, for the treated ones, are not the same as the potential outcomes as treated for those who were not treated. (A)</p> Signup and view all the answers

How does self-selection typically bias the estimation of the impact of a job search program on employment rates?

<p>It leads to an overestimation of the program's effectiveness because more motivated individuals participate. (B)</p> Signup and view all the answers

Assuming that a job search program significantly increases employment for participants, which statement best represents the likely relationship between $E(Y_{0i} | D_i = 1)$ and $E(Y_{0i} | D_i = 0)$ if participation is driven by motivation?

<p>$E(Y_{0i} | D_i = 1)$ would be greater than $E(Y_{0i} | D_i = 0)$, indicating that, even without the program, participants were more likely to find employment. (B)</p> Signup and view all the answers

Consider a scenario where a job search program boasts a high success rate. However, participation is voluntary, and only individuals with extensive prior work experience opt to enroll. How would this self-selection bias likely affect the interpretation of the program's ATE and ATET?

<p>Both ATE and ATET would likely overestimate the program's true effectiveness in the broader unemployed population due to the pre-existing advantages of participants. (A)</p> Signup and view all the answers

Flashcards

Causality (intuitive definition)

An action causes an effect if that effect would not have occurred without the action.

Y1i (potential outcome with treatment)

A unit's outcome if exposed to a treatment (Di = 1).

Y0i (potential outcome without treatment)

A unit's outcome if NOT exposed to a treatment (Di = 0).

Potential Outcomes

Each unit has two potential outcomes: Y1i (with treatment) and Y0i (without treatment), whether or not the unit was actually treated.

Signup and view all the flashcards

Counterfactual Framework

The potential outcome model imagines different scenarios.

Signup and view all the flashcards

Causal Effect (Δi)

The effect of treatment for a unit, calculated as the difference in potential outcomes (Y1i - Y0i).

Signup and view all the flashcards

Counterfactual Outcome

The outcome that is NOT observed in reality, because you can only be treated or not treated. It serves as a benchmark for understanding actual outcomes.

Signup and view all the flashcards

Potential outcome 𝑌1𝑖

The employment outcome of person i if they participate in a program.

Signup and view all the flashcards

Potential outcome 𝑌0𝑖

The employment outcome of person i if they do not participate in a program.

Signup and view all the flashcards

Causal Effect ∆𝑖

The causal effect of participating by comparing outcomes with and without participation. ∆𝑖 = 𝑌1𝑖 − 𝑌0𝑖

Signup and view all the flashcards

Fundamental Problem of Causal Inference

It's impossible to observe both outcomes (participating and not) for the same person. Only one can be observed, making direct causal inference at the individual level impossible.

Signup and view all the flashcards

Temporal Stability

The outcome isn't affected by when the treatment happens. This means the timing of the program doesn't change its impact.

Signup and view all the flashcards

Causal Transience

A previous treatment does not affect the outcome. Each treatment acts independently.

Signup and view all the flashcards

Homogeneity of Units

Units with the same characteristics will have same outcomes. If two people have all the same traits, they'll respond to treatment the same way.

Signup and view all the flashcards

Average Treatment Effect (ATE)

The average impact of a treatment on a population. ATE = E[𝑌1𝑖] − E[𝑌0𝑖]

Signup and view all the flashcards

Average Treatment Effect on the Treated (ATET)

The average impact of a treatment on those who received it. 𝐸[𝑌1𝑖|𝐷𝑖=1] − 𝐸[𝑌0𝑖|𝐷𝑖=1]

Signup and view all the flashcards

Cream-skimming (selection)

"Cream-skimming" refers to selecting the "best" candidates, leading to higher expected outcomes in the treated group compared to the untreated (E[Y0i | Di = 1] > E[Y0i | Di = 0]).

Signup and view all the flashcards

Negative Selection (selection)

Negative selection involves selecting the "weakest" candidates, resulting in lower expected outcomes in the treated group versus the untreated (E[Y0i | Di = 1] < E[Y0i | Di = 0]).

Signup and view all the flashcards

Key Problem in Treatment Evaluation

The fundamental challenge in treatment evaluation is the inability to observe or estimate all potential outcomes (E[Y0i | Di = 1], E[Y1i | Di = 0], E[Y1i], and E[Y0i]).

Signup and view all the flashcards

Exogenous Variation

Finding exogenous variation that influences treatment assignment without affecting potential outcomes is crucial for identifying a valid counterfactual.

Signup and view all the flashcards

Randomization solves selection

Randomization makes treatment assignment statistically independent of potential outcomes, ensuring that treated and untreated groups are comparable on average.

Signup and view all the flashcards

ATE (Average Treatment Effect)

The average treatment effect, answering: If all workers participated, how much would employment increase?

Signup and view all the flashcards

ATET (Average Treatment Effect on the Treated)

The average treatment effect on the treated, answering: How much would employment increase for program participants?

Signup and view all the flashcards

Self-Selection Problem

The problem that arises because treated and untreated groups may differ systematically, biasing the estimation of treatment effects.

Signup and view all the flashcards

Naive Estimation

Comparing outcomes of treated and untreated groups directly, without accounting for pre-existing differences.

Signup and view all the flashcards

E[Y0i | Di = 1]

The potential outcome as non-treated for the actually treated ones.

Signup and view all the flashcards

E[Y0i | Di = 0]

The potential outcome as non-treated for the actually non-treated ones.

Signup and view all the flashcards

Self-Selection Example (Job Search)

The motivations and characteristics of those who choose to enroll in job search programs often differ from those who don't.

Signup and view all the flashcards

E[Y0i | Di = 1] ≠ E[Y0i | Di = 0]

The potential outcomes as untreated, for the treated ones, are not the same as the potential outcomes as untreated for those who were actually not treated.

Signup and view all the flashcards

E[Y1i | Di = 1] ≠ E[Y1i | Di = 0]

The potential outcomes as treated, for the treated ones, are not the same as the potential outcomes as treated for those who were not treated.

Signup and view all the flashcards

Bias from Self-Selection

The systematic difference between the treated and untreated groups that biases the estimation of treatment effects.

Signup and view all the flashcards

Naive ATE Estimation

Naive ATE estimation directly compares the average outcomes of treated and untreated groups, without accounting for pre-existing differences.

Signup and view all the flashcards

Self-Selection Bias

The self-selection bias arises because individuals choose whether or not to participate in a treatment based on their characteristics.

Signup and view all the flashcards

ATET and the Bias Term

ATET + Bias Term = E[Y1i − Y0i | Di = 1] + (a bias term), where the bias term accounts for the difference that would still exist without treatment.

Signup and view all the flashcards

Bias Term in Job Search Example

Captures the difference in the untreated outcomes between those who received treatment and those who did not.

Signup and view all the flashcards

Importance of Selection Bias

Selection bias is significant because individuals make rational decisions about whether the benefit of participating is worth the cost.

Signup and view all the flashcards

Participation Decision

Individuals participate (D = 1) if the potential gain from treatment exceeds the cost of participation.

Signup and view all the flashcards

Smaller Y0i

Those who participate in the treatment are likely to have a smaller Y0i (outcome if not treated), leading to a larger potential gain and are thus more likely to participate.

Signup and view all the flashcards

Unequal Groups E[Y0i | Di = 1] ≠ E[Y0i | Di = 0]

E[Y0i | Di = 1] ≠ E[Y0i | Di = 0] because these groups differ in comparative advantages.

Signup and view all the flashcards

Selection by Administrators

Treatment selection can be influenced administrative rules and bias by administrators

Signup and view all the flashcards

Selection by Treatment Providers

Treatment providers can influence treatment participation due to bias.

Signup and view all the flashcards

Study Notes

Roadmap for Causality Lectures

  • Focus will be on causality.
  • Examination of how economists view causality.
  • Discuss the difficulty in estimating causal effects without experimental research design.
  • Why randomized trials are considered the ‘golden standard’.
  • Guidance on analyzing data from experiments.
  • Reasons why even randomized trials can fail to produce causal estimates.
  • Examples of randomization in economics research.

Readings for Causality

  • Angrist and Pischke, chapters 1 and 2.
  • Wooldridge, chapter 1.4.
  • Articles.

Why Focus on Causality?

  • The most compelling research in economics frequently addresses cause-and-effect relationships.
  • A causal relationship is valuable for forecasting the effects of changing policies.
  • While it may serve other purposes, purely descriptive research isn't very helpful in that regard.

Causal Relations & Experimental Approach

  • The experimental approach plays a crucial role when examining causal relationships.
  • Modern microeconomics has increasingly adopted scientific experiments as the "golden standard" for inference.
  • Throughout history, numerous scientific advancements have been facilitated through experiments, dating back to the Renaissance with figures like Galileo.

Questions Involving Causality in Economics

  • Labour economics seeks to understand how participating in an active labor market program affects employment probabilities.
  • Education economics aims to determine whether smaller classes improve study results.
  • Health economics explores the impact of absolute/relative income on health and mortality.
  • Macroeconomics investigates the effect of unanticipated changes in short-term interest rates on current and future activity.
  • Crime economics questions whether more police presence reduces crime rates.

Causal Relations in Economics

  • Researchers are interested in the causal effect of participating in a treatment on future outcomes.
  • Models for analyzing causal effects are also models for treatment/policy/program evaluation.
  • “Treatment” is defined broadly, including actions like attending university or getting married.

Definition of Causality

  • An action causes a particular effect if the effect would not have occurred without the action.

Causality Formalized

  • A framework for thinking about causality involves a population of N units, such as individuals, firms, or countries.
  • An outcome variable Y and a variable D is observed for each unit.
  • There is an assumption Y and D are correlated.
  • Correlation does not imply causation.
  • It is valuable to know under what specific circumstances it is possible to infer that D causes Y.

Causal Relations in Economics Defined

  • "i" represents the index for a particular unit in the population.
  • Dᵢ symbolizes a "treatment".
  • Assume "treatment" is binary, either "yes" or "no".
    • Dᵢ = 1 if unit i is exposed to treatment.
    • Dᵢ = 0 if unit i is not exposed to treatment.
    • Yᵢ(Dᵢ) represents the observed outcome.

Potential Outcome Model

  • Defining potential outcomes of unit i creates a counterfactual framework.
  • Each unit has two potential outcomes (Y₁i with treatment; Y₀i without).
  • Y₁i and Y₀i refer to potential outcomes (treated/non-treated) for unit i whether the unit was actually treated.

Potential Outcome Examples

  • An unemployed person seeking a job could participate in a job search program.
  • Y₁i: potential labor market outcome if the person participated in the program.
  • Y₀i: potential labor market outcome if the person did not participate.
  • Those are potential outcomes unrelated to actual treatment status.

Definition of Causal Effect

  • In potential outcome framework, only one potential outcome is observed
  • The unobserved potential outcome is also called the counterfactual outcome.
  • For any given unit, the impact of partaking in the treatment is:
    • Δᵢ = Y₁ᵢ – Y₀ᵢ is the difference in potential outcomes
    • This is also the definition of causal effect at the unit level

Causal Effect Example: Job Search Program

  • Example: comparing unemployed workers who went to a job search program and those who didn't may incorrectly assess the program.
  • Workers who went to the program may have worse labor market prospects.
  • Potential outcomes:
    • Y₁i: the employment outcome of person i going to the program
    • Y₀i: the employment outcome of person i not going to the program
  • The causal effect of participating in the program: Δᵢ = Y₁ᵢ – Y₀ᵢ

Fundamental Problem of Causal Interference

  • Δᵢ is unobservable because only Y₁ᵢ or Y₀ᵢ can be observed.
  • It is impossible to derive causality at the unit level because units cannot receive and not receive treatment at the same time.
  • This difficult problem can be approached with either a "scientific“ or a statistical solution."

Overcoming Counterfactual Problems: Scientific Solutions

  • Consider a unit i with the assumptions:
    • Temporal Stability: Outcome y does not depend on when the treatment takes place.
    • Causal Transience: Outcome y is independent of any prior treatment.
    • Homogeneity of Units: An assumption for other units j ≠ i such that yi(xi) = yj(xj) for xi = xj
  • Assumptions are used in natural sciences to infer causality.
  • These are unlikely to hold in social sciences because the environment is is not controllable such as it might be in a lab.

Statistical Solution to the Counterfactual Problem

  • Methods that compute the average causal effect for the entire population or subgroups of interest
  • In economic literature, the Average Treatment Effect (ATE) is used.
  • Formula: ATE = E[Δᵢ] = E[Y₁ᵢ - Y₀ᵢ] = E[Y₁ᵢ] - E[Y₀ᵢ]
    • The ATE of Dᵢ = 1 gauges how much, on average, an individual benefits from receiving the defined treatment.
  • ATE compares the potential outcome when all units receive the treatment to the potential outcome when no units receive treatment.
  • Neither outcome, however, is typically directly observed.

Average Treatment Effect on the Treated (ATET) in Economics

  • The average treatment effect on the treated describes how much on average the individuals who actually received treatment benefited from the treatment:
    • E[∆ᵢ|Dᵢ = 1] = E[Y₁ᵢ - Y₀ᵢ |Dᵢ = 1] = E[Y₁ᵢ |Dᵢ = 1] – E[Y₀ᵢ |Dᵢ = 1] = 1, involves one unobserved counterfactual that indicates the potential outcome as untreated for those who actually received treatment.

ATE and ATET: Job Program Example

  • For a job-search program:
    • ATE answers: If all unemployed workers participate in the program, how much would employment increase?
    • ATET answers: How much would employment increase for workers who selected into the program?
  • The questions cannot readily be answered, because they require comparison the person's observed outcome to the counterfactual outcome.

Estimating ATE and ATET: Self-Selection

  • A self-selection problem must be addressed to estimate average treatment effects:
    • E[Y₁ᵢ|Dᵢ = 1]- E[Y₀ᵢ|Dᵢ = 0].
    • In other words, can't average outcomes can just be compared for those who were treated to outcomes for those not treated?
  • This would require the average potential outcome in a non-treated condition to be the same for treated individuals as for non-treated individuals. -E[Y₀ᵢ|Dᵢ = 1] = E[Y₀ᵢ|Dᵢ = 0]
  • This is an unlikely event without an experimental variable.

Example of Self-Selection

  • Workers motivated to find work are more likely to participate in a job search program.
  • The above individuals potential outcomes, treated and untreated, are both more motivating than those less motivated to find work.
  • Potential outcomes are not independent of actual treatment status.

Self-Selection Conundrum

  • With self-selection:
    • E[Y₀ᵢ | Dᵢ = 1] ≠ E[Y₀ᵢ | Dᵢ = 0] and E[Y₁ᵢ | Dᵢ = 1] ≠ E[Y₁ᵢ | Dᵢ = 0].
  • The potential outcomes as untreated for those who were treated are not the same as the potential outcomes as untreated for those who were not treated.
  • E[Y₀ᵢ | Dᵢ = 1] is not the same as E[Y₀ᵢ | Dᵢ = 0].
  • Likewise the potential outcomes as treated for those who were treated are not the same as the potential outcomes as treated for those who were not treated.
  • E[Y₁ᵢ | Dᵢ = 1] is not the same as E[Y₁ᵢ | Dᵢ = 0].

Formalizing Bias Resulting from Self-Selection

  • It is possible to formalize self-selection induced bias.
  • The Average Treatment Effect (ATE) is naively estimated via:
  • E[Y₁ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 0]
  • Adding and subtracting E[Y₀ᵢ|Dᵢ = 1] to the expression above produces:
  • E[Y₁ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 1] + E[Y₀ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 0]
    • Simplifies the yield: -E[Y1i - Yoi |Di = 1] + E[Yoi|Di = 1] - E[Yoi|Di = 0]
  • Which can be written as:
    • E[Y1i - Yoi |Di = 1] + (a bias term)
  • A result gives ATET + bias term.

Example of Bias

  • The bias term, E[Y₀ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 0], captures the difference in non-treated employment outcomes between those who did and did not get treatment.
  • E[Y₀ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 0] > 0, since selecting a treatment is in some way dependent on the individual feeling it will improve some aspect of their lives where intervention is desired.

Why Selection Bias Matters

  • Selection bias arises from optimizing decisions by rational actors.
    • I am in it if worth it.
    • D = 1 if Y1i - Yoi > c ,
    • C represents the mental and monetary cost of participating in the treatment.

  • The above can then be used to write:
    • E[Yoi |Di = 1] = E[Yoi |Yoi < Y1i -c]
    • E[Yoi |Di = 0] = E[Yoi |Yoi > Y1i -c]
  • With groups differing in terms of comparative advantages.
  • Y1i - Yoi is large for some, small for others.
    • The most simple instances: treatment participants have smaller Yoi, thus larger potential gain.

Other Selection Sources

  • May occur due to administrative rules or by selection coming from treatment providers.
    • "Cream-skimming": Choosing “the best” suggests that E[Yoi|Di = 1] > E[Yoi|Di = 0].
  • Selection bias also results come from negative treatment:
    • Negative Treatment Example: Placing "weak" kids in small classes produces the formula:
    • E[Yoi|Di = 1] < E[Yoi|Di = 0]

The Key Problem for Treatment Evaluation

  • Key unobservable/unestimatable values with the formula E[Y₀ᵢ|Dᵢ = 1], E[Y₁ᵢ|Dᵢ = 0], E[Y₁ᵢ] and E[Y₀ᵢ]. Treatment evaluation's primary issue is isolating something, with exogenous variation, that only influences treatment assignment and not potential outcomes.
  • In other words, the treatment status to be independent of potential outcomes and have a valid couterfactual to evaluate treatment

Randomization Solves the Self-Selection Problem

  • Social experiments randomize treatment assignment across individuals via a lottery
  • Randomization implies treatment assignment that is statistically independent of potential outcomes and other variables:
  • (Y0i, Y1i) ┴ Dį
  • In the job search example: using a lottery to assign individuals.
  • The "treatment" is independent of potential outcomes.

How Randomization Solves the Selection Problem: Example

  • In a job search, motivated workers often selected treatment.
  • Randomizing treatment creates equal distributions of motivation in treatment/non treatment groups.
  • With completely random treatment the groups will look alike, removing selection
  • With treatment status the sole group differentiator, each group counteracts the others' factuality.

Randomization Solves The Selection Problem Concluded

  • With randomization, treatment status is independent of potential outcomes with these mathematical effects:
    • E[Y1i|Di = 1] = E[Y1i|Di = 0] = E[Y1i]
    • E[Yoi|Di = 1] = E[Yoi|Di = 0] = E[Yoi]
  • In a job search program, this implies the following:

Randomization Solves the Selection Problem Concluded 2

  • Random assignment of Dᵢ eliminates selection.
    • ATET = E[Y1i|Di = 1] – E[Yoi|Di = 1]=E[Y1i|Di = 1] – E[Yoi|Di = 0]
    • We swap out E[Y1i|Di = 1] for E[Yoi|Di = 0] because treatment status is independent of potential outcomes.
  • As a result, E[Y1i|Di = 1] and E[Yoi|Di = 0] are observed.

  • E[Y1i]) is the same as the potential outcomes of those who actually took the program E[Y1i|Di = 1]).
  • With randomization ATE = ATET as groups are similar despite treatment.

Important Effects of Randomization- World Evidence

  • In the social sciences randomization isn't always possible and control strategies (controlling factors that differentiate outcomes) are frequently applied instead
  • There are many occasions where both experiments are tried to find a similar result.
  • This allows comparison for better alignment with experimental ideal
  • Sadly, medicine and economics studies show randomization is superior to control strategies

How Important is Randomization- HRT Medicine Example?

  • Recent instances come from hormone replacement therapy investigation
  • Evidence from the Nurse's Health Study, a large, influential non experimental survey, showed better health with HRT used with a control strategy
  • Results from similar randomized trails showed exceeding, harmful health risks.
    • The prior positive reflection was self-selective, in that health conscious women got selected which explains the ATET + bias term

Important Effects of Randomization World Economics Example

  • There are few, non experimental studies with labor market training paradoxically find less earnings from participants compared to nonparticipants.
  • One may suspect selection bias from training serving low income individuals leading to naive comparisons
  • Randomized training programs, however, boast positive effects.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

Description

Explore causality, potential outcomes, and counterfactuals. Understand the notation, significance, and interpretation of causal effects. Learn about observed and unobserved outcomes in causal inference studies.

More Like This

Use Quizgecko on...
Browser
Browser