Podcast
Questions and Answers
Which of the following scenarios best illustrates the intuitive definition of causality, where an action's absence would prevent a particular effect?
Which of the following scenarios best illustrates the intuitive definition of causality, where an action's absence would prevent a particular effect?
- A car accident occurs during a rainstorm.
- A plant dies because it was not watered. (correct)
- A company's profits increase after launching a new marketing campaign.
- A student studies diligently and receives a good grade on the exam.
In the context of causal relations, what does the notation $D_i = 1$ typically represent?
In the context of causal relations, what does the notation $D_i = 1$ typically represent?
- Unit _i_ is not exposed to the treatment.
- Unit _i_ has a potential outcome of 1, irrespective of treatment.
- Unit _i_'s outcome variable is equal to 1.
- Unit _i_ is exposed to the treatment. (correct)
What is the primary significance of the potential outcome model (counterfactual framework) in causal inference?
What is the primary significance of the potential outcome model (counterfactual framework) in causal inference?
- It acknowledges that each unit has two potential outcomes under treatment and no treatment, only one of which is observed. (correct)
- It focuses solely on the observed outcome of a unit, ignoring the unobserved.
- It allows us to observe both treated and untreated outcomes for the same unit simultaneously.
- It enables the estimation of treatment effects by comparing outcomes of different units.
For an individual i, $Y_{1i}$ represents the potential labour market outcome if the person participated in a job search program, and $Y_{0i}$ represents the potential labour market outcome if the person did not participate. Which statement accurately describes the observed reality in the potential outcome framework?
For an individual i, $Y_{1i}$ represents the potential labour market outcome if the person participated in a job search program, and $Y_{0i}$ represents the potential labour market outcome if the person did not participate. Which statement accurately describes the observed reality in the potential outcome framework?
In the potential outcome framework, what term is used to describe the potential outcome that is not observed for a unit?
In the potential outcome framework, what term is used to describe the potential outcome that is not observed for a unit?
Given the definition of a causal effect at the unit level as $\Delta_i = Y_{1i} - Y_{0i}$, which of the following represents the most accurate interpretation of $\Delta_i$?
Given the definition of a causal effect at the unit level as $\Delta_i = Y_{1i} - Y_{0i}$, which of the following represents the most accurate interpretation of $\Delta_i$?
A researcher observes that unemployed workers who participated in a job search program have, on average, worse employment outcomes than those who did not participate. Why might concluding that the program causes worse employment outcomes be a flawed interpretation?
A researcher observes that unemployed workers who participated in a job search program have, on average, worse employment outcomes than those who did not participate. Why might concluding that the program causes worse employment outcomes be a flawed interpretation?
What is the fundamental problem of causal inference?
What is the fundamental problem of causal inference?
Which of the following assumptions is part of the 'scientific solution' to the counterfactual problem?
Which of the following assumptions is part of the 'scientific solution' to the counterfactual problem?
Why is the 'scientific solution' to the counterfactual problem often unsuitable for social sciences?
Why is the 'scientific solution' to the counterfactual problem often unsuitable for social sciences?
What does the Average Treatment Effect (ATE) represent?
What does the Average Treatment Effect (ATE) represent?
In the context of causal inference, what is a 'potential outcome'?
In the context of causal inference, what is a 'potential outcome'?
What distinguishes the Average Treatment Effect on the Treated (ATET) from the Average Treatment Effect (ATE)?
What distinguishes the Average Treatment Effect on the Treated (ATET) from the Average Treatment Effect (ATE)?
Which expression represents the unobserved counterfactual needed to calculate the Average Treatment Effect on the Treated (ATET)?
Which expression represents the unobserved counterfactual needed to calculate the Average Treatment Effect on the Treated (ATET)?
Given the potential outcomes $Y_{1i}$ (employment outcome if participating) and $Y_{0i}$ (employment outcome if not participating), if one observes that workers in a program have worse labor market prospects, what critical consideration must be addressed to establish causality?
Given the potential outcomes $Y_{1i}$ (employment outcome if participating) and $Y_{0i}$ (employment outcome if not participating), if one observes that workers in a program have worse labor market prospects, what critical consideration must be addressed to establish causality?
Suppose a researcher aims to estimate the Average Treatment Effect (ATE) of a job training program but can only observe participants' post-training employment outcomes. What is the most significant obstacle to obtaining an unbiased ATE estimate, and what assumptions must be made to address it?
Suppose a researcher aims to estimate the Average Treatment Effect (ATE) of a job training program but can only observe participants' post-training employment outcomes. What is the most significant obstacle to obtaining an unbiased ATE estimate, and what assumptions must be made to address it?
In the context of treatment assignment, what does 'cream-skimming' primarily imply?
In the context of treatment assignment, what does 'cream-skimming' primarily imply?
What is the central challenge in evaluating the true effect of a treatment or intervention?
What is the central challenge in evaluating the true effect of a treatment or intervention?
How does randomization specifically address the 'selection problem' in treatment evaluation?
How does randomization specifically address the 'selection problem' in treatment evaluation?
Given that treatment assignment is randomized, what does the equation E[Y1i | Di = 1] = E[Y1i | Di = 0] = E[Y1i] imply?
Given that treatment assignment is randomized, what does the equation E[Y1i | Di = 1] = E[Y1i | Di = 0] = E[Y1i] imply?
While randomization aims to solve the selection problem, what critical assumption must hold for the conclusions drawn from a randomized experiment to be valid concerning the treatment's effect?
While randomization aims to solve the selection problem, what critical assumption must hold for the conclusions drawn from a randomized experiment to be valid concerning the treatment's effect?
What is the primary issue with estimating the Average Treatment Effect (ATE) using the simple difference in means between treated and untreated groups, $E[Y_{1i} | D_i = 1] - E[Y_{0i} | D_i = 0]$?
What is the primary issue with estimating the Average Treatment Effect (ATE) using the simple difference in means between treated and untreated groups, $E[Y_{1i} | D_i = 1] - E[Y_{0i} | D_i = 0]$?
In the equation $E[Y_{1i} | D_i = 1] - E[Y_{0i} | D_i = 0] = E[Y_{1i} - Y_{0i} | D_i = 1] + (E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0])$, which term represents the 'bias term' arising from self-selection?
In the equation $E[Y_{1i} | D_i = 1] - E[Y_{0i} | D_i = 0] = E[Y_{1i} - Y_{0i} | D_i = 1] + (E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0])$, which term represents the 'bias term' arising from self-selection?
Consider the job search example. If individuals who are more motivated to find work are more likely to participate in a job training program (treatment), what is the likely direction of the bias term $E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ when estimating the effect of the training program on employment?
Consider the job search example. If individuals who are more motivated to find work are more likely to participate in a job training program (treatment), what is the likely direction of the bias term $E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ when estimating the effect of the training program on employment?
In the model 'I am in it, if it is worth it': $D = 1$ if $Y_{1i} - Y_{0i} > c$, what does 'c' represent?
In the model 'I am in it, if it is worth it': $D = 1$ if $Y_{1i} - Y_{0i} > c$, what does 'c' represent?
According to the principle of self-selection based on 'worth it', if individuals participate in treatment when $Y_{1i} - Y_{0i} > c$, what can we generally infer about the relationship between $E[Y_{0i} | D_i = 1]$ and $E[Y_{0i} | D_i = 0]$?
According to the principle of self-selection based on 'worth it', if individuals participate in treatment when $Y_{1i} - Y_{0i} > c$, what can we generally infer about the relationship between $E[Y_{0i} | D_i = 1]$ and $E[Y_{0i} | D_i = 0]$?
Which of the following best describes 'comparative advantage' as a source of selection bias in treatment evaluation?
Which of the following best describes 'comparative advantage' as a source of selection bias in treatment evaluation?
In the context of selection bias, if treatment participants generally have smaller $Y_{0i}$ but larger potential gains ($Y_{1i} - Y_{0i}$), what is the likely direction of the selection bias when naively estimating the ATE?
In the context of selection bias, if treatment participants generally have smaller $Y_{0i}$ but larger potential gains ($Y_{1i} - Y_{0i}$), what is the likely direction of the selection bias when naively estimating the ATE?
Besides self-selection, what are other potential sources of selection bias mentioned in the text?
Besides self-selection, what are other potential sources of selection bias mentioned in the text?
Assume that participation in a voluntary training program is determined by the rule $D = 1$ if $Y_{1i} - Y_{0i} > c$. If the cost 'c' is negatively correlated with $Y_{0i}$ (i.e., individuals with lower potential earnings without training face lower costs to participate), how would this affect the selection bias, and in which direction would it likely lean?
Assume that participation in a voluntary training program is determined by the rule $D = 1$ if $Y_{1i} - Y_{0i} > c$. If the cost 'c' is negatively correlated with $Y_{0i}$ (i.e., individuals with lower potential earnings without training face lower costs to participate), how would this affect the selection bias, and in which direction would it likely lean?
Consider a scenario where a highly selective job training program only admits individuals deemed 'most likely to succeed' by the program administrators. How would this selection process most likely influence the bias term $E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ and the naive estimate of the program's effectiveness?
Consider a scenario where a highly selective job training program only admits individuals deemed 'most likely to succeed' by the program administrators. How would this selection process most likely influence the bias term $E[Y_{0i} | D_i = 1] - E[Y_{0i} | D_i = 0]$ and the naive estimate of the program's effectiveness?
What question does the Average Treatment Effect (ATE) aim to answer in the context of a job search program?
What question does the Average Treatment Effect (ATE) aim to answer in the context of a job search program?
What is the primary focus of the Average Treatment Effect on the Treated (ATET)?
What is the primary focus of the Average Treatment Effect on the Treated (ATET)?
What is the fundamental problem in estimating both ATE and ATET?
What is the fundamental problem in estimating both ATE and ATET?
What critical assumption is required to simply compare the average outcomes of treated and non-treated individuals to estimate the average treatment effect?
What critical assumption is required to simply compare the average outcomes of treated and non-treated individuals to estimate the average treatment effect?
In the context of the job search assistance program, why does self-selection pose a problem for estimating treatment effects?
In the context of the job search assistance program, why does self-selection pose a problem for estimating treatment effects?
What does the notation $E(Y_{0i} | D_i = 1) \neq E(Y_{0i} | D_i = 0)$ imply in the context of self-selection?
What does the notation $E(Y_{0i} | D_i = 1) \neq E(Y_{0i} | D_i = 0)$ imply in the context of self-selection?
With self-selection, what is implied by $E(Y_{1i} \mid D_i = 1) \neq E(Y_{1i} \mid D_i = 0)$?
With self-selection, what is implied by $E(Y_{1i} \mid D_i = 1) \neq E(Y_{1i} \mid D_i = 0)$?
How does self-selection typically bias the estimation of the impact of a job search program on employment rates?
How does self-selection typically bias the estimation of the impact of a job search program on employment rates?
Assuming that a job search program significantly increases employment for participants, which statement best represents the likely relationship between $E(Y_{0i} | D_i = 1)$ and $E(Y_{0i} | D_i = 0)$ if participation is driven by motivation?
Assuming that a job search program significantly increases employment for participants, which statement best represents the likely relationship between $E(Y_{0i} | D_i = 1)$ and $E(Y_{0i} | D_i = 0)$ if participation is driven by motivation?
Consider a scenario where a job search program boasts a high success rate. However, participation is voluntary, and only individuals with extensive prior work experience opt to enroll. How would this self-selection bias likely affect the interpretation of the program's ATE and ATET?
Consider a scenario where a job search program boasts a high success rate. However, participation is voluntary, and only individuals with extensive prior work experience opt to enroll. How would this self-selection bias likely affect the interpretation of the program's ATE and ATET?
Flashcards
Causality (intuitive definition)
Causality (intuitive definition)
An action causes an effect if that effect would not have occurred without the action.
Y1i (potential outcome with treatment)
Y1i (potential outcome with treatment)
A unit's outcome if exposed to a treatment (Di = 1).
Y0i (potential outcome without treatment)
Y0i (potential outcome without treatment)
A unit's outcome if NOT exposed to a treatment (Di = 0).
Potential Outcomes
Potential Outcomes
Signup and view all the flashcards
Counterfactual Framework
Counterfactual Framework
Signup and view all the flashcards
Causal Effect (Δi)
Causal Effect (Δi)
Signup and view all the flashcards
Counterfactual Outcome
Counterfactual Outcome
Signup and view all the flashcards
Potential outcome 𝑌1𝑖
Potential outcome 𝑌1𝑖
Signup and view all the flashcards
Potential outcome 𝑌0𝑖
Potential outcome 𝑌0𝑖
Signup and view all the flashcards
Causal Effect ∆𝑖
Causal Effect ∆𝑖
Signup and view all the flashcards
Fundamental Problem of Causal Inference
Fundamental Problem of Causal Inference
Signup and view all the flashcards
Temporal Stability
Temporal Stability
Signup and view all the flashcards
Causal Transience
Causal Transience
Signup and view all the flashcards
Homogeneity of Units
Homogeneity of Units
Signup and view all the flashcards
Average Treatment Effect (ATE)
Average Treatment Effect (ATE)
Signup and view all the flashcards
Average Treatment Effect on the Treated (ATET)
Average Treatment Effect on the Treated (ATET)
Signup and view all the flashcards
Cream-skimming (selection)
Cream-skimming (selection)
Signup and view all the flashcards
Negative Selection (selection)
Negative Selection (selection)
Signup and view all the flashcards
Key Problem in Treatment Evaluation
Key Problem in Treatment Evaluation
Signup and view all the flashcards
Exogenous Variation
Exogenous Variation
Signup and view all the flashcards
Randomization solves selection
Randomization solves selection
Signup and view all the flashcards
ATE (Average Treatment Effect)
ATE (Average Treatment Effect)
Signup and view all the flashcards
ATET (Average Treatment Effect on the Treated)
ATET (Average Treatment Effect on the Treated)
Signup and view all the flashcards
Self-Selection Problem
Self-Selection Problem
Signup and view all the flashcards
Naive Estimation
Naive Estimation
Signup and view all the flashcards
E[Y0i | Di = 1]
E[Y0i | Di = 1]
Signup and view all the flashcards
E[Y0i | Di = 0]
E[Y0i | Di = 0]
Signup and view all the flashcards
Self-Selection Example (Job Search)
Self-Selection Example (Job Search)
Signup and view all the flashcards
E[Y0i | Di = 1] ≠ E[Y0i | Di = 0]
E[Y0i | Di = 1] ≠ E[Y0i | Di = 0]
Signup and view all the flashcards
E[Y1i | Di = 1] ≠ E[Y1i | Di = 0]
E[Y1i | Di = 1] ≠ E[Y1i | Di = 0]
Signup and view all the flashcards
Bias from Self-Selection
Bias from Self-Selection
Signup and view all the flashcards
Naive ATE Estimation
Naive ATE Estimation
Signup and view all the flashcards
Self-Selection Bias
Self-Selection Bias
Signup and view all the flashcards
ATET and the Bias Term
ATET and the Bias Term
Signup and view all the flashcards
Bias Term in Job Search Example
Bias Term in Job Search Example
Signup and view all the flashcards
Importance of Selection Bias
Importance of Selection Bias
Signup and view all the flashcards
Participation Decision
Participation Decision
Signup and view all the flashcards
Smaller Y0i
Smaller Y0i
Signup and view all the flashcards
Unequal Groups E[Y0i | Di = 1] ≠ E[Y0i | Di = 0]
Unequal Groups E[Y0i | Di = 1] ≠ E[Y0i | Di = 0]
Signup and view all the flashcards
Selection by Administrators
Selection by Administrators
Signup and view all the flashcards
Selection by Treatment Providers
Selection by Treatment Providers
Signup and view all the flashcards
Study Notes
Roadmap for Causality Lectures
- Focus will be on causality.
- Examination of how economists view causality.
- Discuss the difficulty in estimating causal effects without experimental research design.
- Why randomized trials are considered the ‘golden standard’.
- Guidance on analyzing data from experiments.
- Reasons why even randomized trials can fail to produce causal estimates.
- Examples of randomization in economics research.
Readings for Causality
- Angrist and Pischke, chapters 1 and 2.
- Wooldridge, chapter 1.4.
- Articles.
Why Focus on Causality?
- The most compelling research in economics frequently addresses cause-and-effect relationships.
- A causal relationship is valuable for forecasting the effects of changing policies.
- While it may serve other purposes, purely descriptive research isn't very helpful in that regard.
Causal Relations & Experimental Approach
- The experimental approach plays a crucial role when examining causal relationships.
- Modern microeconomics has increasingly adopted scientific experiments as the "golden standard" for inference.
- Throughout history, numerous scientific advancements have been facilitated through experiments, dating back to the Renaissance with figures like Galileo.
Questions Involving Causality in Economics
- Labour economics seeks to understand how participating in an active labor market program affects employment probabilities.
- Education economics aims to determine whether smaller classes improve study results.
- Health economics explores the impact of absolute/relative income on health and mortality.
- Macroeconomics investigates the effect of unanticipated changes in short-term interest rates on current and future activity.
- Crime economics questions whether more police presence reduces crime rates.
Causal Relations in Economics
- Researchers are interested in the causal effect of participating in a treatment on future outcomes.
- Models for analyzing causal effects are also models for treatment/policy/program evaluation.
- “Treatment” is defined broadly, including actions like attending university or getting married.
Definition of Causality
- An action causes a particular effect if the effect would not have occurred without the action.
Causality Formalized
- A framework for thinking about causality involves a population of N units, such as individuals, firms, or countries.
- An outcome variable Y and a variable D is observed for each unit.
- There is an assumption Y and D are correlated.
- Correlation does not imply causation.
- It is valuable to know under what specific circumstances it is possible to infer that D causes Y.
Causal Relations in Economics Defined
- "i" represents the index for a particular unit in the population.
- Dᵢ symbolizes a "treatment".
- Assume "treatment" is binary, either "yes" or "no".
- Dᵢ = 1 if unit i is exposed to treatment.
- Dᵢ = 0 if unit i is not exposed to treatment.
- Yᵢ(Dᵢ) represents the observed outcome.
Potential Outcome Model
- Defining potential outcomes of unit i creates a counterfactual framework.
- Each unit has two potential outcomes (Y₁i with treatment; Y₀i without).
- Y₁i and Y₀i refer to potential outcomes (treated/non-treated) for unit i whether the unit was actually treated.
Potential Outcome Examples
- An unemployed person seeking a job could participate in a job search program.
- Y₁i: potential labor market outcome if the person participated in the program.
- Y₀i: potential labor market outcome if the person did not participate.
- Those are potential outcomes unrelated to actual treatment status.
Definition of Causal Effect
- In potential outcome framework, only one potential outcome is observed
- The unobserved potential outcome is also called the counterfactual outcome.
- For any given unit, the impact of partaking in the treatment is:
- Δᵢ = Y₁ᵢ – Y₀ᵢ is the difference in potential outcomes
- This is also the definition of causal effect at the unit level
Causal Effect Example: Job Search Program
- Example: comparing unemployed workers who went to a job search program and those who didn't may incorrectly assess the program.
- Workers who went to the program may have worse labor market prospects.
- Potential outcomes:
- Y₁i: the employment outcome of person i going to the program
- Y₀i: the employment outcome of person i not going to the program
- The causal effect of participating in the program: Δᵢ = Y₁ᵢ – Y₀ᵢ
Fundamental Problem of Causal Interference
- Δᵢ is unobservable because only Y₁ᵢ or Y₀ᵢ can be observed.
- It is impossible to derive causality at the unit level because units cannot receive and not receive treatment at the same time.
- This difficult problem can be approached with either a "scientific“ or a statistical solution."
Overcoming Counterfactual Problems: Scientific Solutions
- Consider a unit i with the assumptions:
- Temporal Stability: Outcome y does not depend on when the treatment takes place.
- Causal Transience: Outcome y is independent of any prior treatment.
- Homogeneity of Units: An assumption for other units j ≠ i such that yi(xi) = yj(xj) for xi = xj
- Assumptions are used in natural sciences to infer causality.
- These are unlikely to hold in social sciences because the environment is is not controllable such as it might be in a lab.
Statistical Solution to the Counterfactual Problem
- Methods that compute the average causal effect for the entire population or subgroups of interest
- In economic literature, the Average Treatment Effect (ATE) is used.
- Formula: ATE = E[Δᵢ] = E[Y₁ᵢ - Y₀ᵢ] = E[Y₁ᵢ] - E[Y₀ᵢ]
- The ATE of Dᵢ = 1 gauges how much, on average, an individual benefits from receiving the defined treatment.
- ATE compares the potential outcome when all units receive the treatment to the potential outcome when no units receive treatment.
- Neither outcome, however, is typically directly observed.
Average Treatment Effect on the Treated (ATET) in Economics
- The average treatment effect on the treated describes how much on average the individuals who actually received treatment benefited from the treatment:
- E[∆ᵢ|Dᵢ = 1] = E[Y₁ᵢ - Y₀ᵢ |Dᵢ = 1] = E[Y₁ᵢ |Dᵢ = 1] – E[Y₀ᵢ |Dᵢ = 1] = 1, involves one unobserved counterfactual that indicates the potential outcome as untreated for those who actually received treatment.
ATE and ATET: Job Program Example
- For a job-search program:
- ATE answers: If all unemployed workers participate in the program, how much would employment increase?
- ATET answers: How much would employment increase for workers who selected into the program?
- The questions cannot readily be answered, because they require comparison the person's observed outcome to the counterfactual outcome.
Estimating ATE and ATET: Self-Selection
- A self-selection problem must be addressed to estimate average treatment effects:
- E[Y₁ᵢ|Dᵢ = 1]- E[Y₀ᵢ|Dᵢ = 0].
- In other words, can't average outcomes can just be compared for those who were treated to outcomes for those not treated?
- This would require the average potential outcome in a non-treated condition to be the same for treated individuals as for non-treated individuals. -E[Y₀ᵢ|Dᵢ = 1] = E[Y₀ᵢ|Dᵢ = 0]
- This is an unlikely event without an experimental variable.
Example of Self-Selection
- Workers motivated to find work are more likely to participate in a job search program.
- The above individuals potential outcomes, treated and untreated, are both more motivating than those less motivated to find work.
- Potential outcomes are not independent of actual treatment status.
Self-Selection Conundrum
- With self-selection:
- E[Y₀ᵢ | Dᵢ = 1] ≠ E[Y₀ᵢ | Dᵢ = 0] and E[Y₁ᵢ | Dᵢ = 1] ≠ E[Y₁ᵢ | Dᵢ = 0].
- The potential outcomes as untreated for those who were treated are not the same as the potential outcomes as untreated for those who were not treated.
- E[Y₀ᵢ | Dᵢ = 1] is not the same as E[Y₀ᵢ | Dᵢ = 0].
- Likewise the potential outcomes as treated for those who were treated are not the same as the potential outcomes as treated for those who were not treated.
- E[Y₁ᵢ | Dᵢ = 1] is not the same as E[Y₁ᵢ | Dᵢ = 0].
Formalizing Bias Resulting from Self-Selection
- It is possible to formalize self-selection induced bias.
- The Average Treatment Effect (ATE) is naively estimated via:
- E[Y₁ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 0]
- Adding and subtracting E[Y₀ᵢ|Dᵢ = 1] to the expression above produces:
- E[Y₁ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 1] + E[Y₀ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 0]
- Simplifies the yield: -E[Y1i - Yoi |Di = 1] + E[Yoi|Di = 1] - E[Yoi|Di = 0]
- Which can be written as:
- E[Y1i - Yoi |Di = 1] + (a bias term)
- A result gives ATET + bias term.
Example of Bias
- The bias term, E[Y₀ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 0], captures the difference in non-treated employment outcomes between those who did and did not get treatment.
- E[Y₀ᵢ|Dᵢ = 1] - E[Y₀ᵢ|Dᵢ = 0] > 0, since selecting a treatment is in some way dependent on the individual feeling it will improve some aspect of their lives where intervention is desired.
Why Selection Bias Matters
- Selection bias arises from optimizing decisions by rational actors.
- I am in it if worth it.
- D = 1 if Y1i - Yoi > c ,
- C represents the mental and monetary cost of participating in the treatment.
- The above can then be used to write:
- E[Yoi |Di = 1] = E[Yoi |Yoi < Y1i -c]
- E[Yoi |Di = 0] = E[Yoi |Yoi > Y1i -c]
- With groups differing in terms of comparative advantages.
- Y1i - Yoi is large for some, small for others.
- The most simple instances: treatment participants have smaller Yoi, thus larger potential gain.
Other Selection Sources
- May occur due to administrative rules or by selection coming from treatment providers.
- "Cream-skimming": Choosing “the best” suggests that E[Yoi|Di = 1] > E[Yoi|Di = 0].
- Selection bias also results come from negative treatment:
- Negative Treatment Example: Placing "weak" kids in small classes produces the formula:
- E[Yoi|Di = 1] < E[Yoi|Di = 0]
The Key Problem for Treatment Evaluation
- Key unobservable/unestimatable values with the formula E[Y₀ᵢ|Dᵢ = 1], E[Y₁ᵢ|Dᵢ = 0], E[Y₁ᵢ] and E[Y₀ᵢ]. Treatment evaluation's primary issue is isolating something, with exogenous variation, that only influences treatment assignment and not potential outcomes.
- In other words, the treatment status to be independent of potential outcomes and have a valid couterfactual to evaluate treatment
Randomization Solves the Self-Selection Problem
- Social experiments randomize treatment assignment across individuals via a lottery
- Randomization implies treatment assignment that is statistically independent of potential outcomes and other variables:
- (Y0i, Y1i) ┴ Dį
- In the job search example: using a lottery to assign individuals.
- The "treatment" is independent of potential outcomes.
How Randomization Solves the Selection Problem: Example
- In a job search, motivated workers often selected treatment.
- Randomizing treatment creates equal distributions of motivation in treatment/non treatment groups.
- With completely random treatment the groups will look alike, removing selection
- With treatment status the sole group differentiator, each group counteracts the others' factuality.
Randomization Solves The Selection Problem Concluded
- With randomization, treatment status is independent of potential outcomes with these mathematical effects:
- E[Y1i|Di = 1] = E[Y1i|Di = 0] = E[Y1i]
- E[Yoi|Di = 1] = E[Yoi|Di = 0] = E[Yoi]
- In a job search program, this implies the following:
Randomization Solves the Selection Problem Concluded 2
- Random assignment of Dᵢ eliminates selection.
- ATET = E[Y1i|Di = 1] – E[Yoi|Di = 1]=E[Y1i|Di = 1] – E[Yoi|Di = 0]
- We swap out E[Y1i|Di = 1] for E[Yoi|Di = 0] because treatment status is independent of potential outcomes.
- As a result, E[Y1i|Di = 1] and E[Yoi|Di = 0] are observed.
- E[Y1i]) is the same as the potential outcomes of those who actually took the program E[Y1i|Di = 1]).
- With randomization ATE = ATET as groups are similar despite treatment.
Important Effects of Randomization- World Evidence
- In the social sciences randomization isn't always possible and control strategies (controlling factors that differentiate outcomes) are frequently applied instead
- There are many occasions where both experiments are tried to find a similar result.
- This allows comparison for better alignment with experimental ideal
- Sadly, medicine and economics studies show randomization is superior to control strategies
How Important is Randomization- HRT Medicine Example?
- Recent instances come from hormone replacement therapy investigation
- Evidence from the Nurse's Health Study, a large, influential non experimental survey, showed better health with HRT used with a control strategy
- Results from similar randomized trails showed exceeding, harmful health risks.
-
- The prior positive reflection was self-selective, in that health conscious women got selected which explains the ATET + bias term
Important Effects of Randomization World Economics Example
- There are few, non experimental studies with labor market training paradoxically find less earnings from participants compared to nonparticipants.
- One may suspect selection bias from training serving low income individuals leading to naive comparisons
- Randomized training programs, however, boast positive effects.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
Explore causality, potential outcomes, and counterfactuals. Understand the notation, significance, and interpretation of causal effects. Learn about observed and unobserved outcomes in causal inference studies.