Podcast
Questions and Answers
What does ATE stand for in the context of causal inference?
What does ATE stand for in the context of causal inference?
- Actual Treatment Estimate
- Adjusted Treatment Effect
- Average Treatment Effect (correct)
- Average Total Effect
What does the letter B represent in the provided equation?
What does the letter B represent in the provided equation?
- Base effect of training
- Benefit of treatment over control
- Selection bias between groups (correct)
- Bias related to treatment differences
Which implementation design involves a control group receiving training later?
Which implementation design involves a control group receiving training later?
- Encouragement design
- Random assignment design
- Phase in design (correct)
- Direct control design
What is a significant issue with the noncompliance in RCTs?
What is a significant issue with the noncompliance in RCTs?
What does the term LATE refer to in the context of RCTs?
What does the term LATE refer to in the context of RCTs?
Which of the following is NOT an implementation design for RCTs mentioned?
Which of the following is NOT an implementation design for RCTs mentioned?
What type of effect is achieved when the number treated in the treatment group exceeds those in the control group?
What type of effect is achieved when the number treated in the treatment group exceeds those in the control group?
Which of these is NOT one of the issues associated with the implementation of RCTs?
Which of these is NOT one of the issues associated with the implementation of RCTs?
What does 'externally changing training status' refer to in the context of causal effects?
What does 'externally changing training status' refer to in the context of causal effects?
What is the fundamental problem of causal inference highlighted in the content?
What is the fundamental problem of causal inference highlighted in the content?
What does ATT stand for in the context of treatment effects?
What does ATT stand for in the context of treatment effects?
Why is selection bias a critical issue in estimating causal effects?
Why is selection bias a critical issue in estimating causal effects?
How can the average earnings of non-trainees introduce bias?
How can the average earnings of non-trainees introduce bias?
What implication can be drawn from the statement 'ATT ≠ATE'?
What implication can be drawn from the statement 'ATT ≠ATE'?
Why should one avoid controlling for factors determined after training assignment?
Why should one avoid controlling for factors determined after training assignment?
What is meant by 'heterogeneous treatment effects' in the content?
What is meant by 'heterogeneous treatment effects' in the content?
What is the purpose of the Law of Iterated Expectations in causal inference?
What is the purpose of the Law of Iterated Expectations in causal inference?
In the regression equation $Y_i = \beta + \alpha D_i + \epsilon_i$, what does $\hat{\alpha}_{OLS}$ represent?
In the regression equation $Y_i = \beta + \alpha D_i + \epsilon_i$, what does $\hat{\alpha}_{OLS}$ represent?
What is a significant challenge with OLS estimation in causal inference?
What is a significant challenge with OLS estimation in causal inference?
Which statement best describes the relationship between covariance and the OLS coefficient in the given regression?
Which statement best describes the relationship between covariance and the OLS coefficient in the given regression?
Which aspect is emphasized by the specification of the selection into treatment?
Which aspect is emphasized by the specification of the selection into treatment?
What is the implication of $E(D_i) = P(D_i)$ in the context of causal inference?
What is the implication of $E(D_i) = P(D_i)$ in the context of causal inference?
What advantage does the Law of Iterated Expectations provide in statistical analysis?
What advantage does the Law of Iterated Expectations provide in statistical analysis?
How does the term $E(Y_i|D_i = 1) - E(Y_i|D_i = 0)$ relate to the OLS estimation?
How does the term $E(Y_i|D_i = 1) - E(Y_i|D_i = 0)$ relate to the OLS estimation?
What is the implication of a zero covariance in the extended Roy model?
What is the implication of a zero covariance in the extended Roy model?
Which of the following statements about quantile treatment effects is true?
Which of the following statements about quantile treatment effects is true?
Which method can identify λ(zγ) in the identification process of the Roy model?
Which method can identify λ(zγ) in the identification process of the Roy model?
What do the residuals of the regression identify in the identification of the Roy model?
What do the residuals of the regression identify in the identification of the Roy model?
In the context of quantile treatment effects, when do the effects satisfy the equality Q(Y1) − Q(Y0) = Q(Y1 − Y0)?
In the context of quantile treatment effects, when do the effects satisfy the equality Q(Y1) − Q(Y0) = Q(Y1 − Y0)?
What does the identification process in the extended Roy model help to uncover?
What does the identification process in the extended Roy model help to uncover?
What is a limitation of quantile treatment effects concerning causal inference?
What is a limitation of quantile treatment effects concerning causal inference?
Which statistic is necessary to ascertain Cov(U0, U1) in the extended Roy model?
Which statistic is necessary to ascertain Cov(U0, U1) in the extended Roy model?
What is the primary issue that the generalized Roy model addresses regarding training selection?
What is the primary issue that the generalized Roy model addresses regarding training selection?
In the generalized Roy model, what does the variable R represent?
In the generalized Roy model, what does the variable R represent?
Which assumption is crucial for the correctness of a linear specification in the generalized Roy model?
Which assumption is crucial for the correctness of a linear specification in the generalized Roy model?
What does the notation Φ represent in the probability equation provided in the generalized Roy model?
What does the notation Φ represent in the probability equation provided in the generalized Roy model?
How is the cost of training defined in the generalized Roy model?
How is the cost of training defined in the generalized Roy model?
What does the variable ν represent in the equation R = Zγ + ν?
What does the variable ν represent in the equation R = Zγ + ν?
What does the term 'self-selection' imply in the context of training programs?
What does the term 'self-selection' imply in the context of training programs?
Which of the following is a critical component of the net benefit R computation in the generalized Roy model?
Which of the following is a critical component of the net benefit R computation in the generalized Roy model?
What does the equation E(U1 − U0 |D = 1) represent in the context of selection bias?
What does the equation E(U1 − U0 |D = 1) represent in the context of selection bias?
In the Roy model, what is implied if individuals with high returns tend to self-select into training?
In the Roy model, what is implied if individuals with high returns tend to self-select into training?
According to the information, what is a consequence of voluntary participation in treatment?
According to the information, what is a consequence of voluntary participation in treatment?
What is the effect of OLS estimators in the context of selection bias as suggested in the content?
What is the effect of OLS estimators in the context of selection bias as suggested in the content?
What does the note regarding RCTs among voluntary candidates indicate?
What does the note regarding RCTs among voluntary candidates indicate?
What limitation is mentioned about RCTs conducted on a full population?
What limitation is mentioned about RCTs conducted on a full population?
How might employment agencies influence selection bias according to the discussion?
How might employment agencies influence selection bias according to the discussion?
What does the concept of 'positive discrimination' lead to according to the content?
What does the concept of 'positive discrimination' lead to according to the content?
Flashcards
Causal Effect of Training
Causal Effect of Training
The difference in someone's earnings caused by them participating in a training program, holding other factors constant.
Determinants of Earnings/Training
Determinants of Earnings/Training
Factors that influence a person's earnings or decision to participate in training, like their education level, health, or motivation.
Fundamental Problem of Causal Inference
Fundamental Problem of Causal Inference
The problem of not being able to observe how someone (like an untrained worker) would have performed had they been in the other state (in training).
Selection Bias in Training
Selection Bias in Training
Signup and view all the flashcards
Average Treatment Effect (ATE) vs. Average Treatment Effect on the Treated (ATT)
Average Treatment Effect (ATE) vs. Average Treatment Effect on the Treated (ATT)
Signup and view all the flashcards
Heterogeneous Treatment Effects
Heterogeneous Treatment Effects
Signup and view all the flashcards
Selection on Unobservables
Selection on Unobservables
Signup and view all the flashcards
Post-Treatment Variables
Post-Treatment Variables
Signup and view all the flashcards
ATE
ATE
Signup and view all the flashcards
Selection Bias
Selection Bias
Signup and view all the flashcards
ITT Effect
ITT Effect
Signup and view all the flashcards
LATE
LATE
Signup and view all the flashcards
RCTs
RCTs
Signup and view all the flashcards
Implementation of RCTs
Implementation of RCTs
Signup and view all the flashcards
Noncompliance
Noncompliance
Signup and view all the flashcards
Hawthorne effects
Hawthorne effects
Signup and view all the flashcards
Causal Effect
Causal Effect
Signup and view all the flashcards
Selection into Treatment
Selection into Treatment
Signup and view all the flashcards
OLS Estimation
OLS Estimation
Signup and view all the flashcards
Law of Iterated Expectations (LIE)
Law of Iterated Expectations (LIE)
Signup and view all the flashcards
Randomized Controlled Trial (RCT)
Randomized Controlled Trial (RCT)
Signup and view all the flashcards
ATE vs. ATT
ATE vs. ATT
Signup and view all the flashcards
Generalized Roy Model
Generalized Roy Model
Signup and view all the flashcards
Linear Specification Correctness
Linear Specification Correctness
Signup and view all the flashcards
Normal Distribution of Unobserved Factors
Normal Distribution of Unobserved Factors
Signup and view all the flashcards
Probability of Training Participation
Probability of Training Participation
Signup and view all the flashcards
Idiosyncratic Return to Training
Idiosyncratic Return to Training
Signup and view all the flashcards
Individual Treatment Effect (ITE)
Individual Treatment Effect (ITE)
Signup and view all the flashcards
Average Treatment Effect (ATE)
Average Treatment Effect (ATE)
Signup and view all the flashcards
Average Treatment Effect on the Treated (ATT)
Average Treatment Effect on the Treated (ATT)
Signup and view all the flashcards
Selection on Gains
Selection on Gains
Signup and view all the flashcards
Inverse Mills Ratio
Inverse Mills Ratio
Signup and view all the flashcards
Instrumental Variables (IV) Regression
Instrumental Variables (IV) Regression
Signup and view all the flashcards
Quantile Treatment Effects vs. Gains to Treatment
Quantile Treatment Effects vs. Gains to Treatment
Signup and view all the flashcards
Identifying Gains to Treatment in the Roy Model
Identifying Gains to Treatment in the Roy Model
Signup and view all the flashcards
Identifying F(Y1 - Y0) in the Roy Model
Identifying F(Y1 - Y0) in the Roy Model
Signup and view all the flashcards
Cost-Benefit Analysis in the Extended Roy
Cost-Benefit Analysis in the Extended Roy
Signup and view all the flashcards
Critique of Statistical Approach: Assumption Dependence
Critique of Statistical Approach: Assumption Dependence
Signup and view all the flashcards
Critique of Statistical Approach: Unobserved Potential Outcomes
Critique of Statistical Approach: Unobserved Potential Outcomes
Signup and view all the flashcards
Econometric Critique: Emphasis on Economic Mechanisms
Econometric Critique: Emphasis on Economic Mechanisms
Signup and view all the flashcards
Debate Between Statistical and Econometric Approaches
Debate Between Statistical and Econometric Approaches
Signup and view all the flashcards
Study Notes
Causal Inference in Microeconometrics with Applications to Program Evaluation
- Lecture 1: Introduction by Bart Cockx, Ghent University, 2024
- Course Structure
- 11 three-hour sessions on Fridays from October 4th to December 20th, 2024 (excluding November 1st)
- Two two-hour sessions per session, each 50 minutes long, with 15-minute breaks and a 1-hour lunch break
- Low quality recordings available via MS Teams
- Guest lecturer (William Pariente) covering Randomized Controlled Trials (RCTs) on November 29th, December 6th, and 13th, addressing a particular topic
Evaluation Method
- Attendance is mandatory.
- -1 point for each unjustified absence.
- -2 points for any absence from the 3rd absence onwards.
- Assignments (10 points)
- Groups of 2–3 students
- Registration on Ufora
- Model solutions posted on the platform
- Support from Natalia Bermudez and Giulia Tarullo
- Presentations (10 points)
- Group presentations on research or lecture-related topics
- Maximum of 3 students per group
- Options for selected papers or self-selected research
- Aim: explaining lecture materials in own words (re-explaining lectures in own words)
- Time: 15 min presentation + 5 mins discussion
- Deadline: end of January (date to be confirmed)
Teaching Material
- All lecture materials, including slides, recordings, and documents, are available on Ufora.
- Non-Ghent University Students need official registration on OASIS to access and take the exam.
- Required Reading: Blundell and Costa Dias (2009)
- General Complementary Readings: (specific readings are described in individual lectures)
- Imbens and Wooldridge (2009) or Angrist and Pischke (2009)
- Cunningham (2021) (free online resources included)
- Book of Imbens and Rubin (2015)
- Abadie and Cattaneo (2018)
- Huber (2019)
- Arkhangelsky and Imbens (2024) (recent overviews)
- Imbens (2022) and Imbens and Xu (2024) (Credibility Revolution)
What is Causality?
- Key questions investigated: smoking and lung cancer, aspirin and heart attacks, unemployment training, schooling, FDI
- Focus is on identifying causal relationships
Example: Training of Unemployed Workers
- Causality: The impact of participation in training on earnings, keeping other earning/training factors constant
- Determinants: Characteristics (education, motivation), predetermined factors (working time reduction, unemployment)
- Fundamental Problem: Observing individuals in one state only (training or not) presents an issue of missing counterfactual data.
- Selection Bias: Replacing missing data with the average earnings of non-trainees can be misleading due to workers with high/low earning capacities.
Random Assignment
- The Statistical Solution: A method for identifying the average causal effect when randomization is possible.
- Random assignment balances the treatment and control groups.
- Random assignment as "gold standard": Randomization is preferred to avoid bias in identifying/isolating causal effects. Methods of identifying causal effects in cases where randomization isn't possible are required, for example by utilizing "natural experiments." These alternative methods include causal inference methodologies such as (propensity score) matching, re-weighting, regression, (Difference-in-differences (DiD), Regression Discontinuity Design (RDD), and Instrumental Variables (IV).
Limitations and Key Ingredients of Modern Causal Inference
- Multiple methods are needed to detect and exploit hidden forms of randomization when randomization isn't possible
- Non-experimental methods hinge on untestable assumptions.
- Identification of treatment effect is fundamentally non-parametric and requires precise characterization of the affected population.
Overview of Identification Methods
- Methods Based on Unconfoundedness/CIA:
- Difference-in-differences (DID): Comparing outcomes in groups that are not affected by treatment before/after the treatment is introduced.
- Regression Discontinuity Design (RDD): Evaluating treatment effect through comparisons around a treatment threshold, for example, a cutoff
- Instrumental Variables (IV): Identifying causal effects through correlation, using an instrument (Z) correlated with treatment (T) but not with outcome (Y) directly.
Outline of the Course
- Major Topics:
- The Problem of Causality in Microeconometrics
- Methods Based on Unconfoundedness
- Difference-in-Differences (DID), Instrumental Variables (IV), Regression Discontinuity Design (RDD)
- Randomized Controlled Trials
- Causal Machine Learning
- Lecture by a Guest Lecturer covering Randomized Controlled Trials (RCTs)
Outline for the continuation of this lecture
- Focus on the problem of causality
- Formal frameworks to understand causality
- RCTs as a solution
- Causality in a regression framework
- Specification issues with selection
- Critique of OLS Estimation
Sources for the Continuation of this Lecture
- Key sources influencing/used in the lecture discussed in the presented lecture.
A Formal Framework about Causality
- Correlation vs. Causation: Correlation does not imply causation. Confounding factors and reverse causality can confound the relationship.
The Fundamental Problem of Causal Inference
- Definition : Causal effect is the difference in potential outcomes between the treatment and the control group.
- Proposition : The causal effect is logically unobservable without counterfactual evidence
The Statistical Solution
- ATE: Average effect of treatment on the treated.
- ATT: Average treatment effect on the treated
Is Comparison by Treatment Status Informative for ATT?
- Bias: Comparison by treatment status is biased because outcomes of treated and control subjects are not identical in the absense of treatment.
- Solution: The bias caused by the difference of the outcomes for both control and treatment groups in the absence of treatment must be evaluated to accurately calculate the ATT.
RCTs as Solution
- Randomization solves fundamental problems of causal inference.
- Treatment/control groups are statistically equivalent.
- Practical Implementation: Random assignment via lottery, phased implementation, encouragement designs
Some Issues with Implementation of RCTs
- Noncompliance: Not all subjects adhere to the study's planned treatment / control allocation.
- Other Issues: Issues include Hawthorne effects, external validity, spillover effects.
Causality in a regression framework
- Regression equation relating outcome variable Y to treatment variable D.
- Issues with OLS estimation: Selection into treatment, and unobservables that correlate with the treatment or outcome may induce bias, thereby impacting outcomes.
- Law of Iterated Expectations is a key principle for understanding potential issues and suggesting potential solutions
Specification of the selection into treatment
- The participation rule determining individuals' involvement in treatment, which links the participation decision to the outcome(s) of interest.
The model in the compact form
- Compact representation of the causal inference model
The statistical effects of treatment in this model
- Treatment effect (ATE/ATT): the estimated change in the outcome based on the treatment variable
- Idiosyncratic gain from treatment
Problems with OLS Estimation
- Bias in the estimated treatment effect (ATE/ATT) resulting from selection bias.
The Generalized Roy Model
- Model framework: Modeling earnings to isolate the effects of treatment / training.
- Cost of training: Factor that determines the costs or benefits of training, for instance, the costs of attending training; costs dependent on location of event to participation
Alternative reasons for selection bias, for example, participation of individuals in programs for reasons other than economic reasons
Limitations/Critiques of/on the NRH Causal Model
- Potential outcomes are invariant to assignment
- Social or equilibrium interactions are not explicitly accounted
Limitations/Critiques (5)
- Focus on objective outcomes rather than subjective evaluations
- Causal effects of treatment on subjective outcomes/measures of welfare not modelled
- Effects where one event can influence another are not easily captured by the model
Limitations/Critiques (4)
- The causal effect is conceptualised as a black box or opaque process, lacking theoretical explanation
- Historical or previous interventions may not inform / predict future outcomes; limitations associated with external validity, for instance
- External validity limited unless there is a guarantee that interventions are replicated across similar or comparable situations / environments
Limitations/Critiques (3)
- The proposed model only considers realized outcomes; it does not quantify the impact of subjective evaluations of the outcome of interest such as welfare
- Limitations associated with focusing on only objective outcomes rather than considering anticipated outcomes
Identification of F(Y1-Y0) in the (extended) Roy model
- Implications for identifying the proportion of individuals benefiting from treatment in the Roy model; relationships between variables such as Z, Y, and the treatment factor
Some Important Neglected Methods
- Statistical bounds
- Bunching
- Timing of Events
- Designed-Based Identification
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.