Podcast
Questions and Answers
In a Fuzzy Regression Discontinuity Design (RDD), how does the treatment probability change at the cutoff?
In a Fuzzy Regression Discontinuity Design (RDD), how does the treatment probability change at the cutoff?
- It decreases the probability of treatment, acting as a deterrent.
- It switches treatment on and off completely, similar to a sharp RDD.
- It increases the probability of treatment but doesn't guarantee treatment. (correct)
- It remains constant, as the treatment assignment is independent of the running variable.
What key assumption must hold true for Regression Discontinuity Design (RDD) to provide a valid estimate of a causal effect?
What key assumption must hold true for Regression Discontinuity Design (RDD) to provide a valid estimate of a causal effect?
- Units far below and far above the cutoff are similar and comparable.
- Units just below and just above the cutoff are similar and comparable and cannot manipulate their running variable. (correct)
- There is no relationship between the running variable and the probability of treatment.
- The treatment effect is constant across all values of the running variable.
Which of the following is a threat to the validity of a Regression Discontinuity Design (RDD)?
Which of the following is a threat to the validity of a Regression Discontinuity Design (RDD)?
- A clear and distinct cutoff point.
- Individuals sorting around the cutoff. (correct)
- The ability to extrapolate causal effects far from the cutoff.
- A large amount of data around the cutoff.
What does the control group's outcome after the treatment period represent in Difference-in-Differences (DiD)?
What does the control group's outcome after the treatment period represent in Difference-in-Differences (DiD)?
Which of the following assumptions is most critical for the validity of a Difference-in-Differences (DiD) design?
Which of the following assumptions is most critical for the validity of a Difference-in-Differences (DiD) design?
In the 2x2 regression model for Difference-in-Differences (DiD), $y_{it} = α + β \cdot treated_i + γ \cdot after_t + δ \cdot treated_i \cdot after_t + u_{it}$, what does the coefficient $δ$ represent?
In the 2x2 regression model for Difference-in-Differences (DiD), $y_{it} = α + β \cdot treated_i + γ \cdot after_t + δ \cdot treated_i \cdot after_t + u_{it}$, what does the coefficient $δ$ represent?
In a Regression Discontinuity Design (RDD), what is primarily used to estimate the causal effect of a treatment?
In a Regression Discontinuity Design (RDD), what is primarily used to estimate the causal effect of a treatment?
Which test helps to validate the assumption that individuals cannot precisely manipulate their running variable in a Regression Discontinuity Design (RDD)?
Which test helps to validate the assumption that individuals cannot precisely manipulate their running variable in a Regression Discontinuity Design (RDD)?
In a Difference-in-Differences (DID) model, what does the coefficient on the interaction term (Treated * After) represent?
In a Difference-in-Differences (DID) model, what does the coefficient on the interaction term (Treated * After) represent?
What is the key identifying assumption that must hold for a Difference-in-Differences (DID) analysis to provide a valid causal estimate?
What is the key identifying assumption that must hold for a Difference-in-Differences (DID) analysis to provide a valid causal estimate?
How can researchers provide evidence supporting the parallel trends assumption in a Difference-in-Differences (DID) analysis?
How can researchers provide evidence supporting the parallel trends assumption in a Difference-in-Differences (DID) analysis?
What is a 'common shock' in the context of Difference-in-Differences (DID) analysis, and why is it important to consider?
What is a 'common shock' in the context of Difference-in-Differences (DID) analysis, and why is it important to consider?
In the context of Difference-in-Differences (DID) with staggered treatment adoption, what is a key challenge when comparing groups?
In the context of Difference-in-Differences (DID) with staggered treatment adoption, what is a key challenge when comparing groups?
In a study examining the impact of air pollution on infant health, what is a potential confounding factor that needs to be addressed?
In a study examining the impact of air pollution on infant health, what is a potential confounding factor that needs to be addressed?
In the context of DID analysis, what is the implication of heterogeneous treatment effects (where treatment effects differ over time)?
In the context of DID analysis, what is the implication of heterogeneous treatment effects (where treatment effects differ over time)?
What type of errors can arise when analyzing data with staggered treatment timing and heterogeneous treatment effects using standard DID methods?
What type of errors can arise when analyzing data with staggered treatment timing and heterogeneous treatment effects using standard DID methods?
In Regression Discontinuity Design (RDD), what key assumption must hold true to ensure valid causal inference?
In Regression Discontinuity Design (RDD), what key assumption must hold true to ensure valid causal inference?
What does a noticeable jump in the number of treated observations around the cutoff point in an RDD suggest?
What does a noticeable jump in the number of treated observations around the cutoff point in an RDD suggest?
What is the purpose of falsification tests in Regression Discontinuity Design (RDD)?
What is the purpose of falsification tests in Regression Discontinuity Design (RDD)?
Why is it important to analyze predetermined values in the same way as the outcome of interest in RDD?
Why is it important to analyze predetermined values in the same way as the outcome of interest in RDD?
In the context of Regression Discontinuity Design (RDD), what is the primary purpose of conducting a placebo test by replacing the true cutoff value with a fake cutoff value?
In the context of Regression Discontinuity Design (RDD), what is the primary purpose of conducting a placebo test by replacing the true cutoff value with a fake cutoff value?
What does the 'local randomization' interpretation of RDD imply about the generalizability of the findings?
What does the 'local randomization' interpretation of RDD imply about the generalizability of the findings?
What is the primary trade-off when selecting the bandwidth size in RDD?
What is the primary trade-off when selecting the bandwidth size in RDD?
How can authors demonstrate the robustness of their RDD results?
How can authors demonstrate the robustness of their RDD results?
When using Instrumental Variables (IV) to estimate causal effects, which condition is the MOST challenging to empirically verify?
When using Instrumental Variables (IV) to estimate causal effects, which condition is the MOST challenging to empirically verify?
What is the purpose of the first-stage regression in an Instrumental Variables (IV) analysis?
What is the purpose of the first-stage regression in an Instrumental Variables (IV) analysis?
Which of the following scenarios would MOST likely violate the exogeneity assumption in an Instrumental Variables (IV) regression?
Which of the following scenarios would MOST likely violate the exogeneity assumption in an Instrumental Variables (IV) regression?
In the context of Instrumental Variables (IV), what does the term 'reduced form' refer to?
In the context of Instrumental Variables (IV), what does the term 'reduced form' refer to?
Suppose researchers are studying the effect of education (T) on income (Y) and use proximity to a college as an instrument (Z). What would constitute a violation of the exclusion restriction?
Suppose researchers are studying the effect of education (T) on income (Y) and use proximity to a college as an instrument (Z). What would constitute a violation of the exclusion restriction?
A researcher uses rainfall shocks (Z) as an instrument for economic shocks (T) when studying the effect on conflict (Y). Which condition is MOST directly tested by examining the correlation between rainfall shocks and pre-existing conflict levels?
A researcher uses rainfall shocks (Z) as an instrument for economic shocks (T) when studying the effect on conflict (Y). Which condition is MOST directly tested by examining the correlation between rainfall shocks and pre-existing conflict levels?
In an Instrumental Variables (IV) framework, the ratio of the reduced form effect to the first-stage effect can be interpreted as:
In an Instrumental Variables (IV) framework, the ratio of the reduced form effect to the first-stage effect can be interpreted as:
When is an Instrumental Variable (IV) strategy MOST appropriate?
When is an Instrumental Variable (IV) strategy MOST appropriate?
In the context of evaluating public housing demolition, what is the primary purpose of conducting balance tests between the displaced (T) and non-displaced (C) groups?
In the context of evaluating public housing demolition, what is the primary purpose of conducting balance tests between the displaced (T) and non-displaced (C) groups?
What key assumption must hold true to ensure the validity of a quasi-experimental study examining the effects of public housing demolition on displaced residents?
What key assumption must hold true to ensure the validity of a quasi-experimental study examining the effects of public housing demolition on displaced residents?
In the context of instrumental variables, what is the role of an 'exogenous variable'?
In the context of instrumental variables, what is the role of an 'exogenous variable'?
In an experiment with imperfect compliance, some participants randomized to the treatment group do not receive the treatment, and some in the control group do. Why does randomization still provide value in this scenario?
In an experiment with imperfect compliance, some participants randomized to the treatment group do not receive the treatment, and some in the control group do. Why does randomization still provide value in this scenario?
What does the Intention-To-Treat (ITT) effect measure in the context of an experiment with imperfect compliance?
What does the Intention-To-Treat (ITT) effect measure in the context of an experiment with imperfect compliance?
If a researcher is interested in estimating the treatment effect specifically for 'compliers' in an instrumental variables setting, which effect are they trying to estimate?
If a researcher is interested in estimating the treatment effect specifically for 'compliers' in an instrumental variables setting, which effect are they trying to estimate?
In the context of estimating treatment effects with instrumental variables, what is a key challenge in directly observing 'compliers'?
In the context of estimating treatment effects with instrumental variables, what is a key challenge in directly observing 'compliers'?
In the context of quasi-experiments, which of the following scenarios best exemplifies the use of an exogenous variable to induce variation in treatment?
In the context of quasi-experiments, which of the following scenarios best exemplifies the use of an exogenous variable to induce variation in treatment?
What is the purpose of examining 'heterogeneous effects' in a study?
What is the purpose of examining 'heterogeneous effects' in a study?
A researcher aims to study the effect of a new educational program (treatment) on student test scores (outcome). The researcher notices that students who enroll in the program are generally more motivated and have higher baseline scores than those who don't. What type of selection issue does this scenario represent?
A researcher aims to study the effect of a new educational program (treatment) on student test scores (outcome). The researcher notices that students who enroll in the program are generally more motivated and have higher baseline scores than those who don't. What type of selection issue does this scenario represent?
What is a key limitation of using Randomized Controlled Trials (RCTs) when studying certain social phenomena?
What is a key limitation of using Randomized Controlled Trials (RCTs) when studying certain social phenomena?
In a study examining the impact of neighborhood quality on children's educational outcomes, what challenge does 'selection on unobservables' pose when comparing families living in different neighborhoods?
In a study examining the impact of neighborhood quality on children's educational outcomes, what challenge does 'selection on unobservables' pose when comparing families living in different neighborhoods?
A researcher is studying the effect of a new job training program on employment rates. They compare individuals who voluntarily enroll in the program to those who do not. What potential bias should the researcher be most concerned about?
A researcher is studying the effect of a new job training program on employment rates. They compare individuals who voluntarily enroll in the program to those who do not. What potential bias should the researcher be most concerned about?
A researcher uses a Regression Discontinuity Design (RDD) to evaluate the impact of receiving a scholarship on college graduation rates. What key assumption underlies the validity of this approach?
A researcher uses a Regression Discontinuity Design (RDD) to evaluate the impact of receiving a scholarship on college graduation rates. What key assumption underlies the validity of this approach?
A researcher aims to investigate the impact of a new environmental regulation on the profitability of manufacturing firms. They plan to use a Difference-in-Differences (DiD) approach, comparing firms in regions that implemented the regulation to firms in regions that did not. What is a critical assumption for the validity of the DiD approach?
A researcher aims to investigate the impact of a new environmental regulation on the profitability of manufacturing firms. They plan to use a Difference-in-Differences (DiD) approach, comparing firms in regions that implemented the regulation to firms in regions that did not. What is a critical assumption for the validity of the DiD approach?
Consider a scenario where a policy change affects only a specific industry. To analyze the causal effect of this policy on firm performance, which quasi-experimental method would be most appropriate if you have data on firm performance before and after the policy change for both the affected industry and a similar, unaffected industry?
Consider a scenario where a policy change affects only a specific industry. To analyze the causal effect of this policy on firm performance, which quasi-experimental method would be most appropriate if you have data on firm performance before and after the policy change for both the affected industry and a similar, unaffected industry?
Flashcards
Treatment (T)
Treatment (T)
Variable of interest or manipulated variable
Outcome variable (Y)
Outcome variable (Y)
The outcome or the effect being measured
Observational data
Observational data
Data collected from natural societal functions, not direct experiments.
Observational study
Observational study
Signup and view all the flashcards
Selection on observables
Selection on observables
Signup and view all the flashcards
Selection on unobservables
Selection on unobservables
Signup and view all the flashcards
Quasi/Natural experiment
Quasi/Natural experiment
Signup and view all the flashcards
Neighborhood effect
Neighborhood effect
Signup and view all the flashcards
Public Housing Demolition Context
Public Housing Demolition Context
Signup and view all the flashcards
Quasi-Experiment Setup
Quasi-Experiment Setup
Signup and view all the flashcards
Key Assumption I (Demolition)
Key Assumption I (Demolition)
Signup and view all the flashcards
Key Assumption II (No Spillover)
Key Assumption II (No Spillover)
Signup and view all the flashcards
Balance Test
Balance Test
Signup and view all the flashcards
Instrumental Variable (IV)
Instrumental Variable (IV)
Signup and view all the flashcards
Intention to Treat Effect (ITT)
Intention to Treat Effect (ITT)
Signup and view all the flashcards
Local Average Treatment Effect (LATE)
Local Average Treatment Effect (LATE)
Signup and view all the flashcards
RDD Underlying Assumption
RDD Underlying Assumption
Signup and view all the flashcards
RDD: Sorting the Running Variable
RDD: Sorting the Running Variable
Signup and view all the flashcards
RDD Falsification Test
RDD Falsification Test
Signup and view all the flashcards
RDD Placebo Test 1
RDD Placebo Test 1
Signup and view all the flashcards
RDD Placebo Test 2
RDD Placebo Test 2
Signup and view all the flashcards
RDD Local Randomization
RDD Local Randomization
Signup and view all the flashcards
RDD Bandwidth Selection
RDD Bandwidth Selection
Signup and view all the flashcards
RDD Bias-Variance Trade-off
RDD Bias-Variance Trade-off
Signup and view all the flashcards
Exclusion Restriction
Exclusion Restriction
Signup and view all the flashcards
Exogeneity of IV
Exogeneity of IV
Signup and view all the flashcards
Relevance Condition
Relevance Condition
Signup and view all the flashcards
Purpose of IV Method
Purpose of IV Method
Signup and view all the flashcards
Reduced Form
Reduced Form
Signup and view all the flashcards
First Stage
First Stage
Signup and view all the flashcards
RF/FS Interpretation
RF/FS Interpretation
Signup and view all the flashcards
Fuzzy Regression Discontinuity (RDD)
Fuzzy Regression Discontinuity (RDD)
Signup and view all the flashcards
Regression Discontinuity Design (RDD)
Regression Discontinuity Design (RDD)
Signup and view all the flashcards
Running Variable (in RDD)
Running Variable (in RDD)
Signup and view all the flashcards
Cutoff (in RDD)
Cutoff (in RDD)
Signup and view all the flashcards
Difference-in-Differences (DiD)
Difference-in-Differences (DiD)
Signup and view all the flashcards
Parallel Trends Assumption (in DiD)
Parallel Trends Assumption (in DiD)
Signup and view all the flashcards
Control Group Outcome (in DiD)
Control Group Outcome (in DiD)
Signup and view all the flashcards
Difference-in-Differences Estimate
Difference-in-Differences Estimate
Signup and view all the flashcards
Treated*after
Treated*after
Signup and view all the flashcards
Alpha (α) in DID
Alpha (α) in DID
Signup and view all the flashcards
γ (gamma) in DID
γ (gamma) in DID
Signup and view all the flashcards
β (beta) in DID
β (beta) in DID
Signup and view all the flashcards
σ (sigma) in DID
σ (sigma) in DID
Signup and view all the flashcards
Parallel Trends Assumption
Parallel Trends Assumption
Signup and view all the flashcards
Parallel Pre-trends
Parallel Pre-trends
Signup and view all the flashcards
Common Shocks
Common Shocks
Signup and view all the flashcards
Study Notes
- T is the independent variable, also known as the "treatment" or variable of interest.
- Y is the dependent variable, also known as the outcome variable.
Limits of Randomized Controlled Trials (RCTs)
- RCTs can be costly and unethical.
- RCTs are not helpful when studying historical questions or understanding market level phenomena.
- Observational data and clever designs enable researchers to study causal questions without needing a specific experiment.
- Observation data is data collected as part of how societies and institutions normally function.
- Observational studies draw inferences from a sample of populations where the independent variable is not controlled by the researcher.
Selection on Observables vs. Unobservables
- Selection on observables means that treatment (T) and control groups (C) differ from each other only with respect to observable characteristics.
- Selection on unobservables means that T and C groups differ from each other in unobservable characteristics.
- This may happen when something unexpected affects some people almost randomly.
- An exogenous variable can induce a variation in treatment, which is an instrumental variable (IV).
- The selection mechanism can be known (Regression Discontinuity Design, RDD).
- Treatment and controls are observed before and after treatment (Difference in Differences, DiD).
Natural/Quasi-Experiments
- Natural/quasi-experiments occur when an unexpected event, like a government policy or natural event, affects some households similarly to an experiment.
- They provide both a treatment and a control group.
- Segregation by income occurs in cities, where rich people live in the city and poor people in the suburbs, stemming from income inequality, neighbourhood quality and optimizing behavior.
- Neighbourhood effects are direct or indirect impacts on socio-economic outcomes based on where you live.
Isolating the effect of treatment
- To isolate the effect of treatment, control for observable differences by comparing people with similar, measurable characteristics.
- Unobservable differences arise when similar families make different residential location choices because some invest more resources in parenting.
Public housing demolition
- Public housing demolition provides low-income households with resources to move to different residential areas.
- Being forced to relocate due to demolition results in receiving housing vouchers.
- Treatment and control occur naturally without planning, which qualifies as a quasi-experiment.
- Compare outcomes of young adults displaced and non-displaced from the same public housing project where T is displaced and C is non-displaced.
Key Assumptions
- Key assumption I: the decision to demolish buildings is unrelated to tenant characteristics and households and children are similar in treatment and control groups.
- If groups are similar in observable characteristics, it's plausible they are similar in unobservable characteristics too.
- Balance tests can assess this.
- Key assumption II: Demolition has no effect on children who were not displaced or there is no treatment effect on the control group.
- Balance test is used to assess if treatment and control groups are comparable across observable characteristics.
- Balance tests are crucial as quasi-experiments rely on non-random assignment methods.
Effects
- Heterogeneous effects describe how the outcome variable differs by subgroup.
Instrumental Variable
- An exogenous variable induces variation in treatment creating an instrumental variable
- Imperfect compliance is when some randomized into treatment do not receive treatment, and some randomized into control receive treatment
Groups
- Always-takers are people who get the treatment even if randomized into the control group.
- Compliers are people whose treatment status is decided by randomization.
- Never-takers are people who will not take the treatment even when randomized into the treatment group.
- Randomization ensures shares of each group is equally large in the treatment and control groups.
- Comparing everyone randomized into treatment to everyone randomized into control group is a valid comparison.
- Intention to treat (ITT) is the impact of being assigned to the treatment group versus being assigned to control, regardless of compliance.
- Local average treatment effect (LATE) estimates treatment effect on compliers in treatment and control groups although we cannot directly observe the compliers.
- The share of compliers can be estimated using the Wald Estimator
Wald estimator
- The Wald estimator formula: BLATE = (E[Y|Z = 1] - E[Y|Z = 0]) / (E[D|Z = 1] - E[D|Z = 0])
- Y is the outcome.
- Z indicates randomization into the treatment group.
- D indicates if treatment was actually received.
- For LATE with IV, ITT is the expected value for the outcome variable for T and C and calculates the ITT.
- Share compliance is the proportion of participants in each group that actually received treatment, it measures compliance rate.
- BLATE = E[Y1 - Yo|complier] is the local average treatment effect, where the treatment impact may differ from the impact on never-takers and always-takers.
Instrument Variables
- Answers the causal question: does T affect Y?
- Instruments are exogenous factors that only affect T and whose effect on Y is to be estimated.
- Instrument relevance condition: the instrument should be correlated with the variable of interest and have causal effects on it.
- Exogeneity means that the instrument is randomly assigned and unrelated to omitted variables.
- Exclusion restriction states the instrument affects outcomes only through the treatment variable.
Using Instrument Variable
- First stage is the relationship between the IV and the explanatory variable where the IV is the winning lottery and the explanatory variable is likely to attend.
- Second stage concerns outcome Y and treatment T which is attending.
- The causal effect of attending school is isolated, by controlling for cofounding factors correlated with attendance and grades.
- Relevance Condition: winning the lottery is tied to likelihood to attend.
- Exogeneity stipulates unrelated connection between winning the lottery not correlated with motivation/grades because the lottery is randomly assigned.
- Exclusion restriction states winning the lottery has no impact on grades other than attending school.
- Starting point: estimate the effect of T on the outcome, however there are factors correlating with treatment status and outcome.
- Possible solutions involve finding exogenous random variation in treatment where IV should not affect outcome directly and cannot be correlated with confounding factors.
Testing
- First stage regression to check IV and the variable of interest (T).
- Exogeneity cannot be fully tested because correlation of IV with unobservables cannot be checked, but correlation between IV and observable confounding factors can be checked.
- Exclusion restriction cannot be tested, but provide arguments in favour of it, and suggest and address exclusion restriction threats.
- The treatment is correlated with unobservables which also affect outcome.
Endogeneity issues
- IV is exogenous variable that only affects Y through treatment.
- Creating exogenous variation in treatment allows in isolate causal effect from treatment.
- There is an estimated treatment affect because treatment status changes.
- Economic shock T, conflict Y, rainfall shock Z.
- Relevancy Condition: rainfall and economic correlated.
- Exogeneity: rainfall and conflict do not have correlating unobserved factors.
- Exclusion Restriction: rainfall and conflict don't affect each other expect by effect of rainfall on economic shock.
Instrumental Variable (IV) requirements
- Potential omitted variable bias (OVB) affecting both T and Y.
- The IV should be correlated with the variable T which affects Y.
- IV should not directly affect Y.
- IV must be randomly assigned.
- IV must be strongly correlated with T and relevance must be tested in the first stage.
- Reduced form equation: effect of Z on Y, and since Z is exogenous then the only factor affecting Y is T.
- First stage: how IV affects treated people. Reduced form/first stage = causal effect in treatment units.
Regression Discontinuity Design (RDD)
- RDD isolates the causal effect of T in situations when individuals become treated after crossing some cutoff.
- Sharp RDD: treatment received one probability above cutoff, and zero below.
- Fuzzy RDD: probability of receiving treatment increases discontinuously at threshold with imperfect compliance.
- Smooth evolution across cutoff assumption.
- Observations should be very similar, to have a valid control group.
- Units cannot be above or below the cutoff.
- Absence of common support cannot see the outcome when units are not cut off.
- Treatment effect is within cutoff- treatment groups.
- There are controls and units.
- A key assumption is that units are comparable except the treatment.
- Conditions should mimic conditions of random experiment with units being equally assigned. Continuity of the cutoff, where outcomes are known.
Running the Variable
- Units should be not manipulate variables, and do not sort themselves depending on the running variable.
- Without manipulation, the # of observations should be = for each group.
- With signs of sorting, it should have jump the treat group, the transition should be smooth for the other.
Test
- Near cutoff treatment is near controls in observable characteristics.
- Placebo: the replacement should be replaced with a fake cutoff if significant treatment, it will show only at constant cutoffs. If not the treatment will equal zero.
- The alternate option is that the outcome should be affected by treatment, so it will show the other side of the cutoff and will not.
RDD Limitations
- It can be randomized near the cutoff. Can segment narrow for the results from it.
- The smaller it is the more data it should to work
- RDD means, that data should be below and above, which means it needs bandwidth data (how far away from the cutoff we can utilize)
- Different Bandwidths can be used.
Sharp and Fuzzy RDD
- Fuzzy RDD is when a treatment is switched off or completely instead of assigning to control when passing the cutoff.
- The cutoff determines a treatment, and use the cutoff to the rule, we can use without RCT.
- If units have similar characteristics then variables cannot be manipulated.
- Need extra lot of data/hard to extrapolate data of effects form cutoff.
Difference-in-Differences (DID)
- Treatment/controls shows before and after treatment.
- DID = two groups with two time periods
- Groups of timer the group is the same for all.
- The control group captures changes
Common effect
- Same trend lines that impact with treatment
- Treated = 1 if treatment, O if not.
- after = observations
parallel
- Follow trends = the outcome after treatment and control should follow the same trend
- Parallel pre-trends in similar manners.
- Check the shocks during the same period and impact on the groups.
- Research information depending on reform.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.