Podcast
Questions and Answers
Consider an econometric model where schooling is suspected to be endogenous. A researcher proposes using a change in student loan policies as an instrument. Under what stringent condition would this instrument be considered valid, assuming it is relevant?
Consider an econometric model where schooling is suspected to be endogenous. A researcher proposes using a change in student loan policies as an instrument. Under what stringent condition would this instrument be considered valid, assuming it is relevant?
- The change in student loan policies must be independent of potential earnings outcomes, conditional on observed covariates, and only affect earnings through its effect on schooling. (correct)
- The change in student loan policies must affect schooling decisions differently across various demographic groups, ensuring heterogeneous treatment effects.
- The change in student loan policies must directly influence the quality of education received, thereby creating a separate pathway to affect earnings, independent of years of schooling.
- The change in student loan policies must be correlated with potential earnings outcomes, such that individuals from lower socioeconomic backgrounds are more likely to benefit.
In the context of Angrist's (1990) study on the effects of Vietnam-era military service on earnings, which of the following scenarios would most severely threaten the validity of draft-eligibility as an instrument for veteran status?
In the context of Angrist's (1990) study on the effects of Vietnam-era military service on earnings, which of the following scenarios would most severely threaten the validity of draft-eligibility as an instrument for veteran status?
- A significant proportion of draft-eligible men pursued further education to avoid service, thereby altering their human capital accumulation.
- Men who were draft-eligible but volunteered for service exhibited systematically different pre-service earnings potential compared to those who were drafted.
- Exemptions from military service due to health reasons were correlated with pre-existing health conditions that independently affected post-service labor market outcomes. (correct)
- The draft lottery numbers were not perfectly random, with some birth dates having a slightly higher probability of selection due to administrative errors.
Suppose a researcher aims to estimate the causal effect of an additional year of schooling on wages using the Wald estimator. The reduced form estimate shows that individuals exposed to a policy change completed 0.2 years more schooling and earned 2% higher wages. However, the first-stage effect (policy change on schooling) is found to be statistically insignificant. Which of the following is the MOST appropriate conclusion?
Suppose a researcher aims to estimate the causal effect of an additional year of schooling on wages using the Wald estimator. The reduced form estimate shows that individuals exposed to a policy change completed 0.2 years more schooling and earned 2% higher wages. However, the first-stage effect (policy change on schooling) is found to be statistically insignificant. Which of the following is the MOST appropriate conclusion?
- The Wald estimator is valid, and the estimated effect should be interpreted with caution due to the small sample size.
- The Wald estimator is valid, but the conclusion suggests that there is no effect of wages on schooling.
- The Wald estimator is invalid, and the instrumental variable approach should not be used.
- The Wald estimator is invalid due to the weak instrument problem, rendering the results unreliable. (correct)
In the context of Angrist and Krueger's (1991) study using quarter of birth as an instrument for education, which unobserved factor would MOST severely bias the Wald estimate of the return to schooling?
In the context of Angrist and Krueger's (1991) study using quarter of birth as an instrument for education, which unobserved factor would MOST severely bias the Wald estimate of the return to schooling?
Consider a scenario where a researcher attempts to use proximity to a new community college as an instrument for college attendance. What specific threat to the exclusion restriction should the researcher be MOST concerned about?
Consider a scenario where a researcher attempts to use proximity to a new community college as an instrument for college attendance. What specific threat to the exclusion restriction should the researcher be MOST concerned about?
In the context of the 'constant effect case' model, if a researcher posits that an individual's wage ($y_i$) is determined by $\alpha$, the return to schooling ($p s_i$), and a composite error term ($\eta_i$), and they further decompose $\eta_i$ into observed abilities ($A_i' \gamma$) and a random error term ($\upsilon_i$), under what specific condition does ordinary least squares (OLS) estimation of the 'short' regression ($y_i = \alpha + p s_i + \eta_i$) yield biased estimates of $p$?
In the context of the 'constant effect case' model, if a researcher posits that an individual's wage ($y_i$) is determined by $\alpha$, the return to schooling ($p s_i$), and a composite error term ($\eta_i$), and they further decompose $\eta_i$ into observed abilities ($A_i' \gamma$) and a random error term ($\upsilon_i$), under what specific condition does ordinary least squares (OLS) estimation of the 'short' regression ($y_i = \alpha + p s_i + \eta_i$) yield biased estimates of $p$?
Given the model $y_i = \alpha + p s_i + A_i' \gamma + \upsilon_i$, where $y_i$ represents wages, $s_i$ represents schooling, $A_i$ represents observed abilities, $\gamma$ represents the effect of abilities, and $\upsilon_i$ is a random error term, what key identifying assumption must hold true to ensure that the population regression of $y_i$ on $s_i$ and $A_i$ yields unbiased estimates of the causal effect of schooling ($p$) on wages?
Given the model $y_i = \alpha + p s_i + A_i' \gamma + \upsilon_i$, where $y_i$ represents wages, $s_i$ represents schooling, $A_i$ represents observed abilities, $\gamma$ represents the effect of abilities, and $\upsilon_i$ is a random error term, what key identifying assumption must hold true to ensure that the population regression of $y_i$ on $s_i$ and $A_i$ yields unbiased estimates of the causal effect of schooling ($p$) on wages?
In the context of instrumental variables (IV) estimation, which of the following scenarios would unequivocally violate the exclusion restriction, rendering the instrument invalid for consistently estimating the causal effect of schooling ($s_i$) on wages ($y_i$)?
In the context of instrumental variables (IV) estimation, which of the following scenarios would unequivocally violate the exclusion restriction, rendering the instrument invalid for consistently estimating the causal effect of schooling ($s_i$) on wages ($y_i$)?
Consider a scenario where an instrumental variable $z_i$ is proposed to address the endogeneity of schooling ($s_i$) in a wage regression. While $z_i$ exhibits a statistically significant correlation with $s_i$, a formal overidentification test using multiple instruments fails to reject the null hypothesis that all instruments are valid. Which of the following inferences is most warranted?
Consider a scenario where an instrumental variable $z_i$ is proposed to address the endogeneity of schooling ($s_i$) in a wage regression. While $z_i$ exhibits a statistically significant correlation with $s_i$, a formal overidentification test using multiple instruments fails to reject the null hypothesis that all instruments are valid. Which of the following inferences is most warranted?
In an instrumental variables (IV) framework designed to estimate the causal effect of schooling ($s_i$) on wages ($y_i$), where the error term ($\eta_i$) is potentially correlated with schooling, what specific assumption regarding the instrument ($z_i$) ensures that it breaks the endogenous variation in $s_i$ into components that are both correlated and uncorrelated with $\eta_i$ , allowing for consistent estimation of the causal effect?
In an instrumental variables (IV) framework designed to estimate the causal effect of schooling ($s_i$) on wages ($y_i$), where the error term ($\eta_i$) is potentially correlated with schooling, what specific assumption regarding the instrument ($z_i$) ensures that it breaks the endogenous variation in $s_i$ into components that are both correlated and uncorrelated with $\eta_i$ , allowing for consistent estimation of the causal effect?
Suppose you are employing an instrumental variable ($z_i$) strategy to estimate the causal effect of years of schooling ($s_i$) on individual earnings ($y_i$). You have reason to suspect that even though $z_i$ is ostensibly exogenous, its effect on earnings may be heterogeneous across different subgroups of the population. Which econometric technique would be most appropriate to assess the potential bias arising from this heterogeneous treatment effect and obtain a more precise estimate of the average treatment effect?
Suppose you are employing an instrumental variable ($z_i$) strategy to estimate the causal effect of years of schooling ($s_i$) on individual earnings ($y_i$). You have reason to suspect that even though $z_i$ is ostensibly exogenous, its effect on earnings may be heterogeneous across different subgroups of the population. Which econometric technique would be most appropriate to assess the potential bias arising from this heterogeneous treatment effect and obtain a more precise estimate of the average treatment effect?
Consider a scenario where researchers are using proximity to a college ($z_i$) as an instrument for educational attainment ($s_i$) in a wage regression. However, they discover that individuals who grow up near colleges also tend to have better access to healthcare and other resources that directly impact their future earnings, irrespective of their educational choices. This situation exemplifies which of the following threats to the validity of the instrumental variable?
Consider a scenario where researchers are using proximity to a college ($z_i$) as an instrument for educational attainment ($s_i$) in a wage regression. However, they discover that individuals who grow up near colleges also tend to have better access to healthcare and other resources that directly impact their future earnings, irrespective of their educational choices. This situation exemplifies which of the following threats to the validity of the instrumental variable?
In the context of instrumental variables (IV) analysis, what fundamental threat does the exclusion restriction aim to mitigate, and why is its untestability a critical consideration for econometricians?
In the context of instrumental variables (IV) analysis, what fundamental threat does the exclusion restriction aim to mitigate, and why is its untestability a critical consideration for econometricians?
Consider an instrumental variable $z_i$ that purportedly affects a causal variable of interest $s_i$. What specific condition must be met to ensure the validity of $z_i$ as an instrument, and why is directly testing this condition inherently problematic?
Consider an instrumental variable $z_i$ that purportedly affects a causal variable of interest $s_i$. What specific condition must be met to ensure the validity of $z_i$ as an instrument, and why is directly testing this condition inherently problematic?
In an instrumental variable framework, why is the first-stage requirement—that the instrument must have a demonstrable effect on the causal variable—considered essential, and how does a failure in this stage undermine the entire analysis?
In an instrumental variable framework, why is the first-stage requirement—that the instrument must have a demonstrable effect on the causal variable—considered essential, and how does a failure in this stage undermine the entire analysis?
Consider the scenario where an instrument $z_i$ (e.g., quarter of birth) is proposed to address endogeneity in a causal variable $s_i$ (e.g., years of schooling) when estimating its impact on an outcome $y_i$ (e.g., earnings). If $z_i$ is found to directly influence unobserved determinants of $y_i$ or ability, what specific IV assumption is violated, and what is the implication for the validity of the IV estimates?
Consider the scenario where an instrument $z_i$ (e.g., quarter of birth) is proposed to address endogeneity in a causal variable $s_i$ (e.g., years of schooling) when estimating its impact on an outcome $y_i$ (e.g., earnings). If $z_i$ is found to directly influence unobserved determinants of $y_i$ or ability, what specific IV assumption is violated, and what is the implication for the validity of the IV estimates?
Random assignment in experiments is often likened to the exclusion restriction in instrumental variables. In what critical sense does a valid instrument, satisfying the exclusion restriction, emulate random assignment, and why is this equivalence fundamental for causal inference?
Random assignment in experiments is often likened to the exclusion restriction in instrumental variables. In what critical sense does a valid instrument, satisfying the exclusion restriction, emulate random assignment, and why is this equivalence fundamental for causal inference?
Suppose you are employing an instrumental variable $z_i$ to estimate the causal effect of $s_i$ on $y_i$. How does the strength of the correlation between $z_i$ and $s_i$ (the first stage) specifically affect the properties of the resulting instrumental variable estimator, particularly in terms of bias and efficiency?
Suppose you are employing an instrumental variable $z_i$ to estimate the causal effect of $s_i$ on $y_i$. How does the strength of the correlation between $z_i$ and $s_i$ (the first stage) specifically affect the properties of the resulting instrumental variable estimator, particularly in terms of bias and efficiency?
In the framework of instrumental variables, the exclusion restriction is pivotal. Enumerate the potential consequences of violating this restriction, elucidating how such a violation undermines the integrity and interpretability of the causal inferences drawn from the analysis.
In the framework of instrumental variables, the exclusion restriction is pivotal. Enumerate the potential consequences of violating this restriction, elucidating how such a violation undermines the integrity and interpretability of the causal inferences drawn from the analysis.
Consider the application of quarter of birth as an instrument for educational attainment. What are the primary, theoretically-grounded concerns regarding the validity of this instrument, especially concerning potential violations of the exclusion restriction in contemporary, developed economies?
Consider the application of quarter of birth as an instrument for educational attainment. What are the primary, theoretically-grounded concerns regarding the validity of this instrument, especially concerning potential violations of the exclusion restriction in contemporary, developed economies?
In the context of the exclusion restriction in instrumental variables, why is it inherently impossible to empirically 'test' the validity of this assumption, and what implications does this untestability have for the interpretation of IV estimates?
In the context of the exclusion restriction in instrumental variables, why is it inherently impossible to empirically 'test' the validity of this assumption, and what implications does this untestability have for the interpretation of IV estimates?
Critically evaluate the assertion that a valid instrument provides variation in a causal variable 'as good as random.' In what precise sense does this hold, and what are the key limitations or caveats to this analogy when dealing with real-world instrumental variable applications?
Critically evaluate the assertion that a valid instrument provides variation in a causal variable 'as good as random.' In what precise sense does this hold, and what are the key limitations or caveats to this analogy when dealing with real-world instrumental variable applications?
In the context of Instrumental Variables (IV) estimation, if the exclusion restriction is demonstrably violated, what inferential challenge is most likely to arise, assuming all other IV assumptions hold?
In the context of Instrumental Variables (IV) estimation, if the exclusion restriction is demonstrably violated, what inferential challenge is most likely to arise, assuming all other IV assumptions hold?
Consider an IV model analyzing the effect of educational attainment ($s_i$) on earnings ($y_i$), where $z_i$ represents proximity to a college. What specific threat to validity is most directly addressed by including a rich set of geographic and socioeconomic controls ($X_i'$) in both the first-stage and reduced-form equations?
Consider an IV model analyzing the effect of educational attainment ($s_i$) on earnings ($y_i$), where $z_i$ represents proximity to a college. What specific threat to validity is most directly addressed by including a rich set of geographic and socioeconomic controls ($X_i'$) in both the first-stage and reduced-form equations?
In the context of the Wald estimator, assume you are analyzing the impact of military service ($s_i$), a binary variable, on future civilian earnings ($y_i$). You use draft lottery number ($z_i$) as an instrument, also a binary variable. If $E[y_i | z_i = 1] = $50,000$, $E[y_i | z_i = 0] = $40,000$, $p = P(z_i = 1) = 0.3$, and $Cov(s_i, z_i) = 0.0525$, what is the Wald estimate of the effect of military service on earnings?
In the context of the Wald estimator, assume you are analyzing the impact of military service ($s_i$), a binary variable, on future civilian earnings ($y_i$). You use draft lottery number ($z_i$) as an instrument, also a binary variable. If $E[y_i | z_i = 1] = $50,000$, $E[y_i | z_i = 0] = $40,000$, $p = P(z_i = 1) = 0.3$, and $Cov(s_i, z_i) = 0.0525$, what is the Wald estimate of the effect of military service on earnings?
Consider an instrumental variable (IV) regression where both the endogenous variable ($s_i$) and the instrument ($z_i$) are binary. Which of the following statements accurately describes the conditions under which the IV estimator will produce a consistent estimate of the local average treatment effect (LATE)?
Consider an instrumental variable (IV) regression where both the endogenous variable ($s_i$) and the instrument ($z_i$) are binary. Which of the following statements accurately describes the conditions under which the IV estimator will produce a consistent estimate of the local average treatment effect (LATE)?
Suppose an econometrician uses quarter of birth ($z_i$) as an instrument for years of schooling ($s_i$) in a regression examining the effect of education on earnings ($y_i$). Assuming a linear model as described, what specific concern arises if individuals strategically choose their birth quarter to maximize their schooling opportunities?
Suppose an econometrician uses quarter of birth ($z_i$) as an instrument for years of schooling ($s_i$) in a regression examining the effect of education on earnings ($y_i$). Assuming a linear model as described, what specific concern arises if individuals strategically choose their birth quarter to maximize their schooling opportunities?
In IV estimation, what is the consequence of using an instrument ($z_i$) that is only weakly correlated with the endogenous variable ($s_i$) in the first-stage equation, even if the instrument satisfies the exclusion restriction?
In IV estimation, what is the consequence of using an instrument ($z_i$) that is only weakly correlated with the endogenous variable ($s_i$) in the first-stage equation, even if the instrument satisfies the exclusion restriction?
Consider a scenario where $s_i$ represents years of schooling and $y_i$ represents log earnings. Using the described IV framework, which of the following best describes limitations of interpreting $$ as the causal effect of an additional year of schooling on log earnings for all individuals?
Consider a scenario where $s_i$ represents years of schooling and $y_i$ represents log earnings. Using the described IV framework, which of the following best describes limitations of interpreting $$ as the causal effect of an additional year of schooling on log earnings for all individuals?
In the two-equation IV system provided, how does the inclusion of the covariates $X_i'$ affect the interpretation of the coefficients ${11}$ and ${21}$?
In the two-equation IV system provided, how does the inclusion of the covariates $X_i'$ affect the interpretation of the coefficients ${11}$ and ${21}$?
In a scenario where $z_i$ (the instrument) is a valid instrument, $s_i$ is the endogenous variable, and $y_i$ is the outcome, what is the most accurate interpretation of $Cov(y_i, z_i)$?
In a scenario where $z_i$ (the instrument) is a valid instrument, $s_i$ is the endogenous variable, and $y_i$ is the outcome, what is the most accurate interpretation of $Cov(y_i, z_i)$?
Assuming that $z_i$ is a binary instrument. Under what condition does $Cov(y_i, z_i) = {E[y_i | z_i = 1] - E[y_i | z_i = 0]}p(1 - p)$ accurately compute covariance even if the sample data exhibits substantial heteroskedasticity?
Assuming that $z_i$ is a binary instrument. Under what condition does $Cov(y_i, z_i) = {E[y_i | z_i = 1] - E[y_i | z_i = 0]}p(1 - p)$ accurately compute covariance even if the sample data exhibits substantial heteroskedasticity?
In the context of Angrist and Krueger's (1991) study on the causal effect of education on earnings, how does the quarter of birth instrument leverage institutional knowledge and compulsory schooling laws to address omitted variable bias, specifically in relation to unobserved ability?
In the context of Angrist and Krueger's (1991) study on the causal effect of education on earnings, how does the quarter of birth instrument leverage institutional knowledge and compulsory schooling laws to address omitted variable bias, specifically in relation to unobserved ability?
Within the framework of instrumental variables (IV) estimation, if an instrument $z_i$ is correlated with the error term $\eta_i$, what is the most likely consequence for the estimation of the parameter $\pi_1$ representing the causal effect of schooling ($s$) on earnings?
Within the framework of instrumental variables (IV) estimation, if an instrument $z_i$ is correlated with the error term $\eta_i$, what is the most likely consequence for the estimation of the parameter $\pi_1$ representing the causal effect of schooling ($s$) on earnings?
In the context of instrumental variables, what critical assumption must hold true for an instrument $z_i$ to validly isolate the 'unproblematic' variation in a variable $s$ (schooling) that is uncorrelated with the error term $\eta_i$ (unobserved factors), and how is this assumption typically assessed in empirical research?
In the context of instrumental variables, what critical assumption must hold true for an instrument $z_i$ to validly isolate the 'unproblematic' variation in a variable $s$ (schooling) that is uncorrelated with the error term $\eta_i$ (unobserved factors), and how is this assumption typically assessed in empirical research?
Considering Angrist and Krueger's study utilizing quarter of birth as an instrument, what potential threat to the validity of their instrument could arise if individuals born in different quarters of the year systematically differ in unobservable characteristics that also affect earnings, and how might this compromise the identification of the causal effect of education?
Considering Angrist and Krueger's study utilizing quarter of birth as an instrument, what potential threat to the validity of their instrument could arise if individuals born in different quarters of the year systematically differ in unobservable characteristics that also affect earnings, and how might this compromise the identification of the causal effect of education?
Suppose a researcher aims to replicate Angrist and Krueger's (1991) study in a different cultural context where seasonal employment patterns significantly influence family income. How might these seasonal employment patterns affect the validity of quarter of birth as an instrument for educational attainment, and what specific econometric strategies could be employed to address this potential confounding?
Suppose a researcher aims to replicate Angrist and Krueger's (1991) study in a different cultural context where seasonal employment patterns significantly influence family income. How might these seasonal employment patterns affect the validity of quarter of birth as an instrument for educational attainment, and what specific econometric strategies could be employed to address this potential confounding?
Consider a scenario where the compulsory schooling laws exhibit significant heterogeneity across different states within the US during the period studied by Angrist and Krueger (1991). How would this heterogeneity impact the interpretation of their findings, and what methodological adjustments would be necessary to account for this variation in compulsory schooling requirements?
Consider a scenario where the compulsory schooling laws exhibit significant heterogeneity across different states within the US during the period studied by Angrist and Krueger (1991). How would this heterogeneity impact the interpretation of their findings, and what methodological adjustments would be necessary to account for this variation in compulsory schooling requirements?
Suppose that advancements in fertility treatments have led to a higher incidence of multiple births (twins, triplets, etc.). How might the presence of multiple births within a family introduce a potential bias in Angrist and Krueger's instrumental variable approach, and what analytic strategies could be implemented to address this bias?
Suppose that advancements in fertility treatments have led to a higher incidence of multiple births (twins, triplets, etc.). How might the presence of multiple births within a family introduce a potential bias in Angrist and Krueger's instrumental variable approach, and what analytic strategies could be implemented to address this bias?
In the context of instrumental variables estimation, define and differentiate between the concepts of 'relevance' and 'exclusion restriction' concerning the validity of an instrument $z_i$ and explain how a failure of either condition would compromise the identification of a causal effect.
In the context of instrumental variables estimation, define and differentiate between the concepts of 'relevance' and 'exclusion restriction' concerning the validity of an instrument $z_i$ and explain how a failure of either condition would compromise the identification of a causal effect.
In the context of Angrist and Krueger’s study, consider the potential for differential migration patterns based on quarter of birth. If individuals born in certain quarters are more likely to migrate to regions with different labor market conditions and educational opportunities, how could this migration bias affect the validity of their instrumental variable estimates, and what statistical techniques could be employed to mitigate this potential bias?
In the context of Angrist and Krueger’s study, consider the potential for differential migration patterns based on quarter of birth. If individuals born in certain quarters are more likely to migrate to regions with different labor market conditions and educational opportunities, how could this migration bias affect the validity of their instrumental variable estimates, and what statistical techniques could be employed to mitigate this potential bias?
Flashcards
Ability (𝐴𝑖)
Ability (𝐴𝑖)
A vector of variables representing an individual's skills and knowledge.
Random term (𝜐𝑖)
Random term (𝜐𝑖)
The portion of potential outcomes not explained by ability (𝐴𝑖).
Omitted Variable Bias
Omitted Variable Bias
An unobserved factor that correlates with the independent variable (𝑠𝑖) and the dependent variable (𝑦𝑖), leading to biased estimates.
Instrumental Variable (𝑧𝑖)
Instrumental Variable (𝑧𝑖)
Signup and view all the flashcards
IV Exogeneity Condition
IV Exogeneity Condition
Signup and view all the flashcards
Intuition Behind IV
Intuition Behind IV
Signup and view all the flashcards
IV Decomposition
IV Decomposition
Signup and view all the flashcards
Problematic Part of a Variable
Problematic Part of a Variable
Signup and view all the flashcards
Unproblematic Part of a Variable
Unproblematic Part of a Variable
Signup and view all the flashcards
Instrument (in IV)
Instrument (in IV)
Signup and view all the flashcards
Angrist & Krueger (1991)
Angrist & Krueger (1991)
Signup and view all the flashcards
School-Starting Age Policies (US)
School-Starting Age Policies (US)
Signup and view all the flashcards
Compulsory Schooling Laws (US)
Compulsory Schooling Laws (US)
Signup and view all the flashcards
Natural Experiment (Birth Quarter)
Natural Experiment (Birth Quarter)
Signup and view all the flashcards
Birth Quarter & Schooling
Birth Quarter & Schooling
Signup and view all the flashcards
Quarter-of-Birth Instrument
Quarter-of-Birth Instrument
Signup and view all the flashcards
Strong First-Stage
Strong First-Stage
Signup and view all the flashcards
Exclusion Restriction
Exclusion Restriction
Signup and view all the flashcards
Exclusion Restriction Implication
Exclusion Restriction Implication
Signup and view all the flashcards
Exclusion Restriction: Covariance Form
Exclusion Restriction: Covariance Form
Signup and view all the flashcards
IV as 'Good as Random'
IV as 'Good as Random'
Signup and view all the flashcards
Exclusion Restriction Independence
Exclusion Restriction Independence
Signup and view all the flashcards
IV Relevance
IV Relevance
Signup and view all the flashcards
η𝑖
η𝑖
Signup and view all the flashcards
Assumption for Valid Instrument
Assumption for Valid Instrument
Signup and view all the flashcards
Wald Estimator
Wald Estimator
Signup and view all the flashcards
Reduced Form
Reduced Form
Signup and view all the flashcards
First Stage
First Stage
Signup and view all the flashcards
Angrist and Krueger (1991)
Angrist and Krueger (1991)
Signup and view all the flashcards
Angrist (1990)
Angrist (1990)
Signup and view all the flashcards
First-Stage Equation
First-Stage Equation
Signup and view all the flashcards
Reduced Form Equation
Reduced Form Equation
Signup and view all the flashcards
Exclusion Restriction (in IV)
Exclusion Restriction (in IV)
Signup and view all the flashcards
IV estimate of causal variable
IV estimate of causal variable
Signup and view all the flashcards
IV-Estimate Formula
IV-Estimate Formula
Signup and view all the flashcards
IV-Estimate as Regression Ratio
IV-Estimate as Regression Ratio
Signup and view all the flashcards
Cov(𝑦𝑖 , 𝑧𝑖 ) Formula with Binary 𝑧𝑖
Cov(𝑦𝑖 , 𝑧𝑖 ) Formula with Binary 𝑧𝑖
Signup and view all the flashcards
Exogeneity Condition
Exogeneity Condition
Signup and view all the flashcards
Study Notes
- Instrumental variables (IV) are utilized when the zero conditional mean assumption in Ordinary Least Squares (OLS) regressions is unlikely to hold.
- IV serves as a quasi-experimental method, mimicking a real experiment to address issues when conducting a real experiment is not feasible.
Roadmap of IV:
- Circumstances under which an instrument is useful are explored
- Examination of IV under the constant effects.
- Identification of necessary assumptions.
- Introduction to the Wald estimator.
- Discussion on Two-Stage Least Squares (TSLS).
- Coverage of IV tests.
- Investigation of IV under heterogeneous effects assumptions.
Where an Instrument Can Help:
- Instrumental variables can address threats to internal validity such as:
- Omitted variables
- Measurement errors in X
- Simultaneity bias
- Bias arises in all of these cases because E[u|X] ≠ 0
Endogeneity:
- Used to characterize threats to internal validity when E[u|X] ≠ 0.
- When a variable X is endogenous, OLS measures correlation rather than a causal effect.
- Exogeneity occurs when variable X is uncorrelated with u.
IV: Two Cases:
- IV studied under two scenarios:
- One is where the effect is the same for everyone called the constant effect case.
- The other is where the effect differs across people called the heterogeneous effect case.
IV: Constant Effect Case
- With constant effect example shown through causal link of schooling and wages:
- Where potential outcomes equal ysi = fi(s).
- ysi denotes potential schooling outcomes where:
- equation is fi(s) = πο + π₁ς + ηi.
- ηi denotes unobserved factors affecting wages.
- ηi can be function of vector, Ai, called ability:
- ηi = A´¡Y + Vi
- y denotes vector of population regression coefficients
- vᵢ is random term that are uncorrelated by construction
Constant Effect
- Model can be written as yᵢ = α + ρsᵢ + ηi, where the Aᵢ variables, contained in ηi, correlate with sᵢ.
- Provided the Aᵢ correlate with sᵢ this implies an omitted variable issue
- The Aᵢ are the only reason sᵢ and ηi are correlated.
- Equation E[sᵢvᵢ]=0
Instrumental Variable:
- Need to estimate regression coefficient p, when Aᵢ is unobserved.
- Instrumental variables methods can estimate p, when there exists a variable which is:
- Correlated with causal variable of interest sᵢ.
- Uncorrelated with any other determinants of the dependent variable yᵢ.
- This variable is an instrument denoted zᵢ.
- Term "uncorrelated with any other determinants of dependent variable" implies:
- Cov (ηi, zi) = 0, or zᵢ is uncorrelated with Aᵢ and vᵢ.
- Variable zᵢ is uncorrelated with abilities Aᵢ and unobserved determinants of wage.
- Variable zᵢ should be correlated with schooling sᵢ.
The Intuition Behind IV:
- With variable zᵢ, ysi = πο + π₁ς + ηi where E[ηi|s] ≠ 0.
- IV splits in s into two parts:
- One that correlates with ηi.
- One that doesn't correlate with ηi.
- Part correlated with ηi is the "problematic" part
- Part correlated with ηi is the "unproblematic" part
- With an instrument zᵢ to isolate or predict variation in part of s that is uncorrelated with ηi:
- Allowing a consistent π₁ estimate
- Instrument helps distinguish between uncorrelated part from correlated part in s.
- The instrument detects movements in s uncorrelated with ηi and uses these to estimate π₁.
- Variable zi is used to only use the variation schooling s unrelated abilities Aᵢ.
Example of Quarter of Birth Instrument:
- Angrist and Kreuger wanted to study the causal effect of education on earnings
- Institutional knowledge about school-starting age policies and compulsory schooling laws in the US generated instrument for schooling.
- US states generally require 6 year olds to start school in the calendar year (starting in September).
- All children born in same year start school in September of the same year.
- Compulsory schooling laws require student to remain in schools until 16th birthday.
- Means students in different quarters will complete different degrees by the legal dropout age.
- Angrist and Kreuger's analysis creates a "natural experiment" where those with different birthdays stay in school different amounts if time.
Angrist and Kreuger Quarter of Birth
- Example: a person born in January can drop out of school approximately 3 months earlier that someone born in April.
- This may result in less schooling for those born earlier in year.
- Angrist and Kreuger's instrument, quarter of birth, affects years of schooling.
- Quarter-of-birth instrument distinguishes between a "problematic" and "unproblematic" variation in schooling sᵢ.
- Therefore the variation caused by quarter of birth is unrelated to individual ability.
- It is a bad instrument if quarter birth affecting ability and unobserved earnings determinants.
Assumptions Needed for IV
- Instrument zᵢ has a clear affect on the causal variable of interest, schooling. which is "first-stage".
- No effect from the instrument means no information about the causal variable of interest.
- Can be easily checked with that data
- Example: birth quarter must significantly affect years of schooling.
- Second assumption is that the only relationships between outcome variable, yᵢ, and instrument, zᵢ is the first stage.
- Then zᵢ can only affect yᵢ (earnings) via its effect, sᵢ (schooling).
- Such that an exclusion restriction can be written as Cov(zᵢ, ηi) = 0.
- Exclusion restrictions cannot be tested, identifying characteristics, not just beliefs
IV Exclusion Restriction
- Exclusion Restriction is called so as the instrumental zᵢ assumed has no correlation with unobserved outcome determinants yᵢ.
- So its excluded from the main outcome equation Excluding it should not cause omitted variable bias.
- Example the quarter of birth can be excluded as its uncorrelated
The Exclusion Restriction
- The exclusion restriction of IV implies causal variable of interest is as good as random.
- Exclusion restriction says instrument for potential outcomes which exclude causal outcomes
- Conditional on several covariates a birth is good as
Angrist and Kreuger Formally
- Following equation are formalized taking the birth story
- sᵢ = X'ᵢπ₁₀ + π₁₁zᵢ + ζ₁ᵢ
- yᵢ = X'ᵢπ₂₀ + π₂₁zᵢ + ζ₂ᵢ
- Equation one is know as the first stage
- the schooling sᵢ on the instrument zᵢ and other controls X'ᵢ.
- The second is known as the reduced form
- the direct effects
- If exclusion restrictions hold the reduce form shows the direct effects
- only affecting schooling
- Shows the effects on instrument
- The by community can work causal increase of schooling by a year
The Constant Effect Case
- Given the exclusion, an estimate can be given:
- ρ = Cov(yi, zi)/Cov(si, zi) = Cov(yi, zi)/V(zi) / Cov(si, zi)/V(zi)
- Good form as it can be expressed in the ratio
- Can be population regression yᵢ on zᵢ to the first stage equation
The Wald Estimator
- A IV estimator is the most popular estimate is a single binary number
- A dummy that equals with a set probably can be found
- Cov(yi, zi)={E [yizi = 1] - E [yi]zi = 0]} p(1-p)
- An similar for formula known as Cov (Si, Zi). The IV estimator is ρ = E [yilzi=1]-E [yizi=0] divided by E [SiZi=1]-E [SiZi=0]
Wald Estimator Example
- Angrist and Krueger quarter of birth study using.
- Reduced for the difference is earn between the two points
- Between with values standard error
- The values that of the men are the years
- Divided and get Wald economics
Wald Estimator 2
- The Angrist (1990) looks at Vietnam military service on earnings for veterans
- Young men in 1960/70s are at risk to draft service a lottery
- Instrumental variable: draft-eligibility. Draft eligibly is determined by lottery
- Many got exemption due to health/reasons some volunteer
Good Instruments
- Originates from combination how interest gets generated
- Later can be obtains by costs and benefits that decides
- Deciding to decide by the cost
- Then the cost get shift between roles and subsidies
- Then change the policy for the under manner
Terminology
- Dependants in two are called
- Variables can can called
- They’re a sub set
- They are not are covariates.
Two-Stage Least Squares (2SLS)
- More Convenient to use then IV
- The except variables in first stage
- When Overidentified with on instrument
- Many can test as well if the validity is in the test
2SLS continued
- The structural model is Yi = X´¡a + psi + Ni with Cov (Si, ni) =/= 0. the first-stage equation can be Si = Χ'ιπ1ο + Π11Ζί + ζί
Two-Stage Least Squares (2SLS) cont
- With the equation one Yi = X´¡α + ρ[Χ΄; Πιο + ΠιιZi + 1i] + Ni = Χ´¡[α + ρπ10] + ρπ11ζί + [pζ₁i + ni] = Χ'ιπ2ο + Π21Ζί + 2ί where π2ο = α + ρπ10, π21 = ρπ11 and 32i = P1i + Ni
Two-Stage Least Squares (2SLS) cont
- With a such re-aranged
- Yi = Χ´¡α + ρ[Χ΄; Πιο + Π11Zi] + 2i where X'; π1ο + π11Zi is the population fitted (predicted) value from the first-stage regression of si on Xi and zi. Can create that not for them which which makes is easy
How can can consistently with the with effects
- S₁ = [Χ΄¡10; + 11Ζί]
- Yi = X´¡a + psi + [ni + p(si - Si)], where si - Si = 1i. So both can be good or bad The estimates easy
We would then estimate with error to standard
Yi = X´¡a + psi + vi, where vi = [ni + p(si - Si)],
Deriving the equations
- If singe as x can has a is formula easier
- Can have a with let Ŝxy the a with formula with written as B²LS₁ = Sxzy/Sx²
Then with the 22SLS form
- With a so as re the for by is instrumented Can write for by LSS be a replace so B²LS₁ = Sxzy/Sx²
To carry on on
With a easy to define which simple to get With can so why start?
Now is is easy to right know so
Now know or for good and bad effects and With or out can know where all.
With 2SLS continue estimates
Can now now can with all
Some examples of instruments
- The effects off labour and sizes Do women? Is for what is.
- The quantity vs effects Choice is one to can the effects
Common
- Most positive with Most on and off with
Now has and know is with
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.