Lecture 4

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

Consider an econometric model where schooling is suspected to be endogenous. A researcher proposes using a change in student loan policies as an instrument. Under what stringent condition would this instrument be considered valid, assuming it is relevant?

  • The change in student loan policies must be independent of potential earnings outcomes, conditional on observed covariates, and only affect earnings through its effect on schooling. (correct)
  • The change in student loan policies must affect schooling decisions differently across various demographic groups, ensuring heterogeneous treatment effects.
  • The change in student loan policies must directly influence the quality of education received, thereby creating a separate pathway to affect earnings, independent of years of schooling.
  • The change in student loan policies must be correlated with potential earnings outcomes, such that individuals from lower socioeconomic backgrounds are more likely to benefit.

In the context of Angrist's (1990) study on the effects of Vietnam-era military service on earnings, which of the following scenarios would most severely threaten the validity of draft-eligibility as an instrument for veteran status?

  • A significant proportion of draft-eligible men pursued further education to avoid service, thereby altering their human capital accumulation.
  • Men who were draft-eligible but volunteered for service exhibited systematically different pre-service earnings potential compared to those who were drafted.
  • Exemptions from military service due to health reasons were correlated with pre-existing health conditions that independently affected post-service labor market outcomes. (correct)
  • The draft lottery numbers were not perfectly random, with some birth dates having a slightly higher probability of selection due to administrative errors.

Suppose a researcher aims to estimate the causal effect of an additional year of schooling on wages using the Wald estimator. The reduced form estimate shows that individuals exposed to a policy change completed 0.2 years more schooling and earned 2% higher wages. However, the first-stage effect (policy change on schooling) is found to be statistically insignificant. Which of the following is the MOST appropriate conclusion?

  • The Wald estimator is valid, and the estimated effect should be interpreted with caution due to the small sample size.
  • The Wald estimator is valid, but the conclusion suggests that there is no effect of wages on schooling.
  • The Wald estimator is invalid, and the instrumental variable approach should not be used.
  • The Wald estimator is invalid due to the weak instrument problem, rendering the results unreliable. (correct)

In the context of Angrist and Krueger's (1991) study using quarter of birth as an instrument for education, which unobserved factor would MOST severely bias the Wald estimate of the return to schooling?

<p>Individuals born in different quarters exhibit inherent differences in cognitive abilities that are correlated with both educational attainment and earnings. (A)</p> Signup and view all the answers

Consider a scenario where a researcher attempts to use proximity to a new community college as an instrument for college attendance. What specific threat to the exclusion restriction should the researcher be MOST concerned about?

<p>Proximity to a community college may correlate with local economic development initiatives that independently boost earnings, irrespective of college attendance. (D)</p> Signup and view all the answers

In the context of the 'constant effect case' model, if a researcher posits that an individual's wage ($y_i$) is determined by $\alpha$, the return to schooling ($p s_i$), and a composite error term ($\eta_i$), and they further decompose $\eta_i$ into observed abilities ($A_i' \gamma$) and a random error term ($\upsilon_i$), under what specific condition does ordinary least squares (OLS) estimation of the 'short' regression ($y_i = \alpha + p s_i + \eta_i$) yield biased estimates of $p$?

<p>When the covariance between $s_i$ and $A_i$ is non-zero, indicating a correlation between schooling and the observed ability, leading to omitted variable bias. (B)</p> Signup and view all the answers

Given the model $y_i = \alpha + p s_i + A_i' \gamma + \upsilon_i$, where $y_i$ represents wages, $s_i$ represents schooling, $A_i$ represents observed abilities, $\gamma$ represents the effect of abilities, and $\upsilon_i$ is a random error term, what key identifying assumption must hold true to ensure that the population regression of $y_i$ on $s_i$ and $A_i$ yields unbiased estimates of the causal effect of schooling ($p$) on wages?

<p>That $s_i$ is randomly assigned, conditional on $A_i$, meaning that $E[\upsilon_i | s_i, A_i] = 0$, thereby precluding any systematic relationship between unobserved factors and schooling. (A)</p> Signup and view all the answers

In the context of instrumental variables (IV) estimation, which of the following scenarios would unequivocally violate the exclusion restriction, rendering the instrument invalid for consistently estimating the causal effect of schooling ($s_i$) on wages ($y_i$)?

<p>The instrument ($z_i$) is correlated with an individual's innate cognitive abilities, which also directly impact wages, even after controlling for years of schooling and observed skills. (B)</p> Signup and view all the answers

Consider a scenario where an instrumental variable $z_i$ is proposed to address the endogeneity of schooling ($s_i$) in a wage regression. While $z_i$ exhibits a statistically significant correlation with $s_i$, a formal overidentification test using multiple instruments fails to reject the null hypothesis that all instruments are valid. Which of the following inferences is most warranted?

<p>The instrumental variable $z_i$ validity remains uncertain, as overidentification tests possess limited power to detect subtle violations of the exclusion restriction, particularly in the presence of weak instruments or small sample sizes. (D)</p> Signup and view all the answers

In an instrumental variables (IV) framework designed to estimate the causal effect of schooling ($s_i$) on wages ($y_i$), where the error term ($\eta_i$) is potentially correlated with schooling, what specific assumption regarding the instrument ($z_i$) ensures that it breaks the endogenous variation in $s_i$ into components that are both correlated and uncorrelated with $\eta_i$ , allowing for consistent estimation of the causal effect?

<p>The instrument $z_i$ must be correlated with schooling ($s_i$) and uncorrelated with the error term $\eta_i$, thereby isolating the exogenous component of schooling that is unrelated to unobserved determinants of wages. (C)</p> Signup and view all the answers

Suppose you are employing an instrumental variable ($z_i$) strategy to estimate the causal effect of years of schooling ($s_i$) on individual earnings ($y_i$). You have reason to suspect that even though $z_i$ is ostensibly exogenous, its effect on earnings may be heterogeneous across different subgroups of the population. Which econometric technique would be most appropriate to assess the potential bias arising from this heterogeneous treatment effect and obtain a more precise estimate of the average treatment effect?

<p>Local Average Treatment Effect (LATE) estimation, focusing on the subpopulation whose schooling decisions are actually influenced by the instrument. (B)</p> Signup and view all the answers

Consider a scenario where researchers are using proximity to a college ($z_i$) as an instrument for educational attainment ($s_i$) in a wage regression. However, they discover that individuals who grow up near colleges also tend to have better access to healthcare and other resources that directly impact their future earnings, irrespective of their educational choices. This situation exemplifies which of the following threats to the validity of the instrumental variable?

<p>Violation of the exclusion restriction, because proximity to college affects earnings through channels other than educational attainment. (D)</p> Signup and view all the answers

In the context of instrumental variables (IV) analysis, what fundamental threat does the exclusion restriction aim to mitigate, and why is its untestability a critical consideration for econometricians?

<p>It prevents omitted variable bias by ensuring that the instrument only affects the outcome variable, via the treatment variable, therefore is uncorrelated with error term. (C)</p> Signup and view all the answers

Consider an instrumental variable $z_i$ that purportedly affects a causal variable of interest $s_i$. What specific condition must be met to ensure the validity of $z_i$ as an instrument, and why is directly testing this condition inherently problematic?

<p>$\text{Cov}(z_i, \eta_i) = 0$, which is problematic because$\eta_i$ encompasses unobservable determinants of the outcome. (C)</p> Signup and view all the answers

In an instrumental variable framework, why is the first-stage requirement—that the instrument must have a demonstrable effect on the causal variable—considered essential, and how does a failure in this stage undermine the entire analysis?

<p>It guarantees the identifiability of causal effects by generating variation to explain the outcome. (D)</p> Signup and view all the answers

Consider the scenario where an instrument $z_i$ (e.g., quarter of birth) is proposed to address endogeneity in a causal variable $s_i$ (e.g., years of schooling) when estimating its impact on an outcome $y_i$ (e.g., earnings). If $z_i$ is found to directly influence unobserved determinants of $y_i$ or ability, what specific IV assumption is violated, and what is the implication for the validity of the IV estimates?

<p>The exclusion restriction is violated, leading to biased and inconsistent IV estimates. (C)</p> Signup and view all the answers

Random assignment in experiments is often likened to the exclusion restriction in instrumental variables. In what critical sense does a valid instrument, satisfying the exclusion restriction, emulate random assignment, and why is this equivalence fundamental for causal inference?

<p>It creates exogenous variation in the causal variable, mimicking random assignment conditional only on the instrument. (D)</p> Signup and view all the answers

Suppose you are employing an instrumental variable $z_i$ to estimate the causal effect of $s_i$ on $y_i$. How does the strength of the correlation between $z_i$ and $s_i$ (the first stage) specifically affect the properties of the resulting instrumental variable estimator, particularly in terms of bias and efficiency?

<p>A weak correlation leads to bias amplification, even with a small violation of the exclusion restriction. (D)</p> Signup and view all the answers

In the framework of instrumental variables, the exclusion restriction is pivotal. Enumerate the potential consequences of violating this restriction, elucidating how such a violation undermines the integrity and interpretability of the causal inferences drawn from the analysis.

<p>Biased estimates arise because the instrument is correlated with unobserved determinants of the outcome. (A)</p> Signup and view all the answers

Consider the application of quarter of birth as an instrument for educational attainment. What are the primary, theoretically-grounded concerns regarding the validity of this instrument, especially concerning potential violations of the exclusion restriction in contemporary, developed economies?

<p>All of the above (E)</p> Signup and view all the answers

In the context of the exclusion restriction in instrumental variables, why is it inherently impossible to empirically 'test' the validity of this assumption, and what implications does this untestability have for the interpretation of IV estimates?

<p>All of the above (E)</p> Signup and view all the answers

Critically evaluate the assertion that a valid instrument provides variation in a causal variable 'as good as random.' In what precise sense does this hold, and what are the key limitations or caveats to this analogy when dealing with real-world instrumental variable applications?

<p>All of the above. (E)</p> Signup and view all the answers

In the context of Instrumental Variables (IV) estimation, if the exclusion restriction is demonstrably violated, what inferential challenge is most likely to arise, assuming all other IV assumptions hold?

<p>The IV estimator becomes biased and inconsistent, as the instrument now directly affects the outcome variable through pathways other than the endogenous variable, invalidating causal inferences. (D)</p> Signup and view all the answers

Consider an IV model analyzing the effect of educational attainment ($s_i$) on earnings ($y_i$), where $z_i$ represents proximity to a college. What specific threat to validity is most directly addressed by including a rich set of geographic and socioeconomic controls ($X_i'$) in both the first-stage and reduced-form equations?

<p>Mitigating the risk of omitted variable bias by accounting for confounding factors that simultaneously affect both educational attainment and earnings. (D)</p> Signup and view all the answers

In the context of the Wald estimator, assume you are analyzing the impact of military service ($s_i$), a binary variable, on future civilian earnings ($y_i$). You use draft lottery number ($z_i$) as an instrument, also a binary variable. If $E[y_i | z_i = 1] = $50,000$, $E[y_i | z_i = 0] = $40,000$, $p = P(z_i = 1) = 0.3$, and $Cov(s_i, z_i) = 0.0525$, what is the Wald estimate of the effect of military service on earnings?

<p>Approximately $285,714. (A)</p> Signup and view all the answers

Consider an instrumental variable (IV) regression where both the endogenous variable ($s_i$) and the instrument ($z_i$) are binary. Which of the following statements accurately describes the conditions under which the IV estimator will produce a consistent estimate of the local average treatment effect (LATE)?

<p>The IV estimator is consistent for the LATE under the assumptions of instrument relevance, exogeneity, the stable unit treatment value assumption (SUTVA), and monotonicity (no defiers). (A)</p> Signup and view all the answers

Suppose an econometrician uses quarter of birth ($z_i$) as an instrument for years of schooling ($s_i$) in a regression examining the effect of education on earnings ($y_i$). Assuming a linear model as described, what specific concern arises if individuals strategically choose their birth quarter to maximize their schooling opportunities?

<p>This strategic behavior directly threatens the exclusion restriction by creating a direct path from birth quarter to earnings outside of schooling. (C)</p> Signup and view all the answers

In IV estimation, what is the consequence of using an instrument ($z_i$) that is only weakly correlated with the endogenous variable ($s_i$) in the first-stage equation, even if the instrument satisfies the exclusion restriction?

<p>The resulting IV estimator will be biased toward the OLS estimate and have a large variance, leading to unreliable inferences. (A)</p> Signup and view all the answers

Consider a scenario where $s_i$ represents years of schooling and $y_i$ represents log earnings. Using the described IV framework, which of the following best describes limitations of interpreting $$ as the causal effect of an additional year of schooling on log earnings for all individuals?

<p>If the effect of schooling on earnings is heterogeneous and the instrument only affects schooling for a subset of the population (compliers), then $$ represents a local average treatment effect (LATE) specific to that subset. (B)</p> Signup and view all the answers

In the two-equation IV system provided, how does the inclusion of the covariates $X_i'$ affect the interpretation of the coefficients ${11}$ and ${21}$?

<p>$<em>{11}$ now reflects the <em>direct</em> effect of $z_i$ on $s_i$ after partialling out the effects of $X_i'$, and $</em>{21}$ the <em>direct</em> effect of $z_i$ on $y_i$ after partialling out the effects of $X_i'$. (A)</p> Signup and view all the answers

In a scenario where $z_i$ (the instrument) is a valid instrument, $s_i$ is the endogenous variable, and $y_i$ is the outcome, what is the most accurate interpretation of $Cov(y_i, z_i)$?

<p>It reflects the association between the instrument $z_i$ and the outcome $y_i$, which, under the exclusion restriction, captures the indirect effect of $z_i$ on $y_i$ through $s_i$. (A)</p> Signup and view all the answers

Assuming that $z_i$ is a binary instrument. Under what condition does $Cov(y_i, z_i) = {E[y_i | z_i = 1] - E[y_i | z_i = 0]}p(1 - p)$ accurately compute covariance even if the sample data exhibits substantial heteroskedasticity?

<p>This equation relies on assumptions of homoskedasticity; therefore, in the presence of substantial heteroskedasticity, standard error corrections (e.g., White's correction) are necessary for valid inference, but the point estimate of covariance remains accurate. (C)</p> Signup and view all the answers

In the context of Angrist and Krueger's (1991) study on the causal effect of education on earnings, how does the quarter of birth instrument leverage institutional knowledge and compulsory schooling laws to address omitted variable bias, specifically in relation to unobserved ability?

<p>By exploiting the exogenous variation induced by school-starting age policies and compulsory schooling laws, creating differential exposure to schooling based on birth quarter, which is plausibly uncorrelated with unobserved ability, thus isolating the causal effect of education. (C)</p> Signup and view all the answers

Within the framework of instrumental variables (IV) estimation, if an instrument $z_i$ is correlated with the error term $\eta_i$, what is the most likely consequence for the estimation of the parameter $\pi_1$ representing the causal effect of schooling ($s$) on earnings?

<p>The estimated $\pi_1$ will be inconsistent, meaning it will not converge to the true value even with an infinite sample size, due to the violation of the exclusion restriction. (C)</p> Signup and view all the answers

In the context of instrumental variables, what critical assumption must hold true for an instrument $z_i$ to validly isolate the 'unproblematic' variation in a variable $s$ (schooling) that is uncorrelated with the error term $\eta_i$ (unobserved factors), and how is this assumption typically assessed in empirical research?

<p>The instrument must satisfy the exclusion restriction, meaning it affects the outcome only through its effect on the endogenous variable; this is assessed by thoroughly justifying the absence of any direct pathways and conducting overidentification tests when multiple instruments are available. (B)</p> Signup and view all the answers

Considering Angrist and Krueger's study utilizing quarter of birth as an instrument, what potential threat to the validity of their instrument could arise if individuals born in different quarters of the year systematically differ in unobservable characteristics that also affect earnings, and how might this compromise the identification of the causal effect of education?

<p>All of the above. (D)</p> Signup and view all the answers

Suppose a researcher aims to replicate Angrist and Krueger's (1991) study in a different cultural context where seasonal employment patterns significantly influence family income. How might these seasonal employment patterns affect the validity of quarter of birth as an instrument for educational attainment, and what specific econometric strategies could be employed to address this potential confounding?

<p>Seasonal employment patterns could induce a spurious correlation between quarter of birth and educational attainment if families facing income shocks during specific seasons are more or less likely to invest in their children's education. This could be addressed by including seasonal employment rates as control variables in the instrumental variable regression. (C)</p> Signup and view all the answers

Consider a scenario where the compulsory schooling laws exhibit significant heterogeneity across different states within the US during the period studied by Angrist and Krueger (1991). How would this heterogeneity impact the interpretation of their findings, and what methodological adjustments would be necessary to account for this variation in compulsory schooling requirements?

<p>The heterogeneity would require interacting the quarter of birth instrument with state-level indicators of compulsory schooling laws, allowing for the estimation of interaction effects that capture the moderating role of state policies on the relationship between quarter of birth and educational attainment. (C)</p> Signup and view all the answers

Suppose that advancements in fertility treatments have led to a higher incidence of multiple births (twins, triplets, etc.). How might the presence of multiple births within a family introduce a potential bias in Angrist and Krueger's instrumental variable approach, and what analytic strategies could be implemented to address this bias?

<p>Multiple births could introduce bias if families with twins or triplets alter their educational investment decisions due to resource constraints, potentially creating a correlation between quarter of birth and unobserved family characteristics. This could be addressed by excluding families with multiple births from the analysis or by including a control variable for the presence of multiple births. (C)</p> Signup and view all the answers

In the context of instrumental variables estimation, define and differentiate between the concepts of 'relevance' and 'exclusion restriction' concerning the validity of an instrument $z_i$ and explain how a failure of either condition would compromise the identification of a causal effect.

<p>Relevance refers to the instrument's correlation with the endogenous variable ($E[z_i s_i] \neq 0$), while the exclusion restriction requires that the instrument only affects the outcome through its effect on the endogenous variable. Failure of relevance leads to weak instrument bias, while failure of the exclusion restriction causes inconsistent estimates. (B)</p> Signup and view all the answers

In the context of Angrist and Krueger’s study, consider the potential for differential migration patterns based on quarter of birth. If individuals born in certain quarters are more likely to migrate to regions with different labor market conditions and educational opportunities, how could this migration bias affect the validity of their instrumental variable estimates, and what statistical techniques could be employed to mitigate this potential bias?

<p>Migration could introduce bias if individuals born in certain quarters are more likely to migrate to areas where the returns to education are systematically different, thus violating the exclusion restriction. This bias could be mitigated using inverse probability weighting based on predicted migration probabilities, or by including region-specific fixed effects. (B)</p> Signup and view all the answers

Flashcards

Ability (𝐴𝑖)

A vector of variables representing an individual's skills and knowledge.

Random term (𝜐𝑖)

The portion of potential outcomes not explained by ability (𝐴𝑖).

Omitted Variable Bias

An unobserved factor that correlates with the independent variable (𝑠𝑖) and the dependent variable (𝑦𝑖), leading to biased estimates.

Instrumental Variable (𝑧𝑖)

A variable correlated with the causal variable of interest (𝑠𝑖) but uncorrelated with other determinants of the dependent variable (𝑦𝑖).

Signup and view all the flashcards

IV Exogeneity Condition

Correlation between the instrument (𝑧𝑖) and the error term (𝜂𝑖) should be zero.

Signup and view all the flashcards

Intuition Behind IV

Breaks the variation in the independent variable into two parts: one correlated with the error term, and one not correlated with the error term.

Signup and view all the flashcards

IV Decomposition

Decompose the effect into a part that is correlated with ( \eta_i ) and another that is orthogonal.

Signup and view all the flashcards

Problematic Part of a Variable

The element of a variable 's' that correlates with error term 𝜂𝑖, potentially causing bias.

Signup and view all the flashcards

Unproblematic Part of a Variable

The element of a variable 's' that is NOT correlated with the error term 𝜂𝑖, helping to provide an unbiased estimate.

Signup and view all the flashcards

Instrument (in IV)

A variable (zi) that isolates the variation in 's' that's uncorrelated with 𝜂𝑖, enabling a consistent estimate of 𝜋1.

Signup and view all the flashcards

Angrist & Krueger (1991)

A study examining education's impact on earnings using birth quarter as an instrument for schooling.

Signup and view all the flashcards

School-Starting Age Policies (US)

School typically starts in September in the US for children turning 6 in that calendar year.

Signup and view all the flashcards

Compulsory Schooling Laws (US)

Laws requiring students to stay in school until their 16th birthday.

Signup and view all the flashcards

Natural Experiment (Birth Quarter)

Individuals born in different quarters face slightly different required schooling durations, creating a real-world experiment.

Signup and view all the flashcards

Birth Quarter & Schooling

Those born earlier in the year might get slightly lesser schooling, due to compulsory schooling laws.

Signup and view all the flashcards

Quarter-of-Birth Instrument

A variable (like quarter of birth) used to create variation in schooling that is plausibly unrelated to an individual's ability.

Signup and view all the flashcards

Strong First-Stage

The instrument must have a demonstrable and significant impact on the causal variable being studied.

Signup and view all the flashcards

Exclusion Restriction

The instrument (𝑧𝑖) only influences the outcome (𝑦𝑖) through its effect on the causal variable (𝑠𝑖).

Signup and view all the flashcards

Exclusion Restriction Implication

The instrument can be omitted from the outcome equation without causing bias.

Signup and view all the flashcards

Exclusion Restriction: Covariance Form

Ensures that the instrument is unrelated to unobserved determinants of the outcome variable.

Signup and view all the flashcards

IV as 'Good as Random'

The exclusion restriction makes the variation in the causal variable as if it were randomly assigned.

Signup and view all the flashcards

Exclusion Restriction Independence

The instrument is independent of potential outcomes but related to the causal variable of interest.

Signup and view all the flashcards

IV Relevance

The instrument should cause a change in the causal variable (e.g. schooling).

Signup and view all the flashcards

η𝑖

The error term in a regression equation that includes all relevant variables.

Signup and view all the flashcards

Assumption for Valid Instrument

A requirement for IV to be uncorrelated with unobserved variables

Signup and view all the flashcards

Wald Estimator

An IV estimator calculated as the ratio of the reduced form to the first stage.

Signup and view all the flashcards

Reduced Form

The effect of the instrument on the outcome variable.

Signup and view all the flashcards

First Stage

The effect of the instrument on the endogenous variable.

Signup and view all the flashcards

Angrist and Krueger (1991)

Using quarter of birth as an instrument to estimate the economic value of education.

Signup and view all the flashcards

Angrist (1990)

Draft eligibility (determined by lottery) as an instrument for veteran status.

Signup and view all the flashcards

First-Stage Equation

Regresses the causal variable (𝑠𝑖) on the instrument (𝑧𝑖) and other controls (𝑋´𝑖).

Signup and view all the flashcards

Reduced Form Equation

Estimates the direct effect of the instrument (𝑧𝑖) on the outcome (𝑦𝑖).

Signup and view all the flashcards

Exclusion Restriction (in IV)

The instrument (𝑧𝑖) affects earnings (𝑦𝑖) only through its effect on schooling (𝑠𝑖).

Signup and view all the flashcards

IV estimate of causal variable

Shows causal effect of increasing your causal variable by one unit on the outcome variable.

Signup and view all the flashcards

IV-Estimate Formula

Ratio of the covariance between the outcome and the instrument to the covariance between the causal variable and the instrument .

Signup and view all the flashcards

IV-Estimate as Regression Ratio

The ratio of the regression coefficients of the reduced form to the first stage equation.

Signup and view all the flashcards

Cov(𝑦𝑖 , 𝑧𝑖 ) Formula with Binary 𝑧𝑖

E[𝑦𝑖 |𝑧𝑖 = 1] − E[𝑦𝑖 |𝑧𝑖 = 0] * p(1 − p).

Signup and view all the flashcards

Exogeneity Condition

Correlation between the instrument (Zi) and the error term (ηi) is zero.

Signup and view all the flashcards

Study Notes

  • Instrumental variables (IV) are utilized when the zero conditional mean assumption in Ordinary Least Squares (OLS) regressions is unlikely to hold.
  • IV serves as a quasi-experimental method, mimicking a real experiment to address issues when conducting a real experiment is not feasible.

Roadmap of IV:

  • Circumstances under which an instrument is useful are explored
  • Examination of IV under the constant effects.
  • Identification of necessary assumptions.
  • Introduction to the Wald estimator.
  • Discussion on Two-Stage Least Squares (TSLS).
  • Coverage of IV tests.
  • Investigation of IV under heterogeneous effects assumptions.

Where an Instrument Can Help:

  • Instrumental variables can address threats to internal validity such as:
    • Omitted variables
    • Measurement errors in X
    • Simultaneity bias
  • Bias arises in all of these cases because E[u|X] ≠ 0

Endogeneity:

  • Used to characterize threats to internal validity when E[u|X] ≠ 0.
  • When a variable X is endogenous, OLS measures correlation rather than a causal effect.
  • Exogeneity occurs when variable X is uncorrelated with u.

IV: Two Cases:

  • IV studied under two scenarios:
    • One is where the effect is the same for everyone called the constant effect case.
    • The other is where the effect differs across people called the heterogeneous effect case.

IV: Constant Effect Case

  • With constant effect example shown through causal link of schooling and wages:
    • Where potential outcomes equal ysi = fi(s).
  • ysi denotes potential schooling outcomes where:
    • equation is fi(s) = πο + π₁ς + ηi.
  • ηi denotes unobserved factors affecting wages.
  • ηi can be function of vector, Ai, called ability:
    • ηi = A´¡Y + Vi
  • y denotes vector of population regression coefficients
  • vᵢ is random term that are uncorrelated by construction

Constant Effect

  • Model can be written as yᵢ = α + ρsᵢ + ηi, where the Aᵢ variables, contained in ηi, correlate with sᵢ.
  • Provided the Aᵢ correlate with sᵢ this implies an omitted variable issue
  • The Aᵢ are the only reason sᵢ and ηi are correlated.
  • Equation E[sᵢvᵢ]=0

Instrumental Variable:

  • Need to estimate regression coefficient p, when Aᵢ is unobserved.
  • Instrumental variables methods can estimate p, when there exists a variable which is:
    • Correlated with causal variable of interest sᵢ.
    • Uncorrelated with any other determinants of the dependent variable yᵢ.
  • This variable is an instrument denoted zᵢ.
  • Term "uncorrelated with any other determinants of dependent variable" implies:
    • Cov (ηi, zi) = 0, or zᵢ is uncorrelated with Aᵢ and vᵢ.
    • Variable zᵢ is uncorrelated with abilities Aᵢ and unobserved determinants of wage.
    • Variable zᵢ should be correlated with schooling sᵢ.

The Intuition Behind IV:

  • With variable zᵢ, ysi = πο + π₁ς + ηi where E[ηi|s] ≠ 0.
  • IV splits in s into two parts:
    • One that correlates with ηi.
    • One that doesn't correlate with ηi.
  • Part correlated with ηi is the "problematic" part
  • Part correlated with ηi is the "unproblematic" part
  • With an instrument zᵢ to isolate or predict variation in part of s that is uncorrelated with ηi:
    • Allowing a consistent π₁ estimate
  • Instrument helps distinguish between uncorrelated part from correlated part in s.
  • The instrument detects movements in s uncorrelated with ηi and uses these to estimate π₁.
  • Variable zi is used to only use the variation schooling s unrelated abilities Aᵢ.

Example of Quarter of Birth Instrument:

  • Angrist and Kreuger wanted to study the causal effect of education on earnings
  • Institutional knowledge about school-starting age policies and compulsory schooling laws in the US generated instrument for schooling.
  • US states generally require 6 year olds to start school in the calendar year (starting in September).
  • All children born in same year start school in September of the same year.
  • Compulsory schooling laws require student to remain in schools until 16th birthday.
  • Means students in different quarters will complete different degrees by the legal dropout age.
  • Angrist and Kreuger's analysis creates a "natural experiment" where those with different birthdays stay in school different amounts if time.

Angrist and Kreuger Quarter of Birth

  • Example: a person born in January can drop out of school approximately 3 months earlier that someone born in April.
  • This may result in less schooling for those born earlier in year.
  • Angrist and Kreuger's instrument, quarter of birth, affects years of schooling.
  • Quarter-of-birth instrument distinguishes between a "problematic" and "unproblematic" variation in schooling sᵢ.
  • Therefore the variation caused by quarter of birth is unrelated to individual ability.
  • It is a bad instrument if quarter birth affecting ability and unobserved earnings determinants.

Assumptions Needed for IV

  • Instrument zᵢ has a clear affect on the causal variable of interest, schooling. which is "first-stage".
  • No effect from the instrument means no information about the causal variable of interest.
  • Can be easily checked with that data
  • Example: birth quarter must significantly affect years of schooling.
  • Second assumption is that the only relationships between outcome variable, yᵢ, and instrument, zᵢ is the first stage.
  • Then zᵢ can only affect yᵢ (earnings) via its effect, sᵢ (schooling).
  • Such that an exclusion restriction can be written as Cov(zᵢ, ηi) = 0.
  • Exclusion restrictions cannot be tested, identifying characteristics, not just beliefs

IV Exclusion Restriction

  • Exclusion Restriction is called so as the instrumental zᵢ assumed has no correlation with unobserved outcome determinants yᵢ.
  • So its excluded from the main outcome equation Excluding it should not cause omitted variable bias.
  • Example the quarter of birth can be excluded as its uncorrelated

The Exclusion Restriction

  • The exclusion restriction of IV implies causal variable of interest is as good as random.
  • Exclusion restriction says instrument for potential outcomes which exclude causal outcomes
  • Conditional on several covariates a birth is good as

Angrist and Kreuger Formally

  • Following equation are formalized taking the birth story
  • sᵢ = X'ᵢπ₁₀ + π₁₁zᵢ + ζ₁ᵢ
  • yᵢ = X'ᵢπ₂₀ + π₂₁zᵢ + ζ₂ᵢ
  • Equation one is know as the first stage
    • the schooling sᵢ on the instrument zᵢ and other controls X'ᵢ.
  • The second is known as the reduced form
    • the direct effects
  • If exclusion restrictions hold the reduce form shows the direct effects
    • only affecting schooling
  • Shows the effects on instrument
  • The by community can work causal increase of schooling by a year

The Constant Effect Case

  • Given the exclusion, an estimate can be given:
    • ρ = Cov(yi, zi)/Cov(si, zi) = Cov(yi, zi)/V(zi) / Cov(si, zi)/V(zi)
  • Good form as it can be expressed in the ratio
  • Can be population regression yᵢ on zᵢ to the first stage equation

The Wald Estimator

  • A IV estimator is the most popular estimate is a single binary number
  • A dummy that equals with a set probably can be found
    • Cov(yi, zi)={E [yizi = 1] - E [yi]zi = 0]} p(1-p)
  • An similar for formula known as Cov (Si, Zi). The IV estimator is ρ = E [yilzi=1]-E [yizi=0] divided by E [SiZi=1]-E [SiZi=0]

Wald Estimator Example

  • Angrist and Krueger quarter of birth study using.
  • Reduced for the difference is earn between the two points
  • Between with values standard error
  • The values that of the men are the years
  • Divided and get Wald economics

Wald Estimator 2

  • The Angrist (1990) looks at Vietnam military service on earnings for veterans
  • Young men in 1960/70s are at risk to draft service a lottery
  • Instrumental variable: draft-eligibility. Draft eligibly is determined by lottery
  • Many got exemption due to health/reasons some volunteer

Good Instruments

  • Originates from combination how interest gets generated
  • Later can be obtains by costs and benefits that decides
  • Deciding to decide by the cost
  • Then the cost get shift between roles and subsidies
  • Then change the policy for the under manner

Terminology

  • Dependants in two are called
  • Variables can can called
  • They’re a sub set
  • They are not are covariates.

Two-Stage Least Squares (2SLS)

  • More Convenient to use then IV
  • The except variables in first stage
  • When Overidentified with on instrument
  • Many can test as well if the validity is in the test

2SLS continued

  • The structural model is Yi = X´¡a + psi + Ni with Cov (Si, ni) =/= 0. the first-stage equation can be Si = Χ'ιπ1ο + Π11Ζί + ζί

Two-Stage Least Squares (2SLS) cont

  • With the equation one Yi = X´¡α + ρ[Χ΄; Πιο + ΠιιZi + 1i] + Ni = Χ´¡[α + ρπ10] + ρπ11ζί + [pζ₁i + ni] = Χ'ιπ2ο + Π21Ζί + 2ί where π2ο = α + ρπ10, π21 = ρπ11 and 32i = P1i + Ni

Two-Stage Least Squares (2SLS) cont

  • With a such re-aranged
  • Yi = Χ´¡α + ρ[Χ΄; Πιο + Π11Zi] + 2i where X'; π1ο + π11Zi is the population fitted (predicted) value from the first-stage regression of si on Xi and zi. Can create that not for them which which makes is easy

How can can consistently with the with effects

  • S₁ = [Χ΄¡10; + 11Ζί]
  • Yi = X´¡a + psi + [ni + p(si - Si)], where si - Si = 1i. So both can be good or bad The estimates easy

We would then estimate with error to standard

Yi = X´¡a + psi + vi, where vi = [ni + p(si - Si)],

Deriving the equations

  • If singe as x can has a is formula easier
  • Can have a with let Ŝxy the a with formula with written as B²LS₁ = Sxzy/Sx²

Then with the 22SLS form

  • With a so as re the for by is instrumented Can write for by LSS be a replace so B²LS₁ = Sxzy/Sx²

To carry on on

With a easy to define which simple to get With can so why start?

Now is is easy to right know so

Now know or for good and bad effects and With or out can know where all.

With 2SLS continue estimates

Can now now can with all

Some examples of instruments

  • The effects off labour and sizes Do women? Is for what is.
  • The quantity vs effects Choice is one to can the effects

Common

  • Most positive with Most on and off with

Now has and know is with

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

Use Quizgecko on...
Browser
Browser