Causal Inference in Economics

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

According to Cunningham (2021), what does causal inference leverage to estimate the impact of events and choices?

  • Theory and deep knowledge of institutional details (correct)
  • Statistical correlations alone
  • Intuition and common sense
  • Historical precedents and analogies

Correlation between two variables always implies a causal relationship.

False (B)

Which of the following is the main reason correlations often fail to reveal causal relationships, according to Cunningham (2021)?

  • Data collection errors
  • Flaws in statistical methods
  • Human beings engaging in optimal behavior (correct)
  • Insufficient sample sizes

The "Credibility Revolution" in economics signifies a transformative shift prioritizing rigorous ______ over simple correlations.

<p>causal inference</p> Signup and view all the answers

What is the primary aim of scientific methodologies, as described in the lecture?

<p>To form a particular kind of belief (B)</p> Signup and view all the answers

In causal inference, 'endogeneity' is not a major concern.

<p>False (B)</p> Signup and view all the answers

Define 'ceteris paribus' in the context of comparative statistics and causal inference.

<p>holding other things constant</p> Signup and view all the answers

According to the lecture, which of the following is a key characteristic of design-based studies?

<p>Prima facie credibility (B)</p> Signup and view all the answers

Experiments in causal inference are always feasible, cost-effective, and free of ethical concerns.

<p>False (B)</p> Signup and view all the answers

In regression analysis, the ______ captures the influence of all unobserved variables on the dependent variable.

<p>error term</p> Signup and view all the answers

What does it mean for the error term u to be 'mean independent of x' in a regression model?

<p>The average value of u doesn't vary with x (D)</p> Signup and view all the answers

Match the concept with its description:

<p>Endogeneity = The issue where the independent variable is correlated with the error term Omitted Variable Bias = A type of bias that occurs when a relevant variable is not included in the model Reverse Causality = The situation where the dependent variable influences the independent variable Self-Selection Bias = A bias that arises when individuals choose to participate in a study</p> Signup and view all the answers

Ordinary Least Squares (OLS) can still produce reliable coefficient estimations, even if critical assumptions on error terms are violated.

<p>False (B)</p> Signup and view all the answers

What is the primary goal of regression analysis in the sciences?

<p>To test theories and estimate relationships between variables (C)</p> Signup and view all the answers

The ______ is the component of the dependent variable not explained by the independent variables in a regression model.

<p>error term</p> Signup and view all the answers

Lack of correlation between two variables definitively proves that there is no causal relationship between them.

<p>False (B)</p> Signup and view all the answers

What is the purpose of the error term (u) in the population model $y = \beta_0 + \beta_1x + u$?

<p>To capture the influence of all factors that determine or affect y. (C)</p> Signup and view all the answers

Explain the concept of 'reverse causality' and give a brief example.

<p>When the presumed dependent variable actually influences the presumed independent variable. For example, the effect of health on education; education may improve health, rather than health affecting education.</p> Signup and view all the answers

Match each term with its definition in the context of a linear regression:

<p>$\beta_0$ = Population Intercept: The constant term in the regression equation, indicating where the line intercepts the y-axis $\beta_1$ = Population Slope: The coefficient that determines the change in the dependent variable for each unit change in the independent variable u = Error Term: Represents the factors affecting the dependent variable that are not included as independent variables in the model</p> Signup and view all the answers

Consider the linear regression model $y = \beta_0 + \beta_1x + u$. Which of the following statements is true if $E(u|x) \neq E(u)$?

<p>The mean of the error term depends on values of x. (A)</p> Signup and view all the answers

If the error term (u) is correlated with any of the independent variables, the OLS estimates will be biased.

<p>True (A)</p> Signup and view all the answers

Define "unbiasedness" in the context of an estimator.

<p>An estimator is unbiased if its expected value equals the true parameter value.</p> Signup and view all the answers

Which of the following indicates the correct interpretation of $E(\hat{\beta}) = \beta$ ?

<p>The anticipated sample estimate of the parameter is equal to the parameter. (A)</p> Signup and view all the answers

The OLS estimator chooses coefficients that minimize the ______ of squared residuals.

<p>sum</p> Signup and view all the answers

In OLS regression, the error term and the residuals are the same thing.

<p>False (B)</p> Signup and view all the answers

If one of the assumptions of OLS Regression is that the mean of the error term (u) equals zero, then according to the description, we should interpret this as meaning that $E(y|x)$ is equal to:

<p>$\beta_0 + \beta_1x$ (A)</p> Signup and view all the answers

Write the formula for $R^2$ as a function of SST and SSR.

<p>$R^2 = 1 - \frac{SSR}{SST}$</p> Signup and view all the answers

What does a high value of R-squared ($R^2$ close to 1) indicate in the context of regression analysis?

<p>There is a perfect linear relationship between x and y. (C)</p> Signup and view all the answers

Higher values of $R^2$ are required for causal inference.

<p>False (B)</p> Signup and view all the answers

The standard error of the regression ($\sigma$) is an estimate of the standard deviation of the ______ in the regression.

<p>error</p> Signup and view all the answers

Match the problem with a possible symptom:

<p>Heteroskedasticity = Non-constant variance of errors Multicollinearity = High correlation among independent variables Omitted Variable Bias = Biased coefficient estimates</p> Signup and view all the answers

In the context of causal inference methodologies, what does DiD stand for?

<p>Difference-in-Differences (A)</p> Signup and view all the answers

In causal inference, methods can be applied without incorporating theory and local institutional knowledge.

<p>False (B)</p> Signup and view all the answers

When economists over-obsess good experiments they leave what questions unsolved?

<p>Big questions unanswered that have to do with poverty, inequality and unemployment. (A)</p> Signup and view all the answers

Using what methodologies have economists used causal inference techniques to answer big questions?

<p>regression discontinuity, RCTs, DiD</p> Signup and view all the answers

According to Hausman, experimental and quasai-experimental research designs offer little predictive value beyond the ______ of the experiment in question.

<p>context</p> Signup and view all the answers

To look for more evidence about the generalizable outside world one should aim to:

<p>Accumulate convincing empirical findings. (D)</p> Signup and view all the answers

Economic theory doesn't understand the picture that emerges from a constellation of empirical findings, but it does help us paint the full picture

<p>False (B)</p> Signup and view all the answers

Complete the following statement: Causal inference requires and relies on an understanding of

<p>the behavioral processes that structure real-world equilibria.</p> Signup and view all the answers

Flashcards

Causal Inference

Estimating the impact of events and choices leveraging theory and deep institutional knowledge.

Correlation

Association between two variables; doesn't prove cause.

Economic Theory

The choices correlated between those choices and outcomes will rarely represent a direct cause.

Develop Models

Modeling the relationship between variables to form testable hypotheses.

Signup and view all the flashcards

Comparative statistics

In models, theoretical descriptions of causal effects.

Signup and view all the flashcards

Endogenous Income

When income affects transport. Also related to urban infra quality and self-selection.

Signup and view all the flashcards

Endogeneity Problems

Problems with research questions and omitted variables

Signup and view all the flashcards

Demand Curve

Demand curves are theoretical and unobservable

Signup and view all the flashcards

Price exogeneity

Variation in prices are independant of u (exogeneity) otherwise it is hard to see.

Signup and view all the flashcards

Credible Research Design

Need to isolate source of variation to reveal effect.

Signup and view all the flashcards

Credibility Revolution

A transformative shift in economics, prioritizing rigorous causal inference over simple correlations

Signup and view all the flashcards

Random Variable

Random variable function maps to real numbers, facilitates random process analysis

Signup and view all the flashcards

Expected Value

Weighted average of values where probabilities weight each value.

Signup and view all the flashcards

Variance

Expected value of squared deviation from mean.

Signup and view all the flashcards

Covariance

Expected value of the product of deviations.

Signup and view all the flashcards

Correlation

Cov X,Y divided by standard deviations.

Signup and view all the flashcards

Population Model

Start with simple model with cross-section data.

Signup and view all the flashcards

Mean independent x

Error term mean is same for every slice of x.

Signup and view all the flashcards

Zero Conditional Mean

E(u|x) = 0 for all values of x, important assumption in models.

Signup and view all the flashcards

Ordinary Least Squares (OLS)

Estimate population parameters, given data.

Signup and view all the flashcards

Two conditions to be TRUE

With OLS assumption that u is independent of x

Signup and view all the flashcards

Variation impact

The variation allows is to identify the impact. With the school example, not everyone would have years of the same schooling.

Signup and view all the flashcards

Residual

Prediction error based on discrepancy between the fitted + actual y.

Signup and view all the flashcards

Squares of the Mistakes

Sum the Squared Differences

Signup and view all the flashcards

Statistical Properties

Estimates can have the chance to be unbiased through OLS. If this is true with repeated sampling we would get the average outcome/all possible random samples.

Signup and view all the flashcards

Random Sampling

If assumption is met then each i is drawn from population= for ach i= + + with being observed error for observation (not residual we compute from data.

Signup and view all the flashcards

Centred at Zero

Estimates are similar to hard-coded values built into the data +allows graph to be linear at the end. However, u,v are independent, and results will be exactly zero.

Signup and view all the flashcards

Variance

Has 5 assumptions (Homoskedasticity) and also means the population error term, u, has variance to give same variance for value of x. = .

Signup and view all the flashcards

Heterskedasticity OK!?

Is this standard number to show unbias ness but not to just give and accept that there is Heteroskedasticity of errors +is norm.

Signup and view all the flashcards

Inherent uncertainty capture?

Zero conditional mean, so the rule that is followed causes a co-efficient from regression to become unbiased.

Signup and view all the flashcards

Shows Data!!

Shows data with 3 decimal places of high accurate results.

Signup and view all the flashcards

Estimator of Errors

We don't use OLS to help show the unbiased Ness from all the estimates that we may get in 4.

Signup and view all the flashcards

Expectation is indpendent

E(XIY/E(X)

Signup and view all the flashcards

Conditional Expectation Function

CEF is found to denote (Y(i)|X(i)). Has function of X(i) because X is Random. With a treatment variable of D(i) its called the treatment effect has two values of (y|d=o). And (y|d=1).

Signup and view all the flashcards

Decomposition Proporty

Y can be split into 2 pieces= E(Y(i)|X(i))+ E(i). Has many facts+ is mean is independent with Xi or the err

Signup and view all the flashcards

Ceif Prediction Property or CEFS

Minums mean sqaured error or given in. Gives you best predicter with to reduce square loss.

Signup and view all the flashcards

Anova ANOVA

Unconditional variance + conditional expectancy+conditional variance = Variance decomposation with cef.

Signup and view all the flashcards

Linear Approx?

Linear regression is good to use always that is not under or as linear. Good rule with some problems. -Linear regression is a good app for many

Signup and view all the flashcards

Regression CEFS Thoerem?

tells us that xp provides the mini sqared of near approximation linear expression to CEFS

Signup and view all the flashcards

Regression?

Crank estimate into code but some desirable situations could bad also result.

Signup and view all the flashcards

Regression Atomicy?

Use this thereom to slice the regression and have us interpret singular co-efficients in mulitiple. Is family size with labor +random family, so =estimate that is causual on the labor supply

Signup and view all the flashcards

Study Notes

  • Causal Inference focuses on leveraging theory and institutional details to estimate the impact of events and choices on a specific outcome.
  • Humans have been interested in causality for a very long time

Causal Questions in Economics

  • Does a rooster's crow make the sun rise?
  • Does rain cause people to catch a cold?
  • Can higher drug use rates result in increased crime rates?
  • Does drinking salabat make colds or sore throats better?
  • Does offering eggs to Sta. Clara make the weather nicer?
  • Does massaging certain parts of the foot make internal organs healthier?
  • Do Covid-19 vaccines cause death when injected?
  • Do face shields reduce Covid-19 transmission?
  • Does increasing the minimum wage lead to less employment for low-skilled workers?
  • Can microfinance improve household income and reduce poverty?
  • Does raising taxes on cigarettes lower smoking rates?
  • Does using social media reduce mental health?
  • Are happier people more likely to volunteer or does volunteering improve happiness?
  • Does expanding health insurance coverage lower out-of-pocket expenditures?
  • Do greater investments in education increase wages?
  • Does using renewable energy subsidies lower carbon emissions?
  • Does corruption in public procurement raise infrastructure costs?
  • How do programs for fiscal stimulus affect rates of unemployment?
  • Is economic growth impacted by foreign direct investment (FDI)?
  • Did economic benefits come from martial law in Mindanao?
  • Does democracy promote economic development?

The Start of Causal Inference

  • John Snow's cholera outbreak work in the 1850's is considered the first instance of difference-in-difference
  • Sir Ronald A. Fisher's The Design of Experiments was released in 1935
  • Trygve Haavelmo's "The statistical implications of a system of simultaneous equations" was released in Econometrica 11(1): 1-12
  • Donald Rubin's "Estimating causal effects of treatments in randomized and nonrandomized studies," was released in the Journal of Educational Psychology 66(5): 688-701
  • Causal inference gained momentum and entered economics mainstream, gaining the name "Credibility Revolution
  • Correlation does not mean Causation; correlation between 2 variables does not imply causal relationship

Global Rice Consumption

  • Lack of correlation may not mean no causal relationship at play
  • Though not obvious, a causal relationship could exist
  • Those navigating choppy waters must endogenously turn rudder to offset wind and maintain a straight line
  • No clear correlation between a straight-line path and a sailor's carefully planned rudder movements
  • Since humans engage in optimal behavior, correlations do not reveal causal relationships, because human beings do not often act randomly
  • Endogeneity is everywhere and is very difficult to rule out
  • To remove endogeneity, experiments mimic the laboratory setting, however they're often infeasible, too costly, and involve ethical/moral dilemmas
  • Researcher is a passive actor in data generation when using non-experimental or observational data
  • From observational data, correlations do not reflect causal relationships because people choose the variable and are making optimal decisions
  • For correlation to measure a causal effect, it must measure a choice independent of potential outcomes

Economic Correlation

  • Economic theory states choices are endogenous, and therefore correlations between them and outcomes rarely represent a causal effect
  • True scientists do not collect evidence to prove their beliefs and this is propaganda not science
  • Scientific methodologies are methods for forming a particular kind of belief
  • Methodologies allow acceptance of unexpected answers, are process oriented, not outcome oriented, and without it, causal methodologies are unbelievable
  • Empirical analysis in the sciences depends on data to test theories and estimate relationships between variables

Models for Relationships

  • Models are developed to describe the relationship between variables and form testable hypotheses
  • These models can be translated into econometric models, which are then directly estimated using data
  • Comparative statistics are theoretical descriptions of causal effects in models
  • If one changes factor X ceteris paribus, and Y changes, one can infer that X causes Y
  • Confounds can come from other factors moving simultaneously, hindering estimates of X's causal impact on Y

Causality in Income

  • There's a correlation between people's choice of transportation and income but income has endogenous effects
  • Reverse causality shows people relying on slower transport may face higher commute times, limiting their job opportunities
  • The urban infrastructure quality is affecte by both transport choice and income (omitted variable bias)
  • Lower-income may reside in areas with high access to transport, high income may want more private use

Endogeneity Problems

  • Social media use may reduce mental health, or people prone to mental health may use it more (self-selection bias)
  • Volunteering may improve happiness, or happier people may volunteer more (self-selection bias)
  • Reverse causality may relate foreign direct investment (FDI) in economics and growth and economic development may cause democracy with other variables

Price Elasticity of Demand

  • Empirically estimating the price elasticity of demand faces complexities
  • A model can describe variable relationships and form testable hypotheses
  • Econometric models can be made through translation for using data and testing theories
  • By changing factor X ceteris paribus, and Y changes, one can reasonably infer that X causes Y in comparative statistics
  • Demand is theoretical/unobservable. So, the price-quantity values at various equilibrium points are instead observed.
  • Price-quantity pairs won't reflect the curves but connected points if there are simultaneous shifts, making them useless
  • From Economics 102, price elasticity of demand is ∈ =dlogQ/dlogP
  • In exogeneity, P is exogenous, and holds other factors fixed.

Measuring and Estimating Relationships

  • Completely independent changes in P from other factors of supply and demand are needed
  • A correlation between P and Q won't measure elasticity of demand
  • The relationship empirically can be done with logQd = α + δlogP + yX + u formula, α is the intercept, δ is the elasticity, and X is the vector of factors like other prices or consumer income, y is the coefficient (factors and Qd.)
  • Variation in prices must be independent of u (exogeneity). So, being correlated with some stuff captured/darkness
  • The study of Education has literature of school inputs on student achievement (Class and spending)
  • Student outputs use regression to explain amount of variables

Regression in Functions

  • Coleman et al. (1966) used sensitivity analysis by looking at results arising from many specifications (Robus checks.)
  • These studies failed and were reverse causations, omits variables bias
  • Perverse: Test Score bad if it is in small classes(control for demographics) The struggling children and small classes go together
  • School spending and outcomes is in areas of rich districts and large urban districts (where struggling minority students).
  • Class and pupil are linked; You cannot add spend or have small classes

Interpretation Issues

  • The weak foundation/causal of specification in text (J. D. Angrist & Pischke, 2010).
  • Leamed (1983) criticized previous research by Ehrlich (1975b, 1975a) for it punishment had a significant deterrent.
  • Ehrlich's findings sensitivity changes in functional or adds control (More data in 1960s.)

Reverse Causality

  • Ehrlich studies credible, but failed in source of rate variation
  • Murder rates have a two-way relationship w/ executions
  • Used 2 stage lease estimator

Understanding Correlations

  • Native regressions common(1960s-1980s).
  • The issues and no overreaching causal and there was a growing frustration of the state or empiric economics, Spurious with confounding
  • Given the key assumption held, results are not robust, data analysis
  • Leamer(1983) and urged empirical researches to take con out.
  • Different in non-experiment based on degree

Credibility Revolution

  • Definition: Economics that shifts from transform to simple relation(Good designs by simple).
  • Emphasis research mimics conditions( J. Angrist & Pischke, 2010).
  • Random controlled and experiments methods
  • Better data and less emphasis is econometric (More big data now and reduce causal.)
  • Olden days is obsessive over analysiss, more focus with less specification
  • Designs are credibility prima facie (J. D. Angirist & Pischke, 2010).
  • Good designs typically lead methods that are simpel with easy results.
  • Revolution was fueled by experimental designs that are experimental or quasi.
  • Many were costly, clever desings by people for assignemnts etc.

Seminal Papers

  • Card (1990) study 1980 Mariel Boatlift increased Miami's labor by %7 with skills
  • D. Agnist from Vietnam with a lottery as experiment in natural with veeran earnings
  • D. Angrist & Krueger 1991 did compulsory impacts

Nobel Memorial Prizes

  • Banerjee, Duflo, and Kremer won prizes(2019) and Card, Angrist, and Imbens(2021) and Acemoglu, Johnson, and Robinson (2024)
  • To be credible is to study and spell and research
  • Not like Haavelmo Econometrica, which physicist experiments is an essential appendix or economic theorist

Economic Research

  • The changing nature of studies, Hamemer mesh(2011).

External Validity

  • Experimental from quasi offers little beyond the question
  • Look evidence until a moral picture emerges
  • Economic theory in framework

Second Critique

  • It's focus to much on methodologicals
  • Without what can not know or depend, estimative

Third Critique

  • Economist Raj Chetty: They thin questionless with method
  • James Heckman the problem is to much is to the New Yorker
  • Use to trivial studies has causal or big

Basic Econometrics

  • A random variable uses to mapping space (for all outcomes that set) to use analysis in processes

Expected Values

  • Expected or expectations or random value with an average for a population

Expectation Properties

  • For all, expectation has variables which use
  • Population concept apply

Conditional Expections

  • For variable Y under variable x, use if Y has variables under X.
  • If indepdenthen E(YIX) = E(Y) because X has nothing.

Variance

  • Deviation of the value.

(x−x)n2

Variable Estimations

  • For population: y= Beta0+ Betal x + M.
  • models explicitly show form

Coefficent Beta

  • population impossible

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Use Quizgecko on...
Browser
Browser