Podcast
Questions and Answers
When using observational data to estimate causal effects, what is a primary concern that researchers must address?
When using observational data to estimate causal effects, what is a primary concern that researchers must address?
- Ensuring the data is collected in a laboratory setting.
- Confirming that all variables are measured with perfect accuracy.
- Identifying and addressing potential sources of confounding. (correct)
- Verifying that the sample size is sufficiently large.
Which of the following quasi-experimental methods is most suitable for evaluating the impact of a treatment by comparing outcomes before and after the treatment for both a treatment group and a control group?
Which of the following quasi-experimental methods is most suitable for evaluating the impact of a treatment by comparing outcomes before and after the treatment for both a treatment group and a control group?
- Difference-in-Differences (DID) (correct)
- Regression Discontinuity Design (RDD)
- Randomized Controlled Trials (RCT)
- Instrumental Variables (IV)
What is the main purpose of pre-lecture assignments, according to the instructor?
What is the main purpose of pre-lecture assignments, according to the instructor?
- To encourage rote memorization of key concepts.
- To prepare students for lectures and exam question answering by prompting comprehension and explanation. (correct)
- To ensure students can perfectly repeat definitions from the course material.
- To evaluate students' ability to find information online.
In the context of causal inference, what does the variable 'T' typically represent?
In the context of causal inference, what does the variable 'T' typically represent?
What type of research question is best addressed using Regression Discontinuity Design (RDD)?
What type of research question is best addressed using Regression Discontinuity Design (RDD)?
Which of the following scenarios would LEAST likely be suitable for applying a Randomized Controlled Trial (RCT) methodology?
Which of the following scenarios would LEAST likely be suitable for applying a Randomized Controlled Trial (RCT) methodology?
A researcher aims to study the causal effect of a carbon tax (T) on industrial emissions (Y). Which of the following BEST describes a limitation of using an RCT in this scenario?
A researcher aims to study the causal effect of a carbon tax (T) on industrial emissions (Y). Which of the following BEST describes a limitation of using an RCT in this scenario?
In what scenario is the application of administrative data MOST likely to mitigate the Hawthorne effect in research?
In what scenario is the application of administrative data MOST likely to mitigate the Hawthorne effect in research?
A researcher is interested in determining the causal impact of a specific military intervention (T) on long-term economic growth (Y) in affected countries. What is the primary challenge in using an RCT to answer this question?
A researcher is interested in determining the causal impact of a specific military intervention (T) on long-term economic growth (Y) in affected countries. What is the primary challenge in using an RCT to answer this question?
A company is considering implementing a new R&D subsidy program (T) to boost innovation (Y). What is the potential problem when using an RCT to assess the effectiveness of the subsidy at the market level?
A company is considering implementing a new R&D subsidy program (T) to boost innovation (Y). What is the potential problem when using an RCT to assess the effectiveness of the subsidy at the market level?
Which of the following research topics poses the MOST significant ethical challenge when considering the use of an RCT?
Which of the following research topics poses the MOST significant ethical challenge when considering the use of an RCT?
Suppose a government wants to evaluate the impact of a fiscal stimulus (T) on unemployment rates (Y). Which of the following factors would make it difficult to interpret the results of an RCT?
Suppose a government wants to evaluate the impact of a fiscal stimulus (T) on unemployment rates (Y). Which of the following factors would make it difficult to interpret the results of an RCT?
In the context of the Robert Taylor Homes project study, what was the primary reason some buildings were selected for demolition while others were left untouched?
In the context of the Robert Taylor Homes project study, what was the primary reason some buildings were selected for demolition while others were left untouched?
Why is the study of the Robert Taylor Homes project considered a quasi-experiment rather than a true randomized experiment?
Why is the study of the Robert Taylor Homes project considered a quasi-experiment rather than a true randomized experiment?
What is the 'treatment' in Chyn's study of the Robert Taylor Homes project?
What is the 'treatment' in Chyn's study of the Robert Taylor Homes project?
In the context of quasi-experimental research design, what is the purpose of 'balance tests' using pre-treatment covariates?
In the context of quasi-experimental research design, what is the purpose of 'balance tests' using pre-treatment covariates?
How does Chyn's study contribute differently to existing research on housing mobility and poverty?
How does Chyn's study contribute differently to existing research on housing mobility and poverty?
What data is needed to determine the changes to neighborhood poverty rates after families were relocated?
What data is needed to determine the changes to neighborhood poverty rates after families were relocated?
A researcher aims to study the impact of a new educational program by comparing students from two different schools. What might cause the study to be considered quasi-experimental rather than experimental?
A researcher aims to study the impact of a new educational program by comparing students from two different schools. What might cause the study to be considered quasi-experimental rather than experimental?
In a quasi-experimental study examining the impact of a new workplace policy, the researchers find significant differences in pre-existing conditions between the treatment and control groups. What is the most appropriate course of action?
In a quasi-experimental study examining the impact of a new workplace policy, the researchers find significant differences in pre-existing conditions between the treatment and control groups. What is the most appropriate course of action?
In a city with two residential areas (central city and suburb), what is the likely outcome in a free market, assuming the rich and the poor both work in the city center and dislike commuting?
In a city with two residential areas (central city and suburb), what is the likely outcome in a free market, assuming the rich and the poor both work in the city center and dislike commuting?
What is the primary focus of the study as it relates to the young adults that were displaced from their homes?
What is the primary focus of the study as it relates to the young adults that were displaced from their homes?
What are the factors that lead to segregation in a model city according to the text?
What are the factors that lead to segregation in a model city according to the text?
According to the article, what are the key factors that a neighborhood can determine for an individual?
According to the article, what are the key factors that a neighborhood can determine for an individual?
What is the definition of 'neighborhood effects' in the context of socio-economic outcomes?
What is the definition of 'neighborhood effects' in the context of socio-economic outcomes?
Suppose a city is highly segregated by income. What challenge arises when trying to estimate the impact of living near high-income families using only observational data?
Suppose a city is highly segregated by income. What challenge arises when trying to estimate the impact of living near high-income families using only observational data?
Considering a 'free market' city model versus a 'social mixing' model, which factor is most crucial in determining which city would be better for its citizens?
Considering a 'free market' city model versus a 'social mixing' model, which factor is most crucial in determining which city would be better for its citizens?
Imagine a city where neighborhood quality significantly affects access to education, job networks, and safety. How might this situation perpetuate income inequality?
Imagine a city where neighborhood quality significantly affects access to education, job networks, and safety. How might this situation perpetuate income inequality?
If policy makers aim to improve socio-economic outcomes in a city characterized by income segregation, which intervention would most directly address neighborhood effects?
If policy makers aim to improve socio-economic outcomes in a city characterized by income segregation, which intervention would most directly address neighborhood effects?
In the context of neighborhood effects, why is it important to consider both direct and indirect impacts on socio-economic outcomes?
In the context of neighborhood effects, why is it important to consider both direct and indirect impacts on socio-economic outcomes?
Flashcards
Observational Data
Observational Data
Using data that was not generated through a randomized experiment to estimate causal effects.
Quasi-Experiment
Quasi-Experiment
A research design that attempts to approximate the conditions of an experimental study, but without random assignment.
Causal Inference
Causal Inference
A method used to estimate the causal effect of a variable (T) on an outcome (Y).
Treatment Variable (T)
Treatment Variable (T)
Signup and view all the flashcards
Outcome Variable (Y)
Outcome Variable (Y)
Signup and view all the flashcards
Independent Variable (T)
Independent Variable (T)
Signup and view all the flashcards
Dependent Variable (Y)
Dependent Variable (Y)
Signup and view all the flashcards
Randomized Controlled Trials (RCTs)
Randomized Controlled Trials (RCTs)
Signup and view all the flashcards
Advantages of RCTs
Advantages of RCTs
Signup and view all the flashcards
Limitations of RCTs
Limitations of RCTs
Signup and view all the flashcards
Hawthorne/John Henry Effects
Hawthorne/John Henry Effects
Signup and view all the flashcards
Ethical Limitation Solutions
Ethical Limitation Solutions
Signup and view all the flashcards
Residential Area
Residential Area
Signup and view all the flashcards
Central City
Central City
Signup and view all the flashcards
Suburb
Suburb
Signup and view all the flashcards
Rich (Type 1) Households
Rich (Type 1) Households
Signup and view all the flashcards
Poor (Type 2) Households
Poor (Type 2) Households
Signup and view all the flashcards
Segregation
Segregation
Signup and view all the flashcards
Neighborhood Effects
Neighborhood Effects
Signup and view all the flashcards
Effects of Living Near High Income Families
Effects of Living Near High Income Families
Signup and view all the flashcards
Difficulty Estimating Effects with Observational Data
Difficulty Estimating Effects with Observational Data
Signup and view all the flashcards
Unobservable Differences
Unobservable Differences
Signup and view all the flashcards
Impact of Relocation
Impact of Relocation
Signup and view all the flashcards
Robert Taylor Homes
Robert Taylor Homes
Signup and view all the flashcards
Public Housing Demolition as Quasi-Experiment
Public Housing Demolition as Quasi-Experiment
Signup and view all the flashcards
"Naturally" Created Groups
"Naturally" Created Groups
Signup and view all the flashcards
Quasi-Experiment Analysis
Quasi-Experiment Analysis
Signup and view all the flashcards
Treatment (in Chyn's Study)
Treatment (in Chyn's Study)
Signup and view all the flashcards
Pre-Treatment Covariate Balance
Pre-Treatment Covariate Balance
Signup and view all the flashcards
Research Design: Displaced vs. Non-Displaced
Research Design: Displaced vs. Non-Displaced
Signup and view all the flashcards
Study Notes
- Empirical analysis focuses on moving from experimental data to observational data for estimating causal affects
- You will become familiar with common quasi-experimental causal inference methods
- Instrumental Variables (IV)
- Regression Discontinuity Design (RDD)
- Difference-in-differences (DID)
- Designs based on controlling for observable differences are also covered
Homework 4 Outline
- Opens on Monday 3.2
- The deadline is in the last week of lectures (Feb 12)
- There is a practical homework assignment
- An exercise session with Lachlan on Thursday 6.2 is related
Homework 5 Outline
- Also known as the reading assignment
- Consists of a list of research papers
- Questions about the papers must be answered
- It will be due one week after the exam
Pre-lecture assignments
- Answers should show understanding with a few sentences each
- A good rule of thumb is to use a few sentences for each question
- Explanations of the concept or issue are important
- Short questions prepare for lectures and exams
Today's Lecture focuses on these topics:
- Limitations of Randomized experiments (RCTs)
- Observational data and the difficulty of making causal claims based on it
- Quasi-experiments
- Case study of studying neighborhood quasi-experimental effects.
Causal Inference Definitions
- Estimating variable T on outcome Y
- T = independent variable
- Y = the outcome variable
Causal Inference Examples
- Education (T) leads to earnings (Y)
- Marketing campaign (T) leads to sales (Y)
- Carbon tax (T) leads to emissions (Y)
- R&D subsidy (T) leads to innovation (Y)
- Fiscal stimulus (T) lead to unemployment (Y)
The Limits of RCTs
- Randomized experiments are often the best way to evaluate causal impacts of "treatments"
- Advantages:
- Simple and transparent to understand results
- Require fewer assumptions than alternative approaches
- Disadvantages: RCTs are not always feasible or desirable
- Too costly
- Unethical
- Unhelpful for historical or understanding market level questions
- Observational data and clever design can still allow the researcher to study causal questions.
Ethical & Practical Limitations of RCTs
- Experiments should not knowingly harm anyone
- Can pose an issue when trying to study the effect of potentially harmful things
- The relevant time horizon may be very long
- Lengthy experiments may be needed when studying the role of a given education on future labor market outcomes throughout adult life
- Meaningful experiments are sometimes very expensive
- Policy and business mistakes are costly even though not experiments
- Hawthorne and John Henry Effects = the evaluation itself can push people to change their behavior
- Less of a problem with administrative data and long follow-up periods
Examples of ethical limitations for RCTs
- Political propaganda on political views or beliefs
- Fertility on women's wages
- Exposure to violence, war, or pollution on human capital development
- Introducing random variation in these treatments would be unethical
- Alternative: find some exogenous factor:
- Introduces random variation in participation / exposure to war or violence or pollution.
Fundamental limitations of RCTs:
- Spillovers:
- occur when the treatment also affects the control group
- leads to issues inferring what would have happened without the treatment
- General equilibrium (GE) effects:
- GE effects may be the main value of some treatments
- RCTs never capture economy-wide GE effects
- Some examples of measuring more limited spillovers with RCTs exist.
- Scarcity of potential observations
- Some treatments affect entire countries or even the whole world and there will never be adequate designs for them
Observational data
- Researchers use observational data when experimental designs are infeasible
- Collected as part of normal functioning of society's institutions, and firms, including:
- Data from administrative records
- Surveys
- Sales Data
- Censuses
- Apps
- Observational studies draw inferences from a sample of a population
- The independent variable or "treatment" is not under the researcher's control
- In contrast with experiments, such as randomized controlled trials (RCTs)
- Each the subject is randomly assigned to a treatment or a control group
Observational data and Correlations
- Correlations found in observational data should be viewed with suspicion
- Most correlations are not linked causally
- Variables endogenously chosen by the involved people contribute to the choice
- The FLEED data previously provided by Statistics Finland is from education level of the people in FOLK data, and the results are optimization by those people
- Economic theory explains people's decisions are not random on average
- Causal effects are challenging to estimate
People Optimize
- T= education and Y= Earnings
- In the potential outcomes model the treatment must be completely independent to measure the effect
- Violations of the condition are caused by a person choosing a level of education for themselves
- Economic theory predicts that choices are endogenous, and thus naïve correlations are misleading
- Selection bias must be kept in mind
Observational alternatives to experiments
- Selection on observables: occurs when treatment and control groups differ from each other w.r.t. observable characteristics
- Multivariate regression and matching
- Selection on unobservables: occurs when treatment and control groups differ from each other in unobservable characteristics
- can occur if something unexpected happened in an “almost random” manner
- "Structural Approaches" can offer useful alternatives
Natural or Quasi Experiments
- Broad term for many different situations requiring different types of research methods
- IV
- RDD
- DID
- Researcher utilizes government policy or acts of nature that affect households in a way that resembles an experiment
- Results in control and treatment groups
- Historical episodes sometimes provide observable "pseudo" random variation in treatment status
- Might be caused by law changes that affect some people, but not others
Quasi Experiments
- Studying neighborhood effects with a quasi-experimental study
Segregation in a model city
- Assuming two residential areas with a fixed supply of housing
- Area 1: Central City - historic city center
- Area 2: Suburb - far away with lower quality amenities and less services
- Assuming two types of housesholds
- Type 1: Rich
- Type 2: Poor
- With both working in the city center and disliking commuting
Where will the rich end up living?
- With these assumptions and a free market, the city will be segregated by income
- Prices will be higher in the attractive city center
- Rich live in the city center, and poor live in the suburbs
- Segregation results from income inequality and optimizing behavior
- Resulting neighborhood effects can range
Neighborhood effects
- The neighborhood an individual grows up in can determine their peer group, education, role models and aspirations
- Poor neighborhoods have less opportunities than affluent neighborhoods
Observational data
- If we wanted to estimate the effect of living next to high income families
- Observing any effects from segregation will be difficult due to selection problems
Focusing on a Moving Low-income family
- Focus on one family moving from low to high income areas
- Neighborhood quality would increase
- The children could have different role models and peers
- Will the children in the family benefit from moving next to high income families?
Housing market mechanism and selection bias
- If living in richer neighborhoods affects nbd quality and peer groups (T) and/or children's education and future wages?
- Is children growing up in rich neighborhoods and doing better just correlation?
- Is that due to optimization behavior by the parents or a causal effect?
Selection into neighborhoods
- The same resources that affect the choice of what neighborhood to live in affect the child outcome directly
- How can the link between living in a rich treatment neighborhood to outcomes be isolated?
Controlling for observable differences?
- May be possible to focus for observable differences to improve insight
Similar People
- This would mean compare people who are:
- similar on observable and measurable characteristics,
- have the same initial income, level of education etc.,
- but live live in different quality neighborhoods
- If families are assumed to be similar, why did the families make different residential location choices?
- Low-income parents who make the effort to move to a higher quality nbd probably also use more of other resources in parenting - This assumes observably similar parents who remain in the poor neighborhood
Public housing demolition as a quasi-experiment
- What if we provided one low-income family the resources to move to the other residential area
- Neighborhood quality would increase
- The children would have different role models and peers
- Would the benefit if moved?
Chyn 2018 on Public Housing Demolitions in Chicago
- The study looks at demolitions in Chicago
- It focuses on the the long-Run Effects of Public Housing Demolition on Children
- This paper focuses on the long-run outcomes of children.
- It studies public housing demolitions in Chicago, which forced low-income households to relocate to less disadvantaged neighborhoods using housing vouchers.
- Specifically, it compares:
- displaced to their peers who lived in nearby public housing
- those that were not demolished
- Concludes that displaced children are more likely to:
- be employed
- earn more in young adulthood.
- Displaced children also have fewer violent crime arrests
- Children displaced at young ages have:
- lower high school dropout rates.
Quasi-experiment
- Studies the case of Chicago where:
- The housing authority reduced its stock of public housing during the 1990s
- Authority targeted buildings with poor maintenance for demolition
- Other buildings were nearby were left untouched
- Residents selected for demolition received Section 8 housing vouchers
- They were forced to relocate
- This policy created a treatment and a control group “naturally” or by accident
- The housing group was not planning to divide residents for research purposes
- Researcher was not involved in creating the groups
Chyn (2018 AER) Limitations
- The Chyn paper is a particular type of quasi-experiment
- It can be analyzed exactly as if it were a randomized experiment
The Paper
- Considers what study gives insights different from related research by focusing on treatment.
- The kind of observational data that is used must be presented
- Must check if the groups really look like they are randomized
- Must look if pre-treatment covariates must be balanced across groups (balance tests)
- Main results should be checked
- including Heterogeneity w.r.t sex and age etc.
Research Design
- Compares the the young adult outcomes of displaced and non-displaced children from the same public housing project
- Treatment = being displaced
Key assumption 1
- Demolition decision is unrelated tenants' characteristics for validity
- Important requirement: tenant selection mechanism did not allow households to self-select into buildings for this validity
- Occurs if households were assigned (as-good-as) randomly
- Caused by a surplus of waiting applicants where severe need for affordable housing means people chose the units they have
Key assumption 2
- Need for careful showing of households/children in control/treatment
- If they are similar observably/ nonobservably it is plaussible the researcher does not observe
- Results need to be tested for balancing
- Demolition should have have no effects on nondemolished
- Prior research on the same demolitions show it is violated because of crime
- Results are be biased toward zero (underestimates)
Chyn study Results
- The point estimate for the treat-control difference is 0.066
- In the treated (displaced) group
- The employment rate is 6.6 ppt higher
- The study also showed households often ended up in "better" ones than was possible
- Heterogenous effects are present and measured in sub groups sorted by:
- Sex
- Demolition age
Discussion 1
- Aims to estimate family's moving from poor neighborhood success
- Both Chetty et al. (MTO paper) and Chyn find that younger kids benefit more
Validity
- Internal validity: Do we learn the true effect for the treated population?
- are free of selection bias?
- External validity: Can we extrapolate to other populations?
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.