Multiple Regression Analysis
20 Questions
0 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

The standard error of the coefficient of the regressor age is:

  • 1.96
  • 0.045 (correct)
  • 6.263
  • 22.187

The 95% confidence interval estimator for the coefficient of the regressor bachelor is:

  • [9.33,10.36] (correct)
  • [9.33,10.21]
  • [9.84, 10.50]
  • [9.58,10.10]

Which of the following can cause the usual OLS t statistics to be invalid, that is, not to have the t distribution under Ho?

  • Heteroscedasticity (correct)
  • Including an unimportant explanatory variable.
  • A sample correlation coefficient of 0.095 between two independent variables that are in the model.
  • All of the above.

Which of the following is not a Gauss-Markov assumption of the multiple linear regression model?

<p>The error term u has zero variance given any values of the explanatory variable (C)</p> Signup and view all the answers

The correct interpretation of the coefficient for the regressor bachelor in the regression is

<p>An individual with bachelor's degree is expected to earn US$ 9.84 more in average hourly earnings than individuals without bachelor's degree. (C)</p> Signup and view all the answers

Suppose you wish to estimate the effect of age in a worker's average hourly earnings. We believe that an increase in one year of age increases the average hourly earnings by a constant percentage. To estimate this relationship we should:

<p>Run a regression with age as independent variable and log of average hourly earnings as a dependent variable (A)</p> Signup and view all the answers

A low regression $R^2$ means that:

<p>There are other important factors that influence the dependent variable (A)</p> Signup and view all the answers

If multicollinearity is present in a model, you should:

<p>remove highly correlated predictors from the model. (A)</p> Signup and view all the answers

A Jarque-Bera Test was performed on the residuals of the regression and was presented in the OLS regression results table above. The conclusion of the test is

<p>We can reject the null hypothesis that the residuals are normally distributed at a significance level of 5% (A)</p> Signup and view all the answers

For a male individual with 36 years of age and bachelors degree the expected average hourly earnings is:

<p>$31.02 (C)</p> Signup and view all the answers

The correct interpretation of the coefficient for the regressor lcost in the regression is

<p>A 1% increase in the cost of a law school is predicted to increase in approximately 4.9% the median salary of new law school graduates. (C)</p> Signup and view all the answers

Omitted variable bias is a potential problem because it

<p>Prevents accurately estimating true marginal effects. (B)</p> Signup and view all the answers

In the Python OLS summary, we have information on the following OLS hypothesis test conducted $H_o: \beta_1 = \beta_2 = \beta_3 = \beta_4 = \beta_5 = 0$ vs $H_1$: at least one $\beta_k \neq 0$ for k = 1, ...,5 Based on the p-value for the F-Test in the Summary we can conclude that

<p>We can reject the null hypothesis that none of the variables have an impact in lsalary since the p-value is less than 5%. (A)</p> Signup and view all the answers

Which of the following is true with respect to the coefficient of the regressor LSAT at a significance level of 5%?

<p>The variable is not significant at a 5% level since the t-statistic is equals to 1.13. (C)</p> Signup and view all the answers

Which of the following is correct with respect to the fit of the regression

<p>If the adjusted $R^2$ of the regression is 0.823 whereas the adjusted $R^2$ of a regression which has faculty as an additional regressor is 0.849, then including an additional regressor has improved the fit of the regression. (D)</p> Signup and view all the answers

The picture below shows the Residual plot of regressing lsalary on lcost. What can be concluded from this diagnostic test?

<p>The fan shaped pattern of residuals suggest the presence of heteroskedasticity. (C)</p> Signup and view all the answers

Which of the following characteristics does a normally distributed variable exhibit?

<p>Skewness = 0 and Kurtosis = 3 (D)</p> Signup and view all the answers

A Breusch-Pagan heteroskedasticity test was conducted for the regression and the result was a statistic $F= 2.81$ with a p-value of $p=0.03$. The correct interpretation of this result is

<p>The null hypothesis of homoskedasticity can be rejected at the 5% significance level. (B)</p> Signup and view all the answers

Under which of the following, we say that errors are homoscedastic?

<p>If the conditional variance of the errors is constant (A)</p> Signup and view all the answers

Which of the following statements is correct about testing joint hypothesis:

<p>The 95% confident set for multiple coefficients is the set of values not rejected at the 5% confidence level by the F-Statistic for a test of joint hypothesis. (C)</p> Signup and view all the answers

Flashcards

Standard Error of a Coefficient

The standard error of a coefficient measures the accuracy of the coefficient estimate.

Confidence Interval

A range of values likely to contain the true population parameter with a certain level of confidence.

Heteroscedasticity

A violation where the variance of the error term is not constant across all observations.

Gauss-Markov Assumptions

Assumptions required for Ordinary Least Squares (OLS) estimators to be BLUE (Best Linear Unbiased Estimator).

Signup and view all the flashcards

Homoscedasticity

The error term has constant variance given any values of the explanatory variable.

Signup and view all the flashcards

Coefficient Interpretation

Holding other variables constant, the expected change in the dependent variable for a one-unit increase in the independent variable.

Signup and view all the flashcards

Log Transformation

Transforming a variable using logarithms to model percentage changes.

Signup and view all the flashcards

Regression R-squared

A measure of how well the independent variables explain the variation in the dependent variable.

Signup and view all the flashcards

Multicollinearity

High correlation between independent variables in a regression model.

Signup and view all the flashcards

Jarque-Bera Test

A test for normality of residuals in a regression model.

Signup and view all the flashcards

Predicted Value

An estimate of the dependent variable based on the regression equation and given values of independent variables.

Signup and view all the flashcards

Regression with Log Salary and Cost

log(salary) as dependent variable, cost as independent variable

Signup and view all the flashcards

Omitted Variable Bias

The bias that occurs when a relevant variable is excluded from the regression model.

Signup and view all the flashcards

F-test in Regression

A statistical test to determine if at least one independent variable is significant.

Signup and view all the flashcards

T-Statistic in Regression

A test statistic used to determine the significance of an individual coefficient in a regression model.

Signup and view all the flashcards

Adjusted R-squared

A modified version of R-squared that adjusts for the number of independent variables in the model.

Signup and view all the flashcards

Residual Plot

A visual tool to assess the assumption of homoscedasticity.

Signup and view all the flashcards

Normally Distributed Variable

Symmetric, bell-shaped distribution

Signup and view all the flashcards

Skewness

A measure of the asymmetry of a distribution.

Signup and view all the flashcards

Kurtosis

A measure of the 'tailedness' of a distribution.

Signup and view all the flashcards

Breusch-Pagan Test

A test for heteroskedasticity.

Signup and view all the flashcards

Homoskedastic Errors

The conditional variance of the error term is constant.

Signup and view all the flashcards

Joint Hypothesis Testing

Testing multiple hypotheses simultaneously.

Signup and view all the flashcards

Null Hypothesis in Regression

Slope coefficients are all zero.

Signup and view all the flashcards

Confidence Set for Multiple Coefficients

A range of values for multiple coefficients that are not rejected by Joint Hypothesis at a level of significance.

Signup and view all the flashcards

Cost impact on Salary

Expected change in median salary of new law school graduates when cost increases by 1%.

Signup and view all the flashcards

Accurate parameter estimates

OLS is unable to make an estimate.

Signup and view all the flashcards

Independent variables impact on dependent variable

Variable doesn't have an impact on salary.

Signup and view all the flashcards

R^2 of regression

Regressors explain the variation in median salary of new law school graduates.

Signup and view all the flashcards

residuals

Heteroskedasticity vs. homoskedasticity shape

Signup and view all the flashcards

Study Notes

  • This question paper consists of 20 mandatory questions, each worth 5 marks
  • Answers should be input into the Brightspace quiz section via the ProctorFree student portal.
  • A scientific calculator is allowed, but it cannot be programmable, have wireless capabilities, store textual information, or require a mains electricity supply.
  • A formula sheet and present value tables are provided in the appendix.

Information for questions 1 to 10

  • A multiple linear regression model was used to determine the average hourly earnings of workers.
  • The variables include: ahe, bachelor, female, and age.
  • ahe represents worker's average hourly earnings in US Dollars per hour.
  • bachelor is a dummy variable (1 for bachelor, 0 for non-bachelor).
  • female is a dummy variable (1 for female, 0 for male).
  • age represents worker's age in years.

Question 1

  • The standard error of the coefficient of the regressor age is 0.045.

Question 2

  • The 95% confidence interval estimator for the coefficient of the regressor bachelor is [9.33, 10.36].

Question 3

  • Heteroscedasticity can cause the usual OLS t statistics to be invalid.

Question 4

  • The error term u having zero variance given any values of the explanatory variable is not a Gauss-Markov assumption of the multiple linear regression model.

Question 5

  • The correct interpretation of the coefficient for the regressor bachelor is that an individual with a bachelor's degree is expected to earn US$ 9.84 more in average hourly earnings than individuals without a bachelor's degree.

Question 6

  • To estimate the effect of age on a worker's average hourly earnings, run a regression with age as the independent variable and the log of average hourly earnings as the dependent variable to see how an increase in one year of age increases the average hourly earnings by a constant percentage.

Question 7

  • A low regression R² implies that other important factors influence the dependent variable.

Question 8

  • If multicollinearity is present in a model, remove highly correlated predictors from the model.

Question 9

  • The Jarque-Bera test results suggest that the null hypothesis that the residuals are normally distributed can be rejected at a significance level of 5%.

Question 10

  • For a male individual with 36 years of age and a bachelor's degree, the expected average hourly earnings are approximately $31.02.

Information for questions 11 to 20

  • A multiple linear regression model was calculated to determine the logarithm of the median salary for new law school graduates.
  • The variables featured include: log(salary), LSAT, GPA, log(libvol), log(cost), rank, and faculty.
  • log(salary) represents the log of the median salary for new law school graduates (salary measured in $1000s).
  • LSAT represents the median LSAT score for the graduating class.
  • GPA represents the median college GPA for the class.
  • log(livbol) is The Log of the number of volumes in the law school library
  • log(cost) represents the log of the annual cost of attending law school (cost measured in $1000s).
  • rank represents the law school ranking (1 is the best).
  • faculty represents the number of faculty in the law school.

Question 11

  • A 1% increase in the cost of a law school is predicted to increase in approximately 4.9% the median salary of new law school graduates.

Question 12

  • Omitted variable bias is a potential problem because it prevents accurately estimating true marginal effects.

Question 13

  • Based on the p-value for the F-Test in the Summary, reject the null hypothesis that none of the variables have an impact in lsalary if the p-value is less than 5%.

Question 14

  • The variable is not significant at a 5% level since the t-statistic is 1.13.

Question 15

  • If the adjusted R² of the regression is 0.823 whereas the adjusted R² of a regression which has faculty as an additional regressor is 0.849, then including an additional regressor has improved the fit of the regression; therefore, including an additional regressor has improved the fit of the regression.

Question 16

  • The fan-shaped pattern of residuals suggest the presence of heteroskedasticity.

Question 17

  • A normally distributed variable has a skewness of 0 and a kurtosis of 3.

Question 18

  • The null hypothesis of homoskedasticity can be rejected at the 5% significance level.

Question 19

  • Errors are homoscedastic if the conditional variance of the errors is constant.

Question 20

  • The 95% confident set for multiple coefficients is the set of values not rejected at the 5% confidence level by the F-Statistic for a test of the joint hypothesis.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Description

This question paper focuses on multiple linear regression, specifically determining the average hourly earnings of workers. The variables considered are average hourly earnings, bachelor's degree attainment, gender, and age. It includes calculating confidence intervals and regression analysis.

More Like This

Linear Regression Analysis Quiz
10 questions
Linear Regression Analysis
16 questions

Linear Regression Analysis

TroubleFreeHippopotamus avatar
TroubleFreeHippopotamus
Use Quizgecko on...
Browser
Browser