BMS2043 - Statistics and Data Analysis: Linear Regression Quiz

CongratulatoryIntelligence5915 avatar
CongratulatoryIntelligence5915
·
·
Download

Start Quiz

Study Flashcards

25 Questions

In statistical terms, what does a regression model represent?

A model for the association structure in the data

If more than two groups have non-normal data, which test is suitable for analysis?

Kruskal-Wallis test

Which statistical test is appropriate for quantitative normally distributed data?

Student’s t-test

What is the main purpose of linear regression analysis?

To identify the line that minimizes the sum of the squared differences between observed and predicted values

What crucial assumption does linear regression make about the errors in predictions?

A and B

In multiple linear regression, what does Y represent?

Dependent variable

When is it appropriate to use a one-tail P value?

When the difference between groups can only go in one direction based on previous data, physical limitations, or common sense

What is the first step in performing a statistical test?

Formulate null and alternative hypotheses

What is the last step in performing a statistical test?

Reject or accept null hypothesis

What does a regression coefficient 𝛽=0 indicate?

There is no association between the explanatory variable and the outcome variable

In a symmetric distribution, what percentage of cases fall within Mean ± 1 SD?

68.2%

When calculating a 95% confidence interval, what value is used for z𝛼?

1.96

In a one-tailed test, if the other group had ended up with the smaller mean, it would be considered statistically significant.

False

In a statistical test, the null hypothesis is always rejected if the P-value is less than the significance level.

False

When performing a statistical test, it is essential to obtain both the test statistic and the confidence interval for accurate interpretation.

False

In linear regression, a regression coefficient 𝛽=0 means that there is no association between the independent variable X and the dependent variable Y.

True

The standard error (SE) of 𝛽 measures how accurately the model estimates the unknown 𝛽 and is unaffected by sample size.

False

A 95% confidence interval (CI) for a point estimate 𝛽, calculated as 𝛽 ± 1.96*SE(𝛽), means that there is a 95% chance that the true value of 𝛽 falls within this interval.

False

In symmetric distributions, Mean ± 3 SD includes approximately 99.7% of cases.

True

In a one-tailed test, the alternative hypothesis specifies a specific direction of the effect, while the null hypothesis does not.

True

When testing whether a new antibiotic impairs renal function, a left-sided one-tailed test is appropriate because it is hard to imagine a mechanism by which an antibiotic would increase the glomerular filtration rate.

True

The P-value of 0.004 indicates strong evidence against the null hypothesis.

True

The Mann-Whitney U-test is suitable for quantitative normally distributed data.

False

Linear regression models can only represent how a dependent variable depends on one independent variable.

False

A statistical model is a complex representation of reality, often involving multiple layers and variables.

False

Study Notes

  • The text is from a lecture on Statistics and Data Analysis (BMS2043) at the University of Surrey during Spring 2024.
  • Youngchan Kim, PhD, is the lecturer teaching inferential statistics, part 1.
  • Steps to perform a statistical test: formulate null and alternative hypotheses, evaluate data and choose an appropriate statistical test, perform the test, obtain test statistic and P-value, evaluate statistical significance, and accept or reject null hypothesis.
  • One-tailed test: appropriate when previous data, physical limitations, or common sense suggest the difference can only go in one direction. The alternative hypothesis must specify the predicted direction, and if the other group had a larger mean, it would be attributed to chance.
  • Inferential statistics covers correlation and associated P-value, test of frequencies, quantitative normally distributed data, quantitative non-normal data, more than 2 groups, and linear regression.
  • Linear regression: a method to model the relationship between a dependent variable and one or more independent variables. It identifies the line (regression model) that minimizes the sum of squared differences between observed and predicted values, using the least squares method.
  • Linear regression assumes a linear relationship, normally distributed errors, and homoscedasticity.
  • Multiple linear regression: models the relationship between a dependent variable and multiple independent variables. The goal is to estimate the intercept, regression coefficients, and their standard errors.
  • Confidence intervals: measure the precision of an estimate by calculating the range within which the true value most likely lies, based on the sample data. For example, a 95% confidence interval would include 95.4% of all cases within ±2 standard deviations of the mean.
  • One-tailed t-test: used when we have an a priori hypothesis about the direction of the difference. For example, a scientist wants to test if FTO gene variations increase BMI in Europeans.
  • In another example, an antibiotic's effect on serum creatinine is tested, assuming it either does not change or increases mean serum creatinine.

Test your knowledge of linear regression, including simple and multiple linear regression analysis techniques. Linear regression is used to create models that describe the relationship between dependent and independent variables.

Make Your Own Quizzes and Flashcards

Convert your notes into interactive study material.

Get started for free
Use Quizgecko on...
Browser
Browser