Linear and Bivariate Regression Overview

Questions and Answers

What is the formula for a linear function?

y = α + βx

In social sciences, what is the statistical linear model for a sample?

ŷ = a + bx OR y = a + bx + e

In social sciences, what is the statistical linear model for a population?

E(y) = α + βx OR y = α + βx + ε

The slope 'b' of the prediction equation is independent of the units of the dependent and independent variables.

False (B)
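A minimal sketch of why the unstandardized slope depends on units. The data and the helper name fit_line are made up for illustration; the same heights are expressed in metres and in centimetres, and only the slope's scale changes.

```python
def fit_line(xs, ys):
    """Least-squares intercept a and slope b for the line y-hat = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


# Hypothetical data: height (x) and weight in kg (y).
heights_m = [1.50, 1.60, 1.70, 1.80, 1.90]
weights_kg = [52.0, 58.0, 66.0, 71.0, 80.0]
heights_cm = [h * 100 for h in heights_m]  # same heights, different unit

_, slope_per_m = fit_line(heights_m, weights_kg)
_, slope_per_cm = fit_line(heights_cm, weights_kg)

# The slope per metre is 100 times the slope per centimetre.
print(slope_per_m, slope_per_cm)
```

The fitted relationship is the same in both cases; only the number used to describe it changes with the unit of x.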

What is the standardized slope 'b' equivalent to?

Pearson's correlation coefficient in bivariate regression.

What is the formula for the standardized slope 'b'?

r = (Sx/Sy)b
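As a quick check of this identity, the sketch below computes r twice on a small made-up data set: once by rescaling the slope b with the two sample standard deviations, and once directly from the sums of squares. The data and variable names are assumptions for illustration.

```python
import math

# Hypothetical sample data.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(xs)

mean_x = sum(xs) / n
mean_y = sum(ys) / n
sxx = sum((x - mean_x) ** 2 for x in xs)
syy = sum((y - mean_y) ** 2 for y in ys)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

b = sxy / sxx                      # unstandardized slope
s_x = math.sqrt(sxx / (n - 1))     # sample standard deviation of x
s_y = math.sqrt(syy / (n - 1))     # sample standard deviation of y

r_from_slope = (s_x / s_y) * b     # standardized slope
r_direct = sxy / math.sqrt(sxx * syy)  # Pearson's r from its definition

print(r_from_slope, r_direct)
```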

In linear regression, how can you decompose the variance in the dependent variable?

Into the regression sum of squares (SSR) and the error sum of squares (SSE).

What is the formula for the total sum of squares (TSS) in linear regression?

TSS = SSR + SSE

What does 'SSE' represent in linear regression?

The sum of the squared differences between the actual values of the dependent variable and the values predicted by the regression equation.

What is the formula for the coefficient of determination (r²)?

r² = 1 - SSE/TSS = (TSS - SSE)/TSS = SSR/TSS
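The decomposition and the three equivalent forms of r² can be verified numerically. This is a sketch on made-up data, with variable names chosen for illustration; it fits the least-squares line and then computes each sum of squares from its definition.

```python
# Hypothetical sample data.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(xs)

mean_x = sum(xs) / n
mean_y = sum(ys) / n
sxx = sum((x - mean_x) ** 2 for x in xs)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
b = sxy / sxx
a = mean_y - b * mean_x

predicted = [a + b * x for x in xs]

tss = sum((y - mean_y) ** 2 for y in ys)                   # total variation in y
ssr = sum((y_hat - mean_y) ** 2 for y_hat in predicted)    # explained by the line
sse = sum((y - y_hat) ** 2 for y, y_hat in zip(ys, predicted))  # left unexplained

r_squared = 1 - sse / tss

print(tss, ssr, sse, r_squared)
```

Note that TSS = SSR + SSE holds for a least-squares fit with an intercept; for an arbitrary line the cross term does not vanish.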

The F-test is the most commonly used method for evaluating specific explanatory variables in linear regression.

False (B)

In hypothesis testing, what is a test statistic used for?

To decide whether to reject or fail to reject the null hypothesis.

What is the formula for the test statistic for the b-coefficient?

t = (b - 0)/se

What does the standard error 'se' of the slope represent?

The variability of the estimates of the slope 'b' across repeated samples drawn from the population.

What is the formula for the standard error 'se' of the slope?

se = s / √(Σ(x - x̄)²)

What is the formula for the standard deviation of the residuals 's'?

s = √(SSE/(n - p - 1))

What are the degrees of freedom for the t-distribution used in linear regression?

n - p - 1
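The formulas for s, se, t, and the degrees of freedom chain together as in the sketch below. The data are made up, and p = 1 is assumed because there is a single explanatory variable in bivariate regression.

```python
import math

# Hypothetical sample data.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(xs)
p = 1  # number of explanatory variables (bivariate case)

mean_x = sum(xs) / n
mean_y = sum(ys) / n
sxx = sum((x - mean_x) ** 2 for x in xs)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
b = sxy / sxx
a = mean_y - b * mean_x

sse = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))

df = n - p - 1              # degrees of freedom for the t-distribution
s = math.sqrt(sse / df)     # standard deviation of the residuals
se = s / math.sqrt(sxx)     # standard error of the slope
t = (b - 0) / se            # test statistic for H0: beta = 0

print(df, s, se, t)
```

With these numbers t is about 2.12 on 3 degrees of freedom. The two-sided 5% critical value for 3 degrees of freedom is about 3.182, so this tiny sample would not lead to rejecting the null hypothesis.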

If the value of the test statistic is larger than the critical value, we reject the null hypothesis.

True (A)

The p-value represents the probability of obtaining results at least as extreme as those observed if the null hypothesis is true.

True (A)

If the p-value is less than the significance level alpha, we reject the null hypothesis.

True (A)

What are the three criteria for establishing a causal relationship between two variables?

There must be an association between the two variables, there must be an appropriate time order, and alternative explanations should be eliminated.

Experiments are the gold standard for establishing causality in social sciences because they allow for complete control over all variables.

False (B)

Randomization in a social science experiment aims to ensure that the treatment and control groups have similar distributions of all variables, including unobserved ones.

True (A)

Statistical control is used in observational studies to mimic the control provided by experiments.

True (A)

What is a spurious association in the context of three variables?

An association between two variables that disappears when a third variable is controlled for.
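One way to see a spurious association is through the first-order partial correlation, which is a standard formula but is not given in this lesson. In the sketch below the data are constructed by hand so that x and y both depend on z and on nothing else they share; all names and numbers are assumptions for illustration.

```python
import math


def pearson_r(us, vs):
    """Pearson's correlation coefficient between two equal-length lists."""
    n = len(us)
    mean_u = sum(us) / n
    mean_v = sum(vs) / n
    suv = sum((u - mean_u) * (v - mean_v) for u, v in zip(us, vs))
    suu = sum((u - mean_u) ** 2 for u in us)
    svv = sum((v - mean_v) ** 2 for v in vs)
    return suv / math.sqrt(suu * svv)


# Hypothetical data: z is a common cause of both x and y.
z = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0]
x = [0.5, -0.5, 0.5, -0.5, 2.5, 1.5, 2.5, 1.5]   # 2*z plus noise
y = [0.5, 0.5, -0.5, -0.5, 3.5, 3.5, 2.5, 2.5]   # 3*z plus unrelated noise

r_xy = pearson_r(x, y)
r_xz = pearson_r(x, z)
r_yz = pearson_r(y, z)

# Correlation between x and y after controlling for z.
partial_xy_z = (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

print(r_xy, partial_xy_z)
```

Here x and y are strongly correlated overall, yet the partial correlation is essentially zero once z is held constant, which is the pattern a spurious association produces.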

What is a chain/indirect relationship in the context of three variables?

A relationship between two variables mediated by a third variable.

In a multiple causes relationship with independent causes, the predictor variables have no relationship with each other.

True (A)

In multiple causes relationship with related causes, the predictor variables have a significant relationship with each other.

True (A)

What is a suppressor variable?

A variable that suppresses or hides the effect of another variable on a third variable.

What is partial regression?

A regression analysis that examines the relationship between two variables while controlling for the influence of other variables within subgroups.

Statistical interaction occurs when the effect of one independent variable on the dependent variable is influenced by another independent variable.

True (A)
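Interaction is commonly modelled by adding a product term, so that the prediction equation becomes ŷ = a + b1·x1 + b2·x2 + b3·x1·x2. The sketch below uses made-up coefficients (not estimates from any data) to show that the effect of x1 then depends on the value of x2.

```python
def predict(x1, x2, a=1.0, b1=2.0, b2=0.5, b3=1.5):
    """Prediction from a model with an interaction (product) term.

    The coefficients are hypothetical values chosen for illustration.
    """
    return a + b1 * x1 + b2 * x2 + b3 * x1 * x2


def effect_of_x1(x2):
    """Change in the prediction when x1 goes from 0 to 1, holding x2 fixed."""
    return predict(1.0, x2) - predict(0.0, x2)


effect_when_x2_is_0 = effect_of_x1(0.0)  # equals b1
effect_when_x2_is_1 = effect_of_x1(1.0)  # equals b1 + b3

print(effect_when_x2_is_0, effect_when_x2_is_1)
```

Without the product term (b3 = 0) the two effects would be identical, which is the case of no statistical interaction.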

Flashcards

Linear Function

A function represented by y = α + βx, a straight line with intercept α and slope β.

Statistical Linear Model

In social sciences, represented for a sample as ŷ = a + bx (the prediction equation) or y = a + bx + e (with residual e), combining sample statistics and error.

Slope (b)

Represents the rate of change in y for a unit change in x in a prediction equation.

Standardized b

A unitless measure indicating relationship strength in bivariate regression, equivalent to Pearson's correlation coefficient r.