Regression Modelling in Machine Learning
48 Questions


Questions and Answers

What type of regularization does Ridge Regression employ?

  • Dropout regularization
  • Elastic Net regularization
  • L1 regularization
  • L2 regularization (correct)
Which statement is true about Lasso Regression?

  • It is not effective for feature selection.
  • It uses the square of weights as a penalty.
  • It can shrink some coefficients to exactly zero. (correct)
  • It always includes all features in the model.
What happens to the cost function of Ridge Regression as the value of λ approaches zero?

  • It becomes completely undefined.
  • It turns into the cost function of Lasso Regression.
  • It has no effect on the cost function.
  • It becomes similar to the cost function of linear regression. (correct)
    What is a primary use of Ridge Regression?

    To address high collinearity between independent variables.

    What distinguishes Lasso Regression from Ridge Regression concerning feature selection?

    Lasso Regression can eliminate certain features entirely.

    Which of the following statements is false regarding Ridge Regression?

    It can shrink coefficients to exactly zero.

    What is a characteristic of Lasso Regression in comparison to Ridge Regression?

    It provides a pathway to feature selection.

    Which regularization technique is more appropriate when the goal is to retain all features in the model?

    Ridge Regression is preferred.

    What is the primary benefit of Lasso regression?

    It helps in reducing overfitting and performs feature selection.

    Which method is computationally efficient for selecting a subset of predictors?

    Forward stepwise selection

    Which selection method begins with a model containing no predictors?

    Forward stepwise selection

    What is the main concept behind best subset selection?

    To fit a separate model for every possible subset of predictors.

    Principal component analysis is primarily used for which of the following?

    Transforming predictors to reduce dimensionality.

    Which method iteratively removes the least useful predictor from the model?

    Backward stepwise selection

    What type of variable selection approach does shrinkage represent?

    It controls variance by shrinking coefficients.

    What defines the major limitation of best subset selection?

    It is computationally infeasible for large predictor sets.

    What does the intercept of the regression line represent in this context?

    The predicted external marks when internal marks are zero

    Which step in the OLS algorithm involves squaring the differences of X?

    Step 5

    In the context of the regression equation M_ext = 19.04 + 1.89 × M_int, what does 'M_ext' represent?

    External marks of students

    What defines the maximum point on a curve according to the provided content?

    It has the highest y-coordinate and slope of zero

    What is the primary goal of the Ordinary Least Squares (OLS) method?

    To minimize the sum of the squares of the errors

    What effect does multicollinearity have on the standard errors of coefficients?

    It increases standard errors, making variables appear statistically insignificant

    Which of the following is NOT a step in calculating 'b' using the OLS algorithm?

    Get the sum of squared differences of Y

    What does a residual indicate in the context of regression analysis?

    The difference between predicted and actual points

    What does the Variance Inflation Factor (VIF) assess?

    The extent of linear relationships among the independent variables

    In multiple linear regression, what distinguishes it from simple linear regression?

    It uses multiple predictor variables

    What does the regression equation $y = a_0 + a_1x + ε$ represent in regression analysis?

    The relationship between a dependent variable and one or more independent variables

    Which assumption is violated when perfect multicollinearity is present?

    There is an exact linear relationship among independent variables

    What is the result of heteroskedasticity in regression analysis?

    Erroneous predictions due to changing variance of the error term

    Which type of regression uses only one independent variable?

    Simple Linear Regression

    What is the purpose of the linear regression coefficient $a_1$ in the equation?

    It determines the slope of the regression line

    In the context of linear regression, what does high bias indicate?

    Low accuracy of the model

    What is necessary for the OLS estimates to be effective?

    Independent variables must have sufficient variation

    Which regression technique combines multiple types of regression to help improve prediction accuracy?

    Elastic Net Regression

    What does the term ‘random error’ ($ε$) in the regression equation signify?

    Unforeseen variations affecting the dependent variable

    Which of the following accurately describes low variance in a model's predictions?

    Predicted values are consistent and close to each other

    What does the assumption about the number of observations and parameters in linear regression imply?

    The number of parameters must be less than the number of observations

    What is a characteristic feature of Logistic Regression compared to Linear Regression?

    It is used for categorical outcome prediction

    In regression analysis, what does the ‘intercept’ ($a_0$) represent?

    It is the predicted value when all independent variables are zero

    How does Stepwise Regression function in the context of model building?

    It adds or removes predictors based on their statistical significance

    What happens when the number of observations (n) is not much larger than the number of parameters (k)?

    This may lead to overfitting and poor predictions.

    Under which condition is linear regression not usable?

    When k > n

    What does regularization aim to achieve in a machine learning model?

    Prevent overfitting by adding extra information

    In linear regression, what is the purpose of the residual sum of squares (RSS)?

    To optimize the parameters for accurate value prediction

    What happens to the magnitude of the feature coefficients in a regularization technique?

    It is reduced toward zero.

    What is a critical factor in ensuring the least squares estimates perform well?

    The number of observations (n) must be significantly larger than the number of parameters (k).

    What does adding a complexity term in regularization help to address?

    It helps prevent overfitting.

    Which of the following methods can improve the accuracy of linear regression?

    Shrinkage Approach

    Study Notes

    Regression Modelling

    • Regression in machine learning uses mathematical methods to predict a continuous outcome (y) based on predictor variables (x).
    • Linear regression is a popular method due to its ease of use in predicting and forecasting.
    • Linear regression models show a linear relationship between the dependent (y) and one or more independent (x) variables.
    • The mathematical representation of linear regression is: y = a0 + a1x + ε
      • a0: intercept of the line
      • a1: linear regression coefficient (scaling factor for each input)
      • ε: random error
    • The observed (x, y) pairs form the training dataset used to fit the model.

    Types of Linear Regression

    • Simple linear regression utilizes a single predictor variable to predict a numerical dependent variable.
    • Multiple linear regression employs multiple predictor variables to predict a numerical dependent variable.

    Common Regression Algorithms

    • Simple linear regression
    • Multiple linear regression
    • Polynomial regression
    • Multivariate adaptive regression splines
    • Logistic regression
    • Maximum likelihood estimation (equivalent to least squares under normally distributed errors)

    Simple Linear Regression

    • This is the simplest regression model, involving only one predictor.
    • It assumes a linear relationship between the dependent variable and the predictor variable.
    • The equation for simple linear regression is: Y = a + bX, where
      • a: y-intercept
      • b: slope of the line
    • The slope (b) represents how much the line changes vertically for a one-unit change horizontally.
    • The y-intercept (a) represents the value of Y when X = 0
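    The closed-form fit above can be sketched in a few lines of pure Python; the data points below are made up for illustration.

```python
# Closed-form OLS fit for Y = a + bX with a single predictor.
# Illustrative (made-up) training data:
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 4.0, 6.2, 7.9, 10.1]

n = len(xs)
x_mean = sum(xs) / n
y_mean = sum(ys) / n

# Slope b = sum((x - x_mean)(y - y_mean)) / sum((x - x_mean)^2)
b = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / \
    sum((x - x_mean) ** 2 for x in xs)

# Intercept a = y_mean - b * x_mean (the predicted Y when X = 0)
a = y_mean - b * x_mean
print(a, b)
```

    The slope works out to 1.99 and the intercept to 0.09 for these points, matching the "change in Y over change in X" definition below.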

    Slope of a Simple Linear Regression Model

    • The slope represents the change in the vertical direction (y-axis) over a change in the horizontal direction (x-axis).
    • Slope = Change in Y / Change in X
    • Slope can be positive or negative depending on the relationship between the variables.

    Types of Slopes

    • Positive slope: The line moves upward from left to right.
    • Negative slope: The line moves downward from left to right.
    • Curvilinear positive: The line curves upward.
    • Curvilinear negative: The line curves downward.
    • No relationship: The points do not exhibit any linear or curved relationship.

    Error in Simple Regression

    • The regression equation may not always accurately represent the expected values.
    • An error value (ϵ) represents any deviation between predicted and actual values.
    • This deviation is referred to as the marginal or residual error.

    Maximum and Minimum Points of Curves

    • Maximum points on a curve exhibit the highest y-coordinate and a slope of 0.
    • Minimum points on a curve exhibit the lowest y-coordinate and a slope of 0.
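    The "slope of 0 at an extremum" property can be checked numerically; the parabola below is a made-up example with its maximum at x = 2.

```python
# Numeric check that the slope is ~0 at a maximum.
# f(x) = -(x - 2)^2 + 5 peaks at x = 2 with maximum value 5.
def f(x):
    return -(x - 2) ** 2 + 5

def slope(x, h=1e-6):
    # Central-difference estimate of the derivative (exact for quadratics).
    return (f(x + h) - f(x - h)) / (2 * h)

print(slope(2.0))  # slope at the maximum: 0
print(slope(0.0))  # left of the peak the curve is still rising (slope 4)
```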

    Multiple Linear Regressions

    • Simple linear regression uses a single predictor variable.
    • In multiple linear regression, more than one predictor variable impacts the response variable.
    • The equation for multiple linear regression: Y = β0 + β1X1 + β2X2 + ... + βnXn + e, where:
      • Y: Output/Response variable
      • β0, β1, β2, ..., βn: Coefficients of the model
      • X1, X2, X3, ..., Xn: Various independent variables
      • e: Error term
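    A minimal sketch of fitting this equation with NumPy's least-squares solver, on synthetic data generated from known coefficients so the recovery is visible:

```python
import numpy as np

# Least-squares fit of Y = b0 + b1*X1 + b2*X2 + e on toy data.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))            # two predictors X1, X2
beta_true = np.array([1.0, 2.0, -3.0])   # b0, b1, b2 (chosen for the demo)
y = beta_true[0] + X @ beta_true[1:] + 0.01 * rng.normal(size=100)

# Prepend a column of ones so the intercept b0 is estimated too.
X_design = np.column_stack([np.ones(len(X)), X])
beta_hat, *_ = np.linalg.lstsq(X_design, y, rcond=None)
print(beta_hat)  # close to [1, 2, -3]
```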

    Assumptions for Multiple Linear Regression

    • Linear relationship between target and predictors.
    • Normally distributed residuals.
    • Little to no multicollinearity (correlation between independent variables).

    Improving Accuracy of the Linear Regression Model

    • Accuracy refers to how close an estimate is to the actual value.
    • Prediction quality refers to how consistently the model estimates new values.
    • High bias = low accuracy (predicted values are far from the real values).
    • High variance = poor prediction (predicted values are widely scattered).
    • Low bias = high accuracy (predicted values are close to the real values).
    • Low variance = good prediction (predicted values are close together).

    Shrinkage (Regularization) Approach

    • Prevents overfitting by adding extra information (a penalty) to the model.
    • Sometimes a model performs well on training data but poorly on unseen data.
      • This is caused by the model fitting noise in the training outputs (overfitting).
    • Regularization reduces the magnitude of the coefficients, which helps the model generalize well.

    How does Regularization Work?

    • Adds a penalty term (complexity term) to the model's cost function.
    • Models aim to minimize the cost function.
    • Two major types of regularization techniques are Ridge Regression and Lasso Regression.

    Ridge Regression

    • A type of linear regression in which a small amount of bias is introduced to obtain better predictions on unseen data.
    • It reduces model complexity by regularizing coefficients.
    • It's also known as L2 regularization.
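    A minimal sketch of ridge regression via its closed form, beta = (XᵀX + λI)⁻¹Xᵀy, on made-up data; as λ → 0 it reduces to ordinary least squares, and larger λ shrinks the coefficients.

```python
import numpy as np

# Ridge (L2) regression via the closed-form solution.
rng = np.random.default_rng(1)
X = rng.normal(size=(50, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + 0.1 * rng.normal(size=50)

def ridge(X, y, lam):
    # Solve (X^T X + lam * I) beta = X^T y
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

beta_ols = ridge(X, y, 0.0)     # lam = 0: plain least squares
beta_ridge = ridge(X, y, 10.0)  # lam > 0: coefficients shrunk toward zero
print(beta_ols, beta_ridge)
```

    Note that ridge shrinks coefficients toward zero but does not set them exactly to zero, which is the key contrast with lasso below.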

    Lasso Regression

    • Another regularization technique that aims to reduce model complexity by reducing coefficient magnitudes.
    • It's a technique similar to ridge regression, differing in the penalty term, which uses the absolute values of the coefficients rather than their squares.
    • It's also known as L1 regularization.
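    The absolute-value penalty is what gives lasso its feature-selection behavior: optimizing it leads to the soft-thresholding operator, which zeroes out small coefficients exactly. A sketch with made-up coefficient values:

```python
import numpy as np

# Soft-thresholding: the building block of lasso coordinate descent.
# Coefficients with magnitude below lam become exactly zero.
def soft_threshold(z, lam):
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)

coeffs = np.array([3.0, -0.4, 0.05, -2.0])
print(soft_threshold(coeffs, 0.5))  # [2.5, 0, 0, -1.5]
```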

    Subset Selection

    • Identifying a subset of predictors related to the response and fitting the model using only that subset.
    • Two types of subset selection:
      • Best subset selection considers all possible subsets.
      • Stepwise subset selection: iteratively adds/removes predictors to find the best subset using forward/backward selection.
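    Forward stepwise selection can be sketched as a greedy loop: start with no predictors and repeatedly add the one that most reduces the residual sum of squares (RSS). The data below are synthetic, with only columns 1 and 4 actually related to the response.

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(80, 5))
y = 3.0 * X[:, 1] - 2.0 * X[:, 4] + 0.1 * rng.normal(size=80)

def rss(cols):
    # RSS of the least-squares fit using only the given columns.
    if not cols:
        return float(np.sum(y ** 2))
    Xs = X[:, cols]
    beta, *_ = np.linalg.lstsq(Xs, y, rcond=None)
    return float(np.sum((y - Xs @ beta) ** 2))

selected = []
for _ in range(2):  # add two predictors, one at a time
    remaining = [j for j in range(X.shape[1]) if j not in selected]
    best = min(remaining, key=lambda j: rss(selected + [j]))
    selected.append(best)

print(sorted(selected))  # the truly relevant predictors
```

    Best subset selection would instead evaluate all 2^5 subsets, which is why it becomes infeasible for large predictor sets.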

    Dimensionality Reduction

    • A technique where predictor variables are transformed to reduce the number of variables.
    • Principal component analysis (PCA) is a primary dimensionality reduction method.
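    A minimal PCA sketch via the singular value decomposition: center the data, then project onto the top k principal components to reduce from 4 predictors to 2.

```python
import numpy as np

rng = np.random.default_rng(3)
X = rng.normal(size=(100, 4))  # made-up data: 100 observations, 4 predictors

Xc = X - X.mean(axis=0)                  # center each predictor
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
k = 2
Z = Xc @ Vt[:k].T                        # scores on the first k components
print(Z.shape)                           # reduced to (100, 2)
```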

    Elastic Net Regression

    • Combines the lasso and ridge penalties to improve regularization and address each technique's shortcomings in controlling overfitting.
    • Elastic net addresses a limitation of lasso, which saturates after selecting a limited number of variables, by allowing more variables to enter the model.
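    The combined objective can be written as RSS plus a blend of the L1 and L2 penalties. A sketch using one common parameterization, where λ sets the overall strength and α mixes lasso (α = 1) and ridge (α = 0); note that some libraries (e.g. glmnet) scale the L2 term by an extra ½, omitted here.

```python
import numpy as np

def elastic_net_cost(beta, X, y, lam, alpha):
    # RSS + lam * (alpha * L1 penalty + (1 - alpha) * L2 penalty)
    rss = np.sum((y - X @ beta) ** 2)
    penalty = lam * (alpha * np.sum(np.abs(beta)) +
                     (1 - alpha) * np.sum(beta ** 2))
    return rss + penalty

# Tiny made-up example where the fit is exact, so the cost is pure penalty.
X = np.array([[1.0, 0.0], [0.0, 1.0]])
y = np.array([1.0, 2.0])
beta = np.array([1.0, 2.0])
print(elastic_net_cost(beta, X, y, lam=1.0, alpha=0.5))  # 4.0
```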


    Related Documents

    Regression Modelling PDF

    Description

    This quiz covers the fundamentals of regression modelling, focusing on linear regression techniques in machine learning. It discusses both simple and multiple linear regression, providing a mathematical foundation and common algorithms used in these approaches. Test your knowledge on these essential concepts of predictive analytics.
