Machine Learning Session 1
21 Questions

Questions and Answers

What is the primary purpose of Gradient Descent in optimization?

  • To create irregular fluctuations in parameter updates
  • To achieve the global maximum of the cost function
  • To increase the cost function exponentially
  • To achieve the global minimum of the cost function (correct)

What effect does a small learning rate (α) have during the Gradient Descent process?

  • It causes faster convergence to the minimum
  • It leads to overshooting the minimum
  • It results in slow convergence towards the minimum (correct)
  • It prevents convergence entirely

Which cost function is commonly used in Gradient Descent?

  • Root Mean Squared Error (RMSE)
  • Mean Absolute Error (MAE)
  • Mean Squared Error (MSE) (correct)
  • Logarithmic Loss

What can happen if a large learning rate (α) is used in Gradient Descent?

  • It can lead to fluctuations around the minimum (correct)

What role do partial derivatives play in the Gradient Descent algorithm?

  • They determine the direction and magnitude of parameter updates (correct)

What is the primary goal of supervised learning in machine learning?

  • To make predictions using input-output pairs (correct)

What distinguishes multiple linear regression from simple linear regression?

  • Multiple linear regression uses two or more predictor variables. (correct)

What type of machine learning focuses on finding patterns in unlabeled data?

  • Unsupervised learning (correct)

In linear regression, what does the regression line represent?

  • The line of best fit minimizing prediction error (correct)

Which of the following applications is an example of supervised learning?

  • Spam detection (correct)

What is the main difference between classification and regression in machine learning?

  • Classification divides data into categories, while regression predicts continuous values. (correct)

What type of learning involves models getting feedback through rewards or penalties?

  • Reinforcement learning (correct)

Which statement best describes polynomial regression?

  • It fits a linear equation to complex datasets by using polynomial terms. (correct)

What does polynomial regression primarily allow researchers to do?

  • Model the relationship as a curve (correct)

How are polynomial regression models classified despite incorporating non-linear terms?

  • As linear in parameters (correct)

Which of the following evaluation metrics is the least sensitive to outliers?

  • Mean Absolute Error (MAE) (correct)

What does the Mean Squared Error (MSE) measure in regression analysis?

  • The average of squared differences (correct)

What is the primary advantage of using polynomial regression over linear regression?

  • Ability to model non-linear relationships (correct)

Which metric represents error in the same units as the original data?

  • Root Mean Squared Error (RMSE) (correct)

What is the purpose of a cost function in regression?

  • To measure the difference between predicted and actual values (correct)

In polynomial regression, what type of datasets is it particularly useful for?

  • Datasets with non-linear relationships (correct)

Flashcards

Mean Squared Error (MSE)

A cost function commonly used in Gradient Descent; it measures the average squared error between predicted and actual values.

Cost Function

A function that measures the quality of a machine learning model's predictions by quantifying the difference between the predicted and actual values.

Gradient Descent

A mathematical technique used to minimize the cost function by iteratively adjusting the model's parameters in the opposite direction of the gradient.

Learning Rate (α)

A small positive value that controls the step size of Gradient Descent in the opposite direction of the gradient.

Global Minimum

The lowest point on the cost function curve, where the error is minimized.

What is Machine Learning?

A branch of artificial intelligence (AI) that focuses on building systems that can learn from and make decisions based on data.

Supervised Learning

Models are trained on labeled data (input-output pairs), making predictions based on this training. This is used for classification (e.g., spam detection) and regression (e.g., predicting house prices).

Unsupervised Learning

Models learn from unlabeled data by finding hidden patterns or structures in input data. Examples include clustering (e.g., customer segmentation) and association (e.g., market basket analysis).

Reinforcement Learning

Models learn by interacting with an environment, receiving rewards or penalties. This is like teaching a robot to navigate a maze.

What is Linear Regression?

A type of supervised learning that predicts continuous numerical values (e.g., house prices) by fitting a line to data.

Simple Linear Regression

Uses one independent variable to predict a dependent variable. The relationship is modeled by a straight line.

Multiple Linear Regression

Extends simple linear regression by using two or more independent variables to predict a dependent variable.

What is the objective of Linear Regression?

The goal is to fit a line that minimizes the difference between predicted values and actual data points. This line represents the best fit for the data.

Polynomial Regression

A type of regression analysis where the relationship between the independent variable (X) and dependent variable (Y) is modeled as an n-degree polynomial. Unlike linear regression, which fits a straight line, polynomial regression fits a curve to the data.

Linear Model with Modifications

The overall model is still considered linear in the parameters even though the polynomial equation includes non-linear terms such as x^2 and x^3, because the coefficients (b0, b1, etc.) enter the equation linearly.

Non-Linear Dataset

Polynomial regression is often used when the relationship between the independent variable (X) and the dependent variable (Y) is non-linear. It allows the model to better fit the data by allowing for curves.

Root Mean Squared Error (RMSE)

The square root of MSE. Represents error in the same units as the original data, making it easier to interpret.

Mean Absolute Error (MAE)

Measures the average absolute difference between the actual and predicted values. Less sensitive to outliers compared to MSE, providing a more robust evaluation metric.

Evaluation Metrics

Metrics such as MSE, RMSE, and MAE quantify a model's prediction error, helping you evaluate performance and choose the best model for your prediction task.

Study Notes

Machine Learning Session 1

  • Machine Learning is a branch of Artificial Intelligence focused on building systems that learn from data rather than relying on explicit programming. ML models identify patterns in the input data and use them to make predictions or decisions.

Agenda

  • The agenda for the session covers:
    • Introduction to Machine Learning
    • Linear Regression
    • Polynomial Regression

Introduction to Machine Learning

  • Machine Learning is a subset of Artificial Intelligence (AI) that focuses on enabling computer systems to learn from data without explicit programming for a particular task.

Types of Machine Learning

  • Supervised Learning: Models are trained using labeled data (input-output pairs). Examples include:

    • Classification (e.g., spam detection)
    • Regression (e.g., predicting house prices).
  • Unsupervised Learning: Models learn from unlabeled data by identifying patterns and inherent structures. Examples include:

    • Clustering (e.g., customer segmentation)
    • Association (e.g., market basket analysis)
  • Reinforcement Learning: Models learn through interaction with an environment, receiving rewards or penalties for actions. Examples include teaching robots to navigate mazes.

Supervised vs. Unsupervised Learning

  • Supervised: Labeled data (input-output pairs) are used to train the model. A target variable (Y) exists.

  • Unsupervised: Unlabeled data is used to train the model. No target variable (Y) exists.

Regression vs. Classification

  • Regression: Predicts continuous numerical values by fitting a line or curve to the data (e.g., predicting house prices).

  • Classification: Divides data into discrete categories by applying a decision boundary (e.g., identifying spam emails vs. non-spam emails).

Linear Regression

  • Linear regression is a common predictive modeling technique used to model the relationship between one dependent variable and one or more independent variables. It fits a linear equation to observed data.

Simple Linear Regression

  • Simple linear regression uses one independent variable to predict a dependent variable (see the fitting sketch below).
  • The relationship is modeled by a straight line:
    y = β₀ + β₁x + ε
    
    Where:
    • y is the dependent variable
    • x is the independent variable
    • β₀ is the y-intercept
    • β₁ is the slope coefficient
    • ε is the error term
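
As a minimal sketch of how such a line can be fit in practice, the Python snippet below estimates β₀ and β₁ by ordinary least squares with NumPy; the house-size and price values are made-up placeholder data, not from the lesson.

    import numpy as np

    # Hypothetical data: x = house size (m^2), y = price (in $1000s)
    x = np.array([50.0, 80.0, 100.0, 120.0, 150.0])
    y = np.array([150.0, 220.0, 260.0, 310.0, 400.0])

    # Design matrix [1, x] so the intercept beta_0 is estimated as well
    X = np.column_stack([np.ones_like(x), x])

    # Least-squares estimates of [beta_0, beta_1]
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    beta_0, beta_1 = beta
    print(f"fitted line: y = {beta_0:.2f} + {beta_1:.2f} * x")

    # Prediction for a new observation (e.g., a 110 m^2 house)
    print("predicted price:", beta_0 + beta_1 * 110.0)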

Multiple Linear Regression

  • Multiple linear regression extends simple linear regression by using two or more independent variables to predict one dependent variable.
     y = β₀ + β₁X₁ + β₂X₂ + ... + βₙXₙ + ε
    
    Where:
    • y is the dependent variable
    • X₁, X₂, ... Xₙ are the independent variables
    • β₀ is the y-intercept
    • β₁, β₂, ... βₙ are the coefficients for the respective independent variables
    • ε is the error term
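
The same least-squares approach extends to several predictors by adding columns to the design matrix. The sketch below assumes two illustrative features (size and number of bedrooms); the numbers are placeholders.

    import numpy as np

    # Hypothetical features: [size (m^2), bedrooms]; target y = price (in $1000s)
    X_raw = np.array([
        [50.0, 1.0],
        [80.0, 2.0],
        [100.0, 2.0],
        [120.0, 3.0],
        [150.0, 4.0],
    ])
    y = np.array([150.0, 215.0, 255.0, 310.0, 405.0])

    # Prepend a column of ones for the intercept beta_0
    X = np.column_stack([np.ones(len(X_raw)), X_raw])

    # Least-squares estimates of [beta_0, beta_1, beta_2]
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    print("coefficients:", beta)

    # Prediction for a new house: 90 m^2 with 2 bedrooms
    print("prediction:", beta @ np.array([1.0, 90.0, 2.0]))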

Polynomial Regression

  • Polynomial regression models the relationship between the independent variable (x) and the dependent variable (y) as an n-degree polynomial. Unlike linear regression, it fits a curve to the data.
     y = β₀ + β₁x + β₂x² + ... + βₙxⁿ + ε
    
    Where:
    • y is the dependent variable
    • x is the independent variable
    • β₀, β₁, β₂, ... βₙ are the coefficients
    • n is the degree of the polynomial
    • ε is the error term
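
Because the model stays linear in its coefficients, a polynomial fit can be obtained by expanding x into powers (1, x, x², ..., xⁿ) and reusing ordinary least squares, as in the sketch below; the data values and the choice of degree 2 are assumptions for illustration only.

    import numpy as np

    # Hypothetical data with a curved (non-linear) trend
    x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
    y = np.array([2.1, 4.8, 9.7, 17.2, 26.5, 38.0])

    degree = 2  # assumed degree; in practice chosen by validation

    # Design matrix with columns [1, x, x^2, ..., x^degree]
    X = np.vander(x, N=degree + 1, increasing=True)

    # Ordinary least squares on the expanded features
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    print("coefficients beta_0..beta_n:", beta)

    # Fitted curve values at the training points
    print("fitted values:", X @ beta)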

Key Characteristics of Polynomial Models

  • Polynomial models can capture non-linear relationships, improving accuracy over models that only fit straight lines to data.

Cost Function

  • The cost function (J(θ₀, θ₁)) measures the performance of a linear regression model by evaluating the difference between predicted values (hθ(x)) and actual values (y) using the Mean Squared Error (MSE) formula.

Mean Squared Error (MSE)

J(θ₀, θ₁) = 1/(2m) ∑ᵢ₌₁ᵐ (hθ(xᵢ) - yᵢ)²

Root Mean Squared Error (RMSE)

  • RMSE is calculated by taking the square root of the MSE. It expresses the error in the same units as the original data, making it easier to interpret.

Mean Absolute Error (MAE)

  • MAE measures the average absolute difference between the predicted and actual values. It is less sensitive to outliers than MSE, which makes it a more robust evaluation metric; a short sketch computing MSE, RMSE, and MAE follows below.
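
To make the three metrics concrete, the snippet below computes MSE, RMSE, and MAE from a pair of placeholder arrays; the actual and predicted values are invented solely to show the arithmetic.

    import numpy as np

    # Placeholder actual and predicted values
    y_true = np.array([3.0, 5.0, 7.5, 10.0])
    y_pred = np.array([2.5, 5.5, 7.0, 11.0])

    errors = y_true - y_pred

    mse = np.mean(errors ** 2)       # average of the squared differences
    rmse = np.sqrt(mse)              # same units as the original data
    mae = np.mean(np.abs(errors))    # average absolute difference; less sensitive to outliers

    print(f"MSE={mse:.3f}  RMSE={rmse:.3f}  MAE={mae:.3f}")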

Gradient Descent

  • Gradient descent is an optimization algorithm used to minimize the cost function (e.g., MSE) in a linear regression model by iteratively adjusting the model's parameters in the direction of the steepest descent of the cost function.

Parameter Update Rules

  • Parameters are adjusted using specific rules based on the partial derivatives of the cost function with respect to each parameter, using the learning rate (α) to control the size of steps. 
    θ₀ = θ₀ - α * ∂J/∂θ₀
    θ₁ = θ₁ - α * ∂J/∂θ₁
    

Gradient Descent Iterative Process

  • The objective of Gradient Descent is to find the global minimum of the cost function through iterative parameter updates.
  • The learning rate (α) controls how quickly the algorithm converges: a value that is too large can cause instability or overshooting, while a value that is too small leads to slow convergence (see the gradient-descent sketch below).
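
Putting the update rules and the learning-rate discussion together, here is a minimal gradient-descent sketch for simple linear regression using the 1/(2m) MSE cost defined above; the data, learning rate, and iteration count are illustrative assumptions.

    import numpy as np

    # Hypothetical training data
    x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
    y = np.array([3.1, 5.0, 7.2, 8.9, 11.1])
    m = len(x)

    theta0, theta1 = 0.0, 0.0   # initial parameters
    alpha = 0.05                # assumed learning rate: too large diverges, too small converges slowly

    for _ in range(2000):
        error = (theta0 + theta1 * x) - y     # h_theta(x_i) - y_i

        # Partial derivatives of J(theta0, theta1) = 1/(2m) * sum(error^2)
        grad0 = np.sum(error) / m
        grad1 = np.sum(error * x) / m

        # Step in the opposite direction of the gradient
        theta0 -= alpha * grad0
        theta1 -= alpha * grad1

    cost = np.sum(((theta0 + theta1 * x) - y) ** 2) / (2 * m)
    print(f"theta0={theta0:.3f}  theta1={theta1:.3f}  final cost={cost:.4f}")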


Related Documents

Machine Learning Session 1 PDF

Description

Explore the foundations of Machine Learning in this session. Learn about supervised and unsupervised learning techniques, including linear and polynomial regression. This quiz will test your understanding of essential concepts in this exciting field.
