Artificial Intelligence and Machine Learning

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

Which statement best defines machine learning?

A branch of AI focused on explicit programming of tasks.
A technology that processes information without the need for data.
A field in AI that studies algorithms that can learn from data and generalize. (correct)
A method for transferring human intelligence to computers.

What characterizes supervised learning?

It involves trial and error to generate data.
Data is provided in a structured form with labels. (correct)
It requires fixed outputs for every potential input.
Data is presented without any labels.

In which application is unsupervised learning utilized?

Self-driving cars making decisions using labeled scenarios.
Email spam filtering based on keywords.
Chat-GPT generating text without predefined labels. (correct)
Image classification using predefined categories.

What is a key feature of reinforcement learning?

It involves creating databases through trial and error. (D) Signup and view all the answers

What is a confusion matrix used for in machine learning?

To visualize the performance of a classification model. (B) Signup and view all the answers

Which approach involves finding the least error in predictions?

Using an error function to optimize model accuracy. (B) Signup and view all the answers

What is an example of the first widespread use of machine learning?

Email spam filters developed in the 1990s and 2000s. (C) Signup and view all the answers

What mathematical basis underpins the operations of computers in machine learning?

Statistics and mathematical functions. (C) Signup and view all the answers

What is the purpose of dropping the 'petal length' column during parameter selection?

To prevent overfitting in the model. (D) Signup and view all the answers

When splitting the dataset using train_test_split, which parameter controls the randomness of the shuffle?

random_state (D) Signup and view all the answers

What does the mean_absolute_percentage_error function measure in the context of a regression model?

The average percentage error of the predictions. (D) Signup and view all the answers

How does a decision tree determine which feature to split on at the root node?

Based on the feature with the most distinct classes. (C) Signup and view all the answers

What does reg.predict(X_test) do in the context of model training?

It predicts the target values for the test dataset. (B) Signup and view all the answers

Why might a model exhibit better error values on training data compared to test data?

The model is likely overfitting to the training data. (A) Signup and view all the answers

In machine learning classification tasks, what would be the most informative feature among the given options?

The fruit is 15cm long. (C) Signup and view all the answers

What is the primary goal when choosing a PC1-axis in PCA?

To maximize the variance of the projected points (A) Signup and view all the answers

What is meant by a 'binary tree' in the context of decision trees?

A tree with two branches for each node representing two categories. (C) Signup and view all the answers

What does it indicate if a set of datapoints has a high variance?

There is a large spread among the datapoints (A) Signup and view all the answers

Why is the factor (n-1) used when calculating variance?

To ensure an unbiased estimate of variance (A) Signup and view all the answers

What relationship does the angle between PC1 and another variable's axis indicate?

The importance of the variable in relation to PC1 (C) Signup and view all the answers

What does the term 'perpendicular' refer to in the context of PCA?

The relationship between two dimensions (C) Signup and view all the answers

In PCA, what is typically visualized to suggest input feature importance?

The angles between the principal components and the features (C) Signup and view all the answers

Which statement correctly describes the effect of iteratively rotating the PC1 axis in PCA?

It redistributes the importance of the original features (A) Signup and view all the answers

How is the projection of data points onto the PC1-axis optimized?

By minimizing distances of points to the PC1-axis (A) Signup and view all the answers

What does the term 'garbage in, garbage out' in data pre-processing refer to?

The need to ensure quality data before analysis. (A) Signup and view all the answers

In the context of linear regression, what is the primary goal when fitting the function y = kx + m?

To minimize the total error across all training data points. (D) Signup and view all the answers

What is the effect of overfitting a model?

The model will fit the training data too closely and may not generalize well. (B) Signup and view all the answers

What is Mean Square Error (MSE) used for in evaluating model performance?

To calculate the average of squared errors across test samples. (C) Signup and view all the answers

How do artificial neural networks differ from linear regression?

Neural networks can incorporate non-linear relationships between inputs and output. (D) Signup and view all the answers

What is the baseline approach when a value needs to be replaced or is missing in a dataset?

Replacing the value with the mean of the remaining features. (A) Signup and view all the answers

What distinguishes non-linear regression from linear regression?

Non-linear regression includes non-linear functions affecting the output. (D) Signup and view all the answers

When using the training/testing split of 80/20, what is the purpose of this approach?

To reserve a portion of data for testing the model's performance. (B) Signup and view all the answers

What is hyperparameter grid search used for in random forest models?

To find the best setting of parameters that maximizes test accuracy (B) Signup and view all the answers

What role does the validation set play in machine learning?

It is used for hyperparameter tuning during the model training (B) Signup and view all the answers

In logistic regression, how is the baseline established?

By finding the closest match to a sample from the training data (A) Signup and view all the answers

What does 'stratify' do in the context of splitting data into training and test sets?

Guarantees that both sets have an equal representation of each class (D) Signup and view all the answers

How can one reduce the risk of overfitting in a model?

By implementing PCA to focus on the most important features (D) Signup and view all the answers

What is one way to focus training on tricky classes in classification tasks?

Increasing the sample size for the more challenging classes (C) Signup and view all the answers

Why might a model perform well on test data but poorly on training data?

The model is overly complex and not generalized (B) Signup and view all the answers

What is one benefit of using data augmentation?

It helps create more balanced data sets by generating new samples (A) Signup and view all the answers

Flashcards are hidden until you start studying

Study Notes