Machine Learning Lecture Summaries

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is a primary disadvantage of using Artificial Neural Networks (ANNs)?

They are computationally inexpensive and require minimal training.
They always outperform simpler models in all situations.
They can be difficult to interpret, acting as a 'black box'. (correct)
They are highly interpretable and easy to understand.

In the context of ANNs, what is the function of weight factors between nodes?

They represent the explanatory variables.
They determine the flow of information through the network. (correct)
They add an intercept to the output of nodes.
They determine the error between predicted and actual values.

A neural network with two or more hidden layers is referred to as a:

Recurrent network.
Wide network.
Shallow network.
Deep network. (correct)

Which of the following is a typical loss function used for classification problems in ANNs?

Cross-entropy (log-loss). (A) Signup and view all the answers

What process is used to optimize weights in an ANN by minimizing the loss function?

Backpropagation. (B) Signup and view all the answers

Which preprocessing technique is essential for transforming categorical variables into a numerical format suitable for ANNs?

One-hot encoding. (C) Signup and view all the answers

What is the primary goal of using K-Fold Cross Validation when evaluating an ANN?

To get a more robust evaluation of the model's generalization performance. (C) Signup and view all the answers

What is the purpose of early stopping in the context of hyperparameter tuning for ANNs?

To prevent overfitting by halting training when test performance plateaus or declines. (C) Signup and view all the answers

What is the primary purpose of cross-validation when training a model?

To minimize the risk of overfitting by evaluating performance on different data combinations. (B) Signup and view all the answers

Which technique involves training multiple models sequentially, with each model correcting the errors of its predecessor?

Boosting (A) Signup and view all the answers

What is the primary advantage of using ensemble models, such as Random Forests?

They reduce bias and improve generalization by using multiple models from different sources of information. (C) Signup and view all the answers

Which of the following describes the random patching procedure in Random Forests?

Selecting a random subset of features for splits. (A) Signup and view all the answers

What does the hyperparameter `n_estimators` control in a Random Forest?

The number of decision trees included in the model. (B) Signup and view all the answers

What strategy is used in random forests to create multiple models to improve performance?

Bagging with random patching (A) Signup and view all the answers

What is the consequence of using too many decision trees in a random forest?

Diminishing returns in improved performance as computational cost rises. (C) Signup and view all the answers

What is the main difference between bagging and boosting?

Bagging combines diverse models in parallel, while boosting combines models sequentially that correct the errors of their predecessors. (A) Signup and view all the answers

What is the primary goal of model generalization in machine learning?

To develop a model that performs well on new, unseen data. (A) Signup and view all the answers

Which scenario describes a model that is underfitting the data?

A model that is too simple and misses important relationships in the data. (D) Signup and view all the answers

What is the primary purpose of splitting data into training and testing sets?

To evaluate how well the model generalizes to new data. (C) Signup and view all the answers

In the model development process, what is the typical sequence for a basic iterative approach?

Study phenomenon & clean data, discover of dates, explore connections, basic model train, and then evaluate model (C) Signup and view all the answers

What is a key difference between statistical and machine learning approaches to regression models?

Machine learning approaches make fewer assumptions about the data, but can result in less interpretable parameters. (D) Signup and view all the answers

Which type of geospatial data is represented by points, lines, and polygons?

Vector data (A) Signup and view all the answers

What is a primary disadvantage of using decision trees?

They are prone to overfitting and can be unstable with minor data changes. (C) Signup and view all the answers

Which of the following is a disadvantage of using Mercator projection?

It distorts areas, especially at higher latitudes. (B) Signup and view all the answers

What is the primary benefit of using Shapley Values in model predictions?

They provide fairly distributed contributions of features. (C) Signup and view all the answers

Which property of Shapley Values ensures that total contributions equal the model output?

Efficiency (B) Signup and view all the answers

What visualization tool effectively displays contributions of features for individual predictions?

Waterfall plot (B) Signup and view all the answers

In terms of speed and accuracy, how does SHAP compare to LIME?

Slower but more accurate (A) Signup and view all the answers

Which of the following methods measures how variations in model input affect output?

Sobol Global Sensitivity Analysis (A) Signup and view all the answers

What aspect does a Beeswarm plot visualize in relation to SHAP values?

Contributions of features across multiple data points (D) Signup and view all the answers

What is a major goal of using Explainable AI (XAI) methods?

To increase understandability and confidence (D) Signup and view all the answers

Which of the following is a model-specific technique used in visual explanations for CNNs?

Saliency maps (B) Signup and view all the answers

What are potential privacy risks associated with AI?

Inappropriate processing of sensitive data. (B) Signup and view all the answers

Which of the following represents a challenge that Large Language Models (LLMs) face?

Discrimination and spread of misinformation. (A) Signup and view all the answers

What is a recommended solution for addressing privacy issues in AI?

Representative training data. (D) Signup and view all the answers

In the context of AI and climate change, which application is NOT mentioned?

Enhancing food production. (C) Signup and view all the answers

What is a key challenge when implementing Explainable AI (XAI)?

Balancing explanatory power and model performance. (C) Signup and view all the answers

What role do post-hoc explanations play in AI models?

They enhance understanding of black-box models. (B) Signup and view all the answers

Which guideline is emphasized for Explainable AI in the context of audience needs?

Tailor statements to the audience's understanding. (D) Signup and view all the answers

What is a necessary next step for integrating XAI methodologies into systems?

Further integration of XAI into socio-technical systems. (B) Signup and view all the answers

What is a primary advantage of Gradient Boosted Trees (GBTs) over single decision trees?

They can learn complex non-linear relationships. (D) Signup and view all the answers

Which hyperparameter is NOT commonly associated with Gradient Boosted Trees?

sample_weight (B) Signup and view all the answers

What is a key difference between Random Forests and Boosting techniques?

Boosting focuses on correcting errors from previous predictors. (A) Signup and view all the answers

What do embeddings help to create from discrete data?

A continuous, lower-dimensional vector space. (D) Signup and view all the answers

What does semantic preservation in embeddings refer to?

Maintaining relationships between data elements. (D) Signup and view all the answers

Which condition is NOT required for establishing causality?

Presence of a third variable causing the same effect. (D) Signup and view all the answers

In the context of models, what do embeddings specifically make suitable for?

Algorithmic processing like classification and clustering. (D) Signup and view all the answers

Which property of embeddings is commonly measured using Euclidean distance or cosine similarity?

Semantic preservation. (B) Signup and view all the answers

Flashcards

Ensemble Model

Combining multiple models to make a more robust model

Random Forest

A type of ensemble model consisting of multiple decision trees

Bagging

A technique used in Random Forests to create multiple bootstrap datasets by randomly sampling with replacement

Random Patching

A technique used in Random Forests to randomly select features for splits in decision trees, adding diversity