Experiment Design for Data Science

Questions and Answers

In supervised learning, what is the primary goal?

  • To create a representative sample of the real world.
  • To identify the core assumptions in a dataset.
  • To discover new patterns in unlabeled data.
  • To approximate a function that maps observations to outcomes. (correct)

During the 'Refining the problem' stage of data science and design thinking, illustrated in the lecture, what is the main task?

  • Design choices. (correct)
  • Model building.
  • Study design.
  • Feature engineering.

In the context of evaluating models, what does the train/test paradigm aim to estimate?

  • Generalisation error. (correct)
  • Training time.
  • The dataset's size.
  • The model's complexity.

What is a key advantage of using the F1-score over accuracy in classification problems?

  • It is less sensitive to class imbalances. (correct)

In a classification problem, if a model incorrectly classifies a malignant tumor as benign, which type of error is this considered?

  • Type 2 error. (correct)

In regression analysis, what does a higher value of the coefficient of determination (R²) indicate?

  • A larger amount of variation in Y is explained by X. (correct)

What is a noted drawback of using Root Mean Squared Error (RMSE) as a performance metric?

  • It treats over- and under-predictions equally. (correct)

What is 'data leakage' in the context of model evaluation?

  • Duplicated observations ending up in both training and test sets. (correct)

When evaluating models, what issue does the use of resampling methods, such as cross-validation, primarily address?

  • Generalisation error. (correct)

Why is nested k-fold cross-validation used?

  • To tune hyperparameters. (correct)

What is the key characteristic of 'Monte Carlo Cross Validation'?

  • It generates train/test splits based on a random seed. (correct)

In the context of resampling methods, what is the purpose of sampling with replacement?

  • To create a training set. (correct)

When dealing with class imbalance, which of the following techniques should ONLY be applied to the training set?

  • Applying data augmentation techniques. (correct)

What is a potential argument for using a fixed random seed in machine learning experiments?

  • To ensure reproducibility. (correct)

In hypothesis testing, failing to reject the null hypothesis means:

  • There isn't enough evidence to reject the null hypothesis. (correct)

What is a key assumption of parametric statistical tests?

  • They make assumptions about the underlying distribution of the observations. (correct)

What does a 'one-tailed test' in statistical testing evaluate?

  • A difference in a specific direction, such as whether classifier 1 is better than classifier 2. (correct)

What statistical issue arises when conducting multiple comparisons, such as comparing a set of classifiers against each other?

  • Increased likelihood of making a Type 1 error. (correct)

What is the initial action to take when facing the multiple comparisons problem?

  • Avoid making too many comparisons. (correct)

What should machine learning experiments do to address violations of statistical test assumptions?

  • Use more relaxed tests. (correct)

What is the primary focus of factorial experiments?

  • Impact of multiple factors and their interaction on performance. (correct)

What is a key characteristic of A/B testing in online experiments?

  • It's a randomized controlled experiment comparing two variants of a system. (correct)

What does the null hypothesis state in A/B testing?

  • There's no difference in the performance metric. (correct)

In A/B testing, what does 'statistical power' refer to?

  • The probability of correctly rejecting the null hypothesis when it is false. (correct)

How does Multi-Armed Bandit (MAB) testing differ from traditional A/B testing?

  • MAB takes a more adaptive approach. (correct)

In the Epsilon-Greedy Algorithm, what is the 'balancing act'?

  • Balancing exploitation and exploration. (correct)

What does UCB mean in terms of MAB variations?

  • Upper Confidence Bound. (correct)

What features do contextual bandits use to help inform arm selection?

  • Features of the user or the environment. (correct)

What does reproducibility allow other researchers to do?

  • To verify and build upon our work. (correct)

In the context of scientific research, what does 'reproducibility' primarily refer to?

  • Obtaining the same results using the original data and code. (correct)

What characterises intrinsic interpretability?

  • Interpretability is a built-in property of the model itself. (correct)

What does Local feature importance focus on?

  • A single prediction. (correct)

Which models are easy to interpret?

  • Decision trees and nearest-neighbour models. (correct)

What is LIME?

  • It generates a new dataset around a single observation and trains an interpretable model on it. (correct)

In the context of machine learning interpretability, what do counterfactual explanations aim to provide?

  • An explanation of why the model made a specific prediction. (correct)

What is a potential disadvantage of using counterfactual explanations?

  • There are often multiple possible counterfactual explanations. (correct)

Flashcards

Train/Test Paradigm

Evaluating models on data not used for fitting to estimate generalization

Coefficient of determination (R²)

Proportion of variance in the dependent variable predictable from the independent variable.

Root Mean Squared Error (RMSE)

Square root of the Mean Squared Error; measures average deviation between predictions and target.

K-fold cross-validation

k-fold splits the data into k groups. Each group serves as test set in one round.

Nested cross-validation

Cross-validation run inside the training folds of an outer cross-validation, used to tune hyperparameters.

Simple bootstrap

Sampling N observations with replacement to create a training set

Class imbalance

When one class is much rarer than the others, e.g., spam making up less than 1% of e-mails.

Undersampling

Randomly sampling from the majority class to give it less importance.

Oversampling

Duplicating data from the minority class to give it more importance.

Factorial experiments

A structured way of measuring impact of multiple factors and their interaction on performance

Online experiments

Model is deployed, and you cannot afford to take it offline for evaluation

A/B testing hypothesis

Define the specific difference you believe your variations will cause.

Null Hypothesis (H0)

There's no difference in the performance metric.

Alternative Hypothesis (H1)

There is a difference in the performance metric.

Significance Level (alpha)

Acceptable level of risk for a false positive.

Statistical Power (1-beta)

Probability of correctly rejecting the null hypothesis when it is false.

Effect Size

The minimal difference you wish to detect.

MAB

Multi-Armed Bandit; an adaptive approach that shifts traffic towards better-performing variants during the experiment.

Reproducibility

Obtaining the same results using the original data and code, so other researchers can verify and build upon our work.

Replicability

Independent researchers obtaining consistent results with their own experiment or data.

Counterfactual explanations

Explain individual predictions by describing how the input would need to change for the prediction to change.

Study Notes

  • Lecture 7 focuses on the design and analysis of experiments for data science and machine learning

Supervised Learning as Function Approximation

  • Supervised learning involves approximating an ideal function (f*) that maps observations to targets
  • The goal is to find a function (f) within a modelable space (Fm) that best approximates f*
  • A core assumption in supervised learning is data representativeness of the real world

Data Science and Design Thinking

  • The data science process involves:
  • Exploring the problem, which includes study design and EDA
  • Refining the problem via design choices
  • Developing the model through model building and feature engineering
  • Interpreting and communicating the results through writing, plotting, and talking

The Train/Test Paradigm

  • Models are evaluated using data not used for fitting to estimate generalisation or out-of-sample error
  • Datasets are split into training and testing sets
  • For example, a dataset of 1,000 observations might use an 80/20 split, resulting in 800 training and 200 testing observations
  • The test set is used only for the final evaluation, as in the sketch below
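
A minimal sketch of the 80/20 split described above, using scikit-learn and a synthetic 1,000-row dataset (the dataset is only for illustration):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Toy dataset of 1,000 observations.
X, y = make_classification(n_samples=1000, random_state=42)

# 80/20 split: 800 training and 200 testing observations.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)
```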

Performance Measures in Classification

  • Accuracy is a measure that is easy to interpret but provides an overall rate without detail
  • F1-score is a measure that is less sensitive to class imbalances
  • A confusion matrix presents the error specifics; false positives and false negatives
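
As a small illustration of these measures, the snippet below computes accuracy, F1 and the confusion matrix on made-up labels:

```python
from sklearn.metrics import accuracy_score, confusion_matrix, f1_score

y_true = [1, 0, 1, 1, 0, 0, 1, 0]   # toy ground-truth labels
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]   # toy predictions

print(accuracy_score(y_true, y_pred))    # overall rate, no detail about error types
print(f1_score(y_true, y_pred))          # harmonic mean of precision and recall
print(confusion_matrix(y_true, y_pred))  # rows = true class, columns = predicted class
```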

Errors in Classification

  • Type 1 error is a false positive
  • Type 2 error is a false negative
  • Some problems have a higher cost for a type 1 or type 2 error
  • Cost-sensitive classification algorithms put extra weight on the mistakes you most want to avoid, as in the sketch below
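
One common way to weight mistakes is the `class_weight` argument offered by many scikit-learn classifiers; the 1:10 weighting and the synthetic data below are illustrative choices, not taken from the lecture:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Imbalanced toy data; class 1 plays the role of the costly class (e.g. malignant).
X, y = make_classification(n_samples=500, weights=[0.9, 0.1], random_state=0)

# Errors on class 1 are penalised 10x more during fitting, discouraging type 2 errors.
clf = LogisticRegression(class_weight={0: 1, 1: 10}, max_iter=1000).fit(X, y)
```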

Performance Measures in Regression

  • Coefficient of determination (R²) indicates the variation proportion in the dependent variable (Y) predictable from the independent variable (X)
  • R² closer to 1 indicates that a large amount of Y variation is explained by X
  • R² closer to 0 indicates that most of Y variation is not explained by X
  • Root Mean Squared Error (RMSE) measures the average deviation between predictions and target
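
A short sketch of both regression metrics on made-up predictions:

```python
import numpy as np
from sklearn.metrics import mean_squared_error, r2_score

y_true = np.array([3.0, 5.0, 7.0, 9.0])   # toy targets
y_pred = np.array([2.8, 5.3, 6.9, 9.4])   # toy predictions

r2 = r2_score(y_true, y_pred)                       # closer to 1 = more variation explained
rmse = np.sqrt(mean_squared_error(y_true, y_pred))  # average deviation, in the units of Y
```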

Drawbacks of RMSE

  • Applies the same penalty to over- and under-predictions
  • Sensitive to outliers
  • Not unit-free, so values are not comparable across datasets
  • Hides the distribution of errors

Pitfalls of Evaluation

  • Inputs need to be within the range of the observed data
  • Interpolation vs extrapolation problems
  • Data leakage can cause duplicated observations to appear in both training and test sets
  • Overfitting results from statistical noise being interpreted as a real pattern
  • Irrelevant models can result from biased sampling
  • Data labels produced by human annotators may suffer from disagreement between interpretations

Resampling Methods: Cross-Validation

  • K-fold cross-validation helps estimate generalisation error
  • The dataset is split into k partitions
  • Each partition is used once as a validation set while the remaining partitions form the training set
  • The performance is averaged across all k trials for a more stable estimate
  • "Leave One Out" validation is an extreme form of cross-validation, where k equals the number of observations
  • Nested k-fold cross-validation is used to tune hyperparameters
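
A sketch of plain and nested k-fold cross-validation with scikit-learn (the SVM and its hyperparameter grid are illustrative choices, not prescribed by the lecture):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, random_state=0)

# Plain k-fold: average performance over k = 5 held-out folds.
scores = cross_val_score(SVC(), X, y, cv=KFold(n_splits=5, shuffle=True, random_state=0))
print(scores.mean())

# Nested k-fold: the inner GridSearchCV tunes hyperparameters on the training folds,
# while the outer loop estimates the generalisation error of the whole procedure.
inner = GridSearchCV(SVC(), param_grid={"C": [0.1, 1, 10]}, cv=3)
nested_scores = cross_val_score(inner, X, y, cv=5)
print(nested_scores.mean())
```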

Resampling Methods: Monte Carlo Cross Validation

  • Monte Carlo Cross Validation is an alternative cross-validation approach
  • Generate train/test data split based on random seed
  • Perform evaluation and increment seed
  • Repeated hold-out is the same as Monte Carlo Cross Validation
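
A minimal repeated hold-out / Monte Carlo cross-validation sketch in which the seed is simply incremented between runs (20 repetitions is an arbitrary choice):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=0)

scores = []
for seed in range(20):  # repeat the hold-out, incrementing the seed each time
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=seed)
    model = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    scores.append(model.score(X_te, y_te))

print(sum(scores) / len(scores))  # averaged estimate of generalisation performance
```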

Resampling Methods: Simple Bootstrap

  • N observations are sampled with replacement to create a training set
  • Remaining observations are used for testing
  • Process is repeated 50 to 200 times
  • Can produce a confidence interval
  • Refined versions include the .632 and .632+ bootstrap estimators
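
A simple bootstrap sketch: sample N rows with replacement for training, test on the remaining (out-of-bag) rows, and repeat to form a confidence interval (200 repetitions sits at the upper end of the range above):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=300, random_state=0)
rng = np.random.default_rng(0)

scores = []
for _ in range(200):
    idx = rng.choice(len(X), size=len(X), replace=True)  # N observations, with replacement
    oob = np.setdiff1d(np.arange(len(X)), idx)           # remaining rows form the test set
    model = LogisticRegression(max_iter=1000).fit(X[idx], y[idx])
    scores.append(model.score(X[oob], y[oob]))

low, high = np.percentile(scores, [2.5, 97.5])           # simple 95% confidence interval
```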

Solving Class Imbalance Problem

  • Class imbalance is a common ML problem because the minority class has significant importance
  • Undersampling the majority class randomly samples to give less importance
  • Oversampling the minority class duplicates data to give it more importance
  • Cost-sensitive learning can give more weight on false positives/negatives
  • Data augmentation generates additional data by applying small random noise on the features
  • You should not manipulate the test set
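
A sketch of random oversampling applied to the training set only (the dataset and the 95/5 imbalance are synthetic; libraries such as imbalanced-learn provide ready-made versions of this):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, weights=[0.95, 0.05], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

# Duplicate minority-class rows until the classes are balanced -- training data only;
# the test set is left untouched.
rng = np.random.default_rng(0)
minority = np.where(y_tr == 1)[0]
n_extra = (y_tr == 0).sum() - len(minority)
extra = rng.choice(minority, size=n_extra, replace=True)

X_balanced = np.vstack([X_tr, X_tr[extra]])
y_balanced = np.concatenate([y_tr, y_tr[extra]])
```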

Practical Challenges in Experimentation

  • Data leakage must be avoided (test and training sets)
  • Expensive evaluation of nested cross-validation can be problematic
  • Machine learning relies a lot on randomness (e.g., random seed)

Historical Note: Student's T Test

  • William S. Gosset developed the t-test
  • Gosset was hired as the Head Experimental Brewer by Guinness and was not allowed to publish under his own name
  • Gosset published several papers under the pseudonym "Student"
  • Gosset worked under the guidance of Karl Pearson

Hypothesis Testing and Statistical Significance

  • Hypothesis testing determines the likelihood of performance differences being real versus due to chance
  • A p-value is the probability of observing results at least as extreme as those observed, assuming the null hypothesis is true
  • If the p-value falls below your predefined threshold (usually 0.05), there is enough evidence to reject the null hypothesis
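
As a hedged example, the snippet below runs a paired t-test comparing two classifiers' accuracies on the same ten folds (the scores are invented):

```python
from scipy import stats

# Accuracy of two classifiers on the same 10 cross-validation folds (toy numbers).
clf_a = [0.81, 0.79, 0.84, 0.80, 0.78, 0.83, 0.82, 0.80, 0.79, 0.81]
clf_b = [0.78, 0.77, 0.80, 0.79, 0.76, 0.80, 0.79, 0.77, 0.78, 0.79]

t_stat, p_value = stats.ttest_rel(clf_a, clf_b)  # paired, two-tailed by default
if p_value < 0.05:                               # predefined significance threshold
    print("Reject the null hypothesis of no difference")
```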

Parametric and Non-Parametric Tests

  • Parametric tests make assumptions about data distribution
  • Non-parametric tests make fewer assumptions

Statistical Tests

  • One-tailed tests are for a specific direction
  • Two-tailed tests are bidirectional
  • Paired observations come from evaluating both models on the same cross-validation folds
  • Independent samples compare measurements taken on different data

The Multiple Comparison Problem

  • Making multiple comparisons increases the risk of a Type 1 error
  • Correct for it with the Bonferroni correction
  • Avoid too many comparisons
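
The Bonferroni correction simply divides the significance threshold by the number of comparisons; a minimal sketch with made-up p-values:

```python
# With m comparisons, test each at alpha / m to keep the overall Type 1 error rate near alpha.
alpha, m = 0.05, 6                                 # e.g. all pairs among 4 classifiers
p_values = [0.004, 0.03, 0.20, 0.01, 0.60, 0.048]  # toy p-values
significant = [p < alpha / m for p in p_values]    # only 0.004 survives the correction
```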

Interpreting Results

  • It is more rigorous to use statistical significance testing
  • Replication studies are needed to confirm a hypothesis
  • There may be alternative explanations

Hypothesis Testing for Machine Learning

  • Machine learning experiments can violate the assumptions of standard hypothesis tests
  • It is better to use relaxed tests, such as the Wilcoxon signed-rank test for k-fold cross-validation
  • You can use McNemar's test if you cannot afford to cross-validate
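
A sketch of the Wilcoxon signed-rank test on per-fold scores (toy numbers); statsmodels also provides a McNemar test for the single-split case:

```python
from scipy import stats

# Per-fold scores for two models evaluated on the same k-fold splits (toy numbers).
model_a = [0.81, 0.79, 0.84, 0.80, 0.78, 0.83, 0.82, 0.80, 0.79, 0.81]
model_b = [0.78, 0.80, 0.80, 0.79, 0.76, 0.80, 0.79, 0.77, 0.78, 0.79]

stat, p = stats.wilcoxon(model_a, model_b)  # non-parametric paired test
print(p < 0.05)
```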

Factorial Experiments

  • Structured ways to measure impact of multiple factors and their interaction on performance
  • Full Factorial, Fractional Factorial, Plackett-Burman designs
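
A full factorial design is just the Cartesian product of all factor levels; the factor names and levels below are illustrative:

```python
from itertools import product

factors = {
    "learning_rate": [0.01, 0.1],
    "n_estimators": [100, 500],
    "max_depth": [3, 6],
}

# Every combination of levels: 2 x 2 x 2 = 8 experimental runs.
runs = [dict(zip(factors, levels)) for levels in product(*factors.values())]
print(len(runs))
```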

Online Experiments and A/B Testing

  • Online models are deployed and cannot be taken offline
  • A/B testing is a randomised controlled experiment that compares two variants of a system
  • Statistical tests quantify whether observed differences reflect real effects or random chance
  • G-test is used for yes/no metrics
  • Z-test is used for numerical targets

A/B Testing Steps

  • Define a specific hypothesis
  • Sample size determination
  • Traffic Allocation (control vs treatment groups)
  • Statistical Analysis using Chi-squared or t-tests
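
For a yes/no metric such as conversion, the analysis step can be a chi-squared test on the contingency table of outcomes; the counts below are invented:

```python
import numpy as np
from scipy.stats import chi2_contingency

# Conversions vs non-conversions for control (A) and treatment (B) -- toy counts.
table = np.array([[120, 880],    # A: 120 conversions out of 1,000 users
                  [150, 850]])   # B: 150 conversions out of 1,000 users

chi2, p, dof, expected = chi2_contingency(table)
if p < 0.05:
    print("The difference in conversion rate is statistically significant")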

A/B Testing Concepts

  • The null hypothesis states there is no difference in the performance metric
  • Aim for high statistical power, the probability of correctly rejecting a false null hypothesis
  • Effect size is the minimal difference you wish to detect, e.g., a percentage increase in conversion or a reduction in time
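
Significance level, power and effect size together determine how many observations each group needs. A rough sketch using the standard normal-approximation formula for two proportions (the 10% to 12% lift is an invented example):

```python
from scipy.stats import norm

p1, p2 = 0.10, 0.12          # baseline and hoped-for conversion rates (effect size)
alpha, power = 0.05, 0.80    # significance level and statistical power

z_alpha = norm.ppf(1 - alpha / 2)   # two-tailed critical value
z_beta = norm.ppf(power)

# Approximate sample size required per group (control and treatment).
n = (z_alpha + z_beta) ** 2 * (p1 * (1 - p1) + p2 * (1 - p2)) / (p1 - p2) ** 2
print(round(n))
```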

Advanced Designs: Multi-Armed Bandit (MAB)

  • Continuously allocate participants based on each variant's performance history
  • Allocation increases for high-performing variations
  • Underperforming variations see their allocation decrease and are stopped when a threshold is reached
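
A minimal epsilon-greedy bandit sketch (the three arms and their "true" conversion rates are invented): with probability epsilon a random arm is explored, otherwise the best-looking arm is exploited.

```python
import random

true_rates = [0.10, 0.12, 0.11]   # hypothetical conversion rate of each variant
counts = [0, 0, 0]                # pulls per arm
values = [0.0, 0.0, 0.0]          # running mean reward per arm
epsilon = 0.1

random.seed(0)
for _ in range(10_000):
    if random.random() < epsilon:
        arm = random.randrange(3)                     # explore a random arm
    else:
        arm = max(range(3), key=lambda a: values[a])  # exploit the best arm so far
    reward = 1 if random.random() < true_rates[arm] else 0
    counts[arm] += 1
    values[arm] += (reward - values[arm]) / counts[arm]   # incremental mean update

print(counts)  # most traffic should end up on the best-performing arm
```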

Thompson Sampling

  • Uses a Bayesian approach to estimate each arm's reward probability
  • The arm with the highest sampled value is selected
  • The selected arm's distribution is updated with the observed outcome
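
A Thompson sampling sketch under the same invented three-arm setup: each arm keeps a Beta posterior over its success probability, one value is drawn from each posterior, and the highest draw wins.

```python
import random

true_rates = [0.10, 0.12, 0.11]   # hypothetical conversion rates
successes = [0, 0, 0]
failures = [0, 0, 0]

random.seed(0)
for _ in range(10_000):
    # Sample from each arm's Beta(successes + 1, failures + 1) posterior.
    draws = [random.betavariate(successes[a] + 1, failures[a] + 1) for a in range(3)]
    arm = max(range(3), key=lambda a: draws[a])   # pick the arm with the highest draw
    if random.random() < true_rates[arm]:
        successes[arm] += 1                       # update the posterior with the outcome
    else:
        failures[arm] += 1
```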

Reproducibility and Replicability

  • Reproducibility: obtaining the same results using the original data and code
  • Replicability: independent researchers obtaining consistent results with their own experiment or data

Types of Interpretability

  • Interpretability matters for trusting decisions, debugging, and discovering new insights
  • Intrinsic: interpretability is a built-in property of the model
  • Post hoc: the model is interpreted after predictions are made
  • Model-specific vs model-agnostic
  • Model-specific methods apply only to a particular class of model
  • Model-agnostic methods work with any model
  • Local vs global
  • Local methods interpret a single prediction
  • Global methods interpret the overall model

Feature Importance

  • Global feature importance identifies each feature's overall impact on the model's output
  • Local feature importance explains feature contributions to a single prediction
  • SHAP stands for SHapley Additive exPlanations
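
Permutation importance is one model-agnostic way to obtain a global ranking: shuffle one feature at a time and measure how much the score drops. A sketch with a synthetic dataset and a random forest (both illustrative choices):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance

X, y = make_classification(n_samples=300, n_features=5, random_state=0)
model = RandomForestClassifier(random_state=0).fit(X, y)

# Average score drop when each feature is shuffled -- a global importance measure.
result = permutation_importance(model, X, y, n_repeats=10, random_state=0)
print(result.importances_mean)
```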

Intrinsically Interpretable Models

  • Decision trees and nearest-neighbour models are easy to interpret

Global Surrogate Models

  • An interpretable model is trained to approximate the predictions of a black-box model
  • For example: train an SVM (the black box)
  • Label the data with the SVM's predictions
  • Train a decision tree on those predicted labels
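
A sketch of the surrogate recipe above: the decision tree is fitted to the SVM's predictions rather than the true labels, so it mimics (and explains) the black box.

```python
from sklearn.datasets import make_classification
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)

black_box = SVC().fit(X, y)                 # the black-box model
surrogate_labels = black_box.predict(X)     # relabel the data with its predictions

surrogate = DecisionTreeClassifier(max_depth=3).fit(X, surrogate_labels)
print(surrogate.score(X, surrogate_labels)) # fidelity: how closely the tree matches the SVM
```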

Local Surrogate Models - LIME

  • Local surrogate models explain individual predictions of a machine learning model
  • Generate a new dataset by perturbing a single observation (e.g., turning features on and off)
  • Train an interpretable model on that dataset, weighting samples by their proximity to the original observation
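
A hand-rolled sketch of the LIME idea (the LIME library wraps this up more carefully): perturb one observation, weight the perturbations by proximity, and fit a weighted linear model whose coefficients serve as the local explanation.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import Ridge

X, y = make_classification(n_samples=500, n_features=5, random_state=0)
black_box = RandomForestClassifier(random_state=0).fit(X, y)

x0 = X[0]                                                  # the single prediction to explain
rng = np.random.default_rng(0)
perturbed = x0 + rng.normal(scale=0.5, size=(1000, X.shape[1]))   # new local dataset
preds = black_box.predict_proba(perturbed)[:, 1]                  # black-box outputs
weights = np.exp(-np.linalg.norm(perturbed - x0, axis=1))         # closer points weigh more

local_model = Ridge().fit(perturbed, preds, sample_weight=weights)
print(local_model.coef_)   # local feature contributions around x0
```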

Counterfactual Explanations

  • Express what would have happened if the input had been different
  • For example: "if feature X had been Y, the prediction would have been positive"
  • Identify potential causal links
  • A disadvantage is that there are often multiple possible counterfactual explanations
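
A very simple counterfactual search sketch: nudge a single feature of one observation until the model's prediction flips (real counterfactual methods search over many features and minimise the size of the change).

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=500, n_features=5, random_state=0)
model = LogisticRegression(max_iter=1000).fit(X, y)

x0 = X[0].copy()
original = model.predict([x0])[0]

counterfactual = None
for delta in np.linspace(0.05, 3.0, 60):        # grow the perturbation gradually
    for sign in (1, -1):
        candidate = x0.copy()
        candidate[0] += sign * delta             # change only feature 0
        if model.predict([candidate])[0] != original:
            counterfactual = candidate           # smallest change found that flips the prediction
            break
    if counterfactual is not None:
        break

print(counterfactual)
```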
