Untitled Quiz

Questions and Answers

What is the significance of having two hidden layers in an artificial neural network (ANN)?

They can represent any decision boundary with high accuracy. (correct)
They increase the overall computational speed.
They improve the interpretability of the model.
They simplify the training process.

How is the optimal size of the hidden layer(s) in a multi-layer ANN typically determined?

By following pre-set standard sizes for specific tasks.
Based on theoretical analysis of network performance.
Through extensive simulations on training data.
By using a trial-and-error heuristic approach. (correct)

What happens during the training of a multi-layer ANN when an error is detected in the output?

New input patterns are generated.
The hidden layers are removed.
The entire network resets to its initial state.
Weights are adjusted to reduce the error. (correct)
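
A minimal sketch of this weight adjustment, assuming a single linear neuron trained with the delta rule. The function name `train_step`, the learning rate, and the toy input/target pair are illustrative assumptions, not part of the quiz:

```python
def train_step(w, b, x, target, lr=0.1):
    """One delta-rule update for a single linear neuron y = w*x + b."""
    y = w * x + b                 # forward pass
    error = target - y            # error detected at the output
    w_new = w + lr * error * x    # adjust the weight in proportion to the error
    b_new = b + lr * error        # adjust the bias the same way
    return w_new, b_new, error


w, b = 0.0, 0.0
errors = []
for _ in range(20):
    w, b, err = train_step(w, b, x=2.0, target=4.0)
    errors.append(abs(err))

print(errors[0], errors[-1])  # the absolute error shrinks as training proceeds
```

Each update moves the weights in the direction that reduces the output error, which is the behaviour the correct answer describes.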

Why might more layers be added to an ANN structure?

To handle more complex data and numerous predictors. (A)

Which of the following is true about the learning process in a multi-layer ANN?

It involves presenting input patterns and adjusting weights based on errors. (C)

What is the purpose of the backward pass in an artificial neural network (ANN)?

To calculate and propagate the error backwards (C)

Which factor can contribute to the overfitting of an ANN?

An increasing number of hidden neurons (C)

What does the back propagation algorithm primarily aim to achieve?

Minimize the total error of the ANN (B)

What is indicated by a higher $R^2$ value in relation to an ANN?

A better fit to the data (D)
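
For reference, $R^2$ can be computed as one minus the ratio of the residual sum of squares to the total sum of squares. The sketch below assumes plain Python lists; the helper name `r_squared` and the sample predictions are made up for illustration:

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_y) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


actual = [3, 5, 7, 9]
good = r_squared(actual, [2.8, 5.1, 7.2, 8.9])  # predictions close to the data
poor = r_squared(actual, [5, 5, 7, 7])          # predictions further from the data

print(good, poor)  # the closer predictions give the higher R^2
```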

What does momentum help achieve when training an ANN?

Helping the search avoid getting stuck in local minima of the error surface (C)

Which of the following is NOT a parameter that can affect the performance of an ANN?

Output data type (A)

What does the local gradient of a neuron during back propagation represent?

The change in the neuron's output relative to changes in input (A)

What is one potential consequence of a high learning rate in an ANN?

Divergence from optimal weights (D)
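
A toy illustration of this effect, assuming plain gradient descent on the one-dimensional error function E(w) = w², whose minimum is at w = 0. The function name `descend` and both learning rates are illustrative choices:

```python
def descend(lr, steps=20, w=1.0):
    """Run gradient descent on E(w) = w**2 and return the final weight."""
    for _ in range(steps):
        w -= lr * 2 * w  # the gradient of w**2 is 2*w
    return w


small = descend(lr=0.1)  # each step multiplies w by 0.8, so w approaches 0
large = descend(lr=1.1)  # each step multiplies w by -1.2, so |w| keeps growing

print(small, large)
```

With the small rate the weight settles towards the optimum; with the large rate every step overshoots by more than the previous one.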

What is the primary purpose of the least squares method in linear regression?

To minimize the sum of the squared differences between actual and predicted values (B)
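
A minimal sketch of the closed-form least squares fit for one predictor, assuming the model Y = β0 + β1X + ε. The function name `least_squares` and the sample data are illustrative:

```python
def least_squares(xs, ys):
    """Return (intercept, slope) minimizing the sum of squared residuals."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Data generated from y = 1 + 2x with no noise.
b0, b1 = least_squares([1, 2, 3, 4], [3, 5, 7, 9])
print(b0, b1)
```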

Which of the following indicates that a linear regression model may not adequately fit the data?

A non-linear pattern in residuals (A)

When analyzing residuals to check for homoscedasticity, what does constant variance imply?

The model is correctly specified (B)

In a confusion matrix, what does a True Positive (TP) represent?

The model predicted YES, and the actual answer was YES (C)
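
The four cells of a binary confusion matrix can be counted directly from paired labels. This sketch assumes labels encoded as 1 (YES) and 0 (NO); the function name `confusion_counts` and the sample labels are illustrative:

```python
def confusion_counts(actual, predicted):
    """Count TP, FP, FN, TN for binary labels encoded as 1 (YES) and 0 (NO)."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)  # predicted YES, actually YES
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)  # predicted YES, actually NO
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)  # predicted NO, actually YES
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)  # predicted NO, actually NO
    return tp, fp, fn, tn


tp, fp, fn, tn = confusion_counts(
    actual=[1, 1, 0, 0, 1],
    predicted=[1, 0, 0, 1, 1],
)
print(tp, fp, fn, tn)
```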

What does a non-independent residual analysis indicate?

Residuals are correlated and may indicate model misspecification (A)

The linear function is represented mathematically as which of the following?

Y = β0 + β1X + ε (A)

Which statement is true regarding the slope in a linear regression model?

It represents the rate of change of Y with respect to X (A)

How does increasing the number of data points affect the least squares regression line?

It could potentially stabilize the coefficients and reduce variability (A)

What does it mean if the residuals have a fan-shaped pattern when plotted?

The variance of the residuals is not constant (heteroscedasticity) (C)

What is represented by the term 'error' in a linear regression model?

The difference between predicted and actual values (B)

Which logical operation results in 1 only when the inputs are different?

XOR (C)
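
The truth tables for AND, OR and XOR can be generated with Python's bitwise operators on 0/1 inputs. This is only an illustrative sketch; the variable name `rows` is an arbitrary choice:

```python
# Each row is (a, b, a AND b, a OR b, a XOR b).
rows = [(a, b, a & b, a | b, a ^ b) for a in (0, 1) for b in (0, 1)]

print("a b AND OR XOR")
for a, b, and_, or_, xor in rows:
    print(a, b, and_, or_, xor)
```

XOR is 1 only in the two rows where the inputs differ.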

In a single-layer perceptron, which type of problems can it not solve?

Non-linearly separable problems (B)
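
A small experiment illustrates the limitation, assuming a two-input perceptron with a step activation trained by the classic perceptron learning rule. The function names, the number of epochs, and the learning rate are illustrative assumptions:

```python
def predict(weights, x1, x2):
    """Step activation on a weighted sum: 1 if w1*x1 + w2*x2 + b > 0, else 0."""
    w1, w2, b = weights
    return 1 if w1 * x1 + w2 * x2 + b > 0 else 0


def train_perceptron(data, epochs=100, lr=1):
    """Perceptron learning rule on a list of ((x1, x2), target) pairs."""
    w1 = w2 = b = 0
    for _ in range(epochs):
        for (x1, x2), target in data:
            delta = lr * (target - predict((w1, w2, b), x1, x2))
            w1 += delta * x1
            w2 += delta * x2
            b += delta
    return w1, w2, b


def count_correct(weights, data):
    return sum(predict(weights, x1, x2) == t for (x1, x2), t in data)


AND = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

and_correct = count_correct(train_perceptron(AND), AND)
xor_correct = count_correct(train_perceptron(XOR), XOR)
print(and_correct, xor_correct)
```

AND is linearly separable, so the perceptron classifies all four patterns. XOR is not, so no single line can separate its classes and at least one pattern is always misclassified, however long training runs.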

What is true about multi-layer artificial neural networks (ANNs)?

They can learn both linearly and non-linearly separable problems. (B)

Which of the following accurately describes the layers in a multi-layer ANN?

At least one layer must be hidden. (A)

What does the AND operator output when both inputs are 0?

0 (D)

What classification task is suited for a multi-layer ANN but not for a single-layer perceptron?

Multi-class classification (C)

How do multi-layer ANNs propagate input signals?

Layer-by-layer in a forward direction (C)

Which result does the OR logical operation yield for inputs 0 and 0?

0 (D)

How does changing the mean (μ) of a normal distribution affect its graph?

It shifts the distribution left or right. (A)

What does the standard deviation (σ) determine in a normal distribution?

The width or spread of the distribution. (A)

In a normal distribution defined by its mean and standard deviation, what does E(X) represent?

The expected value of the random variable. (A)

What mathematical function describes a normal distribution?

A probability density function (pdf). (C)

How is the variance (Var(X)) of a normal distribution calculated?

By squaring the standard deviation. (D)

According to the central limit theorem, how does the mean of a sample (𝑥̅) vary around the population mean (μ)?

It varies around μ with a standard deviation of σ/√n. (D)
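
The standard result is that the sample mean has standard deviation σ/√n (the standard error). A small simulation can check this, assuming draws from a normal population with μ = 0 and σ = 1; the seed, sample size, and number of trials are arbitrary illustrative choices:

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable
sigma, n, trials = 1.0, 25, 2000

# Draw many samples of size n and record each sample mean.
sample_means = [
    statistics.fmean([random.gauss(0.0, sigma) for _ in range(n)])
    for _ in range(trials)
]

observed = statistics.stdev(sample_means)  # spread of the sample means
expected = sigma / n ** 0.5                # sigma / sqrt(n) = 0.2

print(observed, expected)
```

The observed spread should land close to 0.2 rather than to σ/n = 0.04.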

In a normal distribution, if the mean (μ) is increased while the standard deviation (σ) remains unchanged, what happens to the distribution?

The distribution shifts to the right. (A)

Which of the following statements is true about the total area under a normal distribution curve?

It always equals 1. (C)

What happens to the sampling distribution of 𝑥̅ as sample size n increases?

It approaches a Gaussian (normal) distribution. (D)

What does a p-value greater than 0.1 indicate?

No presumption against the null hypothesis. (B)

What does a positive covariance between two variables indicate?

The variables are positively correlated. (B)

Which of the following p-value ranges indicates a low presumption against the null hypothesis?

0.05 < 𝑝 ≤ 0.1 (B)

In the context of A/B testing, what does Fisher's exact test evaluate?

Non-random associations between two categorical variables. (C)

In terms of linear correlation, what does it mean when the covariance is equal to zero?

The two variables have no linear relationship (they are uncorrelated, though not necessarily independent). (C)
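
Zero covariance rules out only a linear relationship. The sketch below uses made-up data in which Y is completely determined by X (Y = X²), yet the sample covariance is zero. The function name `covariance` is an illustrative choice:

```python
def covariance(xs, ys):
    """Sample covariance with the usual n - 1 denominator."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


xs = [-2, -1, 0, 1, 2]
ys = [x ** 2 for x in xs]  # Y depends entirely on X, but not linearly

cov_xy = covariance(xs, ys)
print(cov_xy)
```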

Which statement describes the 68-95-99.7 Rule?

68% of data falls within one standard deviation from the mean. (A)
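
The three percentages in the rule can be checked against the standard normal cumulative distribution function. This sketch uses `statistics.NormalDist` from the Python standard library (3.8 or later); the helper name `within` is an illustrative choice:

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0.0, sigma=1.0)


def within(k):
    """Probability that a normal variable lies within k standard deviations of its mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


for k in (1, 2, 3):
    print(k, round(within(k), 4))
```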

Which scenario best demonstrates anomaly detection in machine learning?

Finding potential scams in an online retail shop. (B)

Which of the following represents a weak linear relationship in terms of correlation?

cov(X,Y) = 0.01 (A)

What is the interpretation of a p-value that falls within the range of 0.01 to 0.05?

Strong presumption against the null hypothesis. (B)

Flashcards

Normal Distribution

A bell-shaped probability distribution defined by its mean (µ) and standard deviation (σ).

Mean (µ)

The average value of the distribution.

Standard Deviation (σ)

Measures the spread or dispersion of the data around the mean.

Probability Density Function (PDF)

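A mathematical function that describes the relative likelihood of a continuous random variable taking values near a given point; the probability of falling in an interval is the area under the function over that interval.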