Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

Which of the following best describes the role of probability in real-world modeling?

Probability is used only when the system is inherently random at a quantum level.
Probability is only relevant when the outcome of an experiment is completely unknown beforehand.
Probability helps to simplify models by abstracting away complexities that are too difficult or unnecessary to model deterministically. (correct)
Probability is used to model systems only when deterministic models have been proven to be inaccurate.

A deterministic experiment is one in which the outcome is unpredictable.

False (B)

Define a 'sample space' in the context of probability and provide an example.

A sample space is the set of all possible outcomes of a random experiment. For example, when flipping a coin, the sample space is {Heads, Tails}.

In probability, a random variable is a __________ description of the outcomes of a random experiment.

numerical Signup and view all the answers

Which of the following is an example of a continuous random variable?

The height of students in a class (C) Signup and view all the answers

What is the probability of event A, defined as rolling a dice and getting a number greater than 4?

1/3 (A) Signup and view all the answers

Match each concept with its correct description:

Deterministic Experiment = Experiment where the outcome is known beforehand. Random Experiment = Experiment where the outcome is not known beforehand. Sample Space = The set of all possible outcomes of an experiment. Event = Any subset of the sample space. Signup and view all the answers

What is the probability of picking a Samsung phone given that the phone is not working, denoted as $P(SP|NW)$?

$rac{5}{12}$ (D) Signup and view all the answers

The probability of picking a non-working phone given that it is a Samsung phone ($P(NW|SP)$) is equal to the probability of picking a Samsung phone given that it is a non-working phone ($P(SP|NW)$).

False (B) Signup and view all the answers

If a phone is picked at random from the box, what is the probability that it is a Samsung phone?

2/3 Signup and view all the answers

Given the information, there are a total of ______ phones in the box.

60 Signup and view all the answers

What does $P(NW|SP)$ represent in this context?

The probability that a Samsung phone is not working. (D) Signup and view all the answers

How many total phones are not working?

12 (A) Signup and view all the answers

Which formula correctly represents the conditional probability $P(SP|NW)$?

$P(SP|NW) = \frac{P(SP \cap NW)}{P(NW)}$ (B) Signup and view all the answers

Based on the context, more than half of the Samsung phones are non-working.

False (B) Signup and view all the answers

What is the probability that a randomly selected MI phone is not working?

1/10 Signup and view all the answers

In the context of decision tree construction, what is the primary purpose of splitting training samples into smaller bags based on feature values?

To create more homogeneous subsets of data, facilitating decision-making. (D) Signup and view all the answers

In the context of decision tree construction, it is generally advisable to continue splitting nodes indefinitely to achieve perfect classification of the training data.

False (B) Signup and view all the answers

In constructing a decision tree, the process of dividing a node into sub-nodes is known as ______.

splitting Signup and view all the answers

Match the following Blood Pressure (BP) levels with the corresponding sample sets.

BP = Low = S1, S3 BP = Normal = S2, S6 BP = High = S4, S5, S7 Signup and view all the answers

If fever is conditionally independent of cough given Covid, then $P(Fever | Covid, Cough) = P(Fever | Cough)$

False (B) Signup and view all the answers

When modeling $P(Fever, Covid, Cough)$ without assuming conditional independence, how many parameters are typically required?

7 (A) Signup and view all the answers

In the context of the Naïve Bayes' model, what key assumption is made about the effects given the cause?

The effects are independent given the cause. Signup and view all the answers

In a Naïve Bayes' model, if we have effects $E_1$, $E_2$, and $E_3$ and a cause $C$, then $P(E_1, E_2, E_3, C) = P(C) * P(E_1 | C) * P(E_2 | C) * P(E_3 | C)$. This is based on the assumption of conditional ______ between the effects.

independence Signup and view all the answers

Which of the following is the correct formula for $P(Effect_1, Effect_2, ..., Effect_n, Cause)$ in a Naïve Bayes' model?

$P(Cause) * \prod_{k=1}^{n} P(Effect_k | Cause)$ (D) Signup and view all the answers

Match the term with its description in the context of the Naïve Bayes' model:

Cause = Represents the class label in a classification problem. Effect = Represents a feature used for classification. Conditional Independence = Assumption that effects are independent given the cause. Parameters = Values that define the conditional probability distributions. Signup and view all the answers

In the context of the graph representing influences, if Covid is the cause and Fever and Cough are effects, what does this structure imply according to the Naïve Bayes' approach?

Fever and Cough are independent of each other, given Covid. (D) Signup and view all the answers

Using conditional independence assumptions always increases the number of parameters needed to accurately model a joint probability distribution.

False (B) Signup and view all the answers

Explain how the Naïve Bayes’ model simplifies the calculation of probabilities when dealing with multiple effects.

It assumes that the effects are conditionally independent of each other given the cause, allowing the joint probability to be calculated as the product of individual conditional probabilities. Signup and view all the answers

In a classification problem using the Naïve Bayes’ model, what do the 'effects' typically represent?

Features (C) Signup and view all the answers

In the context of Gaussian Mixture Models (GMM), what does the variable $λ_i^{(n)}$ represent?

The belongingness of data point $x^{(n)}$ to the $i^{th}$ Gaussian component. (D) Signup and view all the answers

In GMM, recalculating $μ_i$, $Σ_i$, and $π_i$ aims to minimize, rather than maximize, the log likelihood.

False (B) Signup and view all the answers

Write the equation for calculating $μ_i$ (mean) in a Gaussian Mixture Model using belongingness values.

$\mu_i = \frac{\sum_{n=1}^{N} \lambda_i^{(n)} x^{(n)}}{\sum_{n=1}^{N} \lambda_i^{(n)}}$ Signup and view all the answers

In the context of GMM, the parameter $π_i$ represents the ______ of choosing the $i^{th}$ Gaussian component.

probability Signup and view all the answers

What is the purpose of calculating belongingness in the Expectation Step of GMM?

To estimate the probability that each data point belongs to each cluster. (C) Signup and view all the answers

The covariance matrix, $Σ_i$, in GMM defines the center of the $i^{th}$ Gaussian component.

False (B) Signup and view all the answers

Describe in words, how the value of $π_i$ is calculated in GMM.

The value of $π_i$ is calculated by dividing the sum of the belongingness values for cluster i by the total number of data points. Signup and view all the answers

The M in GMM stands for ______, where parameters are updated.

Maximization Signup and view all the answers

What benefit does soft clustering provide in GMM compared to hard clustering methods?

It allows data points to belong to multiple clusters with varying degrees of membership. (D) Signup and view all the answers

Match the GMM parameter with its update calculation:

$μ_i$ = $\frac{\sum_{n=1}^{N} \lambda_i^{(n)} x^{(n)}}{\sum_{n=1}^{N} \lambda_i^{(n)}}$ $Σ_i$ = $\frac{\sum_{n=1}^{N} \lambda_i^{(n)} (x^{(n)} - μ_i)(x^{(n)} - μ_i)^T}{\sum_{n=1}^{N} \lambda_i^{(n)}}$ $π_i$ = $\frac{\sum_{n=1}^{N} \lambda_i^{(n)}}{N}$ Signup and view all the answers

Flashcards

Probability

The study and quantification of uncertainty.

Random Experiment

Situations where the outcome is not known in advance.