Machine Learning Decision Trees Quiz

Questions and Answers

What is the primary purpose of the Attribute Selection Measure (ASM) in decision trees?

To select the best splitting criterion for partitioning data. (correct)
To minimize the number of features in the dataset.
To ensure the model prevents any overfitting.
To determine the target feature value of unseen instances.
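
One common attribute selection measure is information gain, the reduction in entropy produced by a split. The quiz does not say which measure it has in mind, so the sketch below is only an illustration under that assumption; the toy rows, labels, and function names are hypothetical.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, labels, feature_index):
    """Entropy reduction obtained by splitting on one categorical feature."""
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[feature_index], []).append(label)
    weighted = sum(len(part) / len(labels) * entropy(part) for part in partitions.values())
    return entropy(labels) - weighted

# Hypothetical toy data: each row is (outlook, windy); the label is "play".
rows = [("sunny", "no"), ("sunny", "yes"), ("rainy", "no"), ("rainy", "yes")]
labels = ["yes", "yes", "no", "no"]

gains = [information_gain(rows, labels, i) for i in range(2)]
best_feature = max(range(2), key=lambda i: gains[i])  # index of the feature to split on
```

Here the first feature separates the classes perfectly, so it has the higher gain and would be chosen as the splitting criterion.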

Which step should NOT be performed when building a decision tree?

Select a test for the root node.
Create branches for each outcome of the test.
Stop recursion when all instances in a branch have the same class.
Combine subsets to increase the number of instances. (correct)

In estimating class probabilities, how does a decision tree determine the likelihood that an instance belongs to class k?

By adding the probabilities of each feature considered in the branch.
By averaging the probabilities from all leaf nodes.
By calculating the overall mean of class probabilities.
By looking at the proportion of training instances in the corresponding leaf node. (correct)
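
The leaf-proportion estimate can be sketched in a few lines. This assumes the instance has already been routed to a leaf and that the leaf's training labels are available as a list; the labels and the function name are hypothetical.

```python
from collections import Counter

def leaf_class_probabilities(leaf_labels):
    """Estimate P(class k) as the share of training instances of class k in the leaf."""
    total = len(leaf_labels)
    return {cls: count / total for cls, count in Counter(leaf_labels).items()}

# Hypothetical leaf holding four training instances.
leaf_labels = ["cat", "cat", "cat", "dog"]
probabilities = leaf_class_probabilities(leaf_labels)
predicted_class = max(probabilities, key=probabilities.get)
```

The predicted class is simply the one with the largest proportion in the leaf.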

What happens when a branch in a decision tree reaches a point where all instances belong to the same class?

The branch is considered a terminal node. (C)

Which step is involved in the process of building a decision tree?

Recursively applying the splitting process to create branches. (C)

What does it mean for a computer program to learn from experience in the context of machine learning?

It adapts its performance at tasks based on previous experiences. (A)

Which type of machine learning involves predicting a continuous outcome from input data?

Regression (B)

In the context of a confusion matrix, what does a true positive (TP) represent?

Positive examples that are correctly classified as positive. (D)

Which term describes the percentage of actual positive instances that are correctly identified?

True positive rate (D)
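
As a worked illustration, the true positive rate is TP / (TP + FN), where FN counts actual positives that were predicted negative. The sketch below uses hypothetical labels (not the image counts from this quiz) and assumes the positive class is encoded as 1.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, FP, FN, and TN for a binary classification problem."""
    counts = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            counts["TP" if actual == positive else "FP"] += 1
        else:
            counts["FN" if actual == positive else "TN"] += 1
    return counts

def true_positive_rate(counts):
    """Share of actual positives that were correctly identified."""
    actual_positives = counts["TP"] + counts["FN"]
    if actual_positives == 0:
        raise ValueError("no actual positive instances")
    return counts["TP"] / actual_positives

# Hypothetical ground truth and predictions.
y_true = [1, 1, 1, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 1]
counts = confusion_counts(y_true, y_pred)
tpr = true_positive_rate(counts)
```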

What is the goal of unsupervised learning in machine learning?

To categorize data into groups without pre-labeled examples. (D)

Which of the following is NOT a characteristic of reinforcement learning?

It requires labeled training data to learn effectively. (A)

What is the primary purpose of training and test sets in machine learning?

To ensure that algorithms can classify future input correctly. (A)

What is the correct count of images that contain cats based on the model's predictions?

160 (D)

Which of the following statements best describes the essence of machine learning?

Machine learning seeks to improve performance through data-based experience. (B)

Which type of learning algorithm is used to predict outcomes based on labeled data?

Supervised learning (D)

In the context of decision trees, what is the main goal when splitting the dataset?

To achieve pure sub-datasets regarding the target feature (D)

What is indicated by the term 'leaf nodes' in decision tree algorithms?

The final predictions made for new instances (A)

How many total images were used to test the model's performance?

200 (A)

Which type of learning aims at learning from both labeled and unlabeled data?

Semi-supervised learning (A)

What type of problems can decision trees be used for?

Both classification and regression tasks (D)

What is the purpose of the stopping criterion in decision trees?

To indicate when to stop splitting the dataset (C)

What is the primary goal of minimizing the within-cluster sum of squares (WCSS)?

To achieve the smallest possible WCSS (B)
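
For reference, WCSS is the sum, over all clusters, of the squared distances between each point and its cluster mean. A minimal sketch, assuming points are numeric tuples and cluster membership is given as a parallel list of cluster ids (both hypothetical here):

```python
def wcss(points, assignments):
    """Within-cluster sum of squares for a given assignment of points to clusters."""
    clusters = {}
    for point, cluster_id in zip(points, assignments):
        clusters.setdefault(cluster_id, []).append(point)
    total = 0.0
    for members in clusters.values():
        # The centroid is the per-dimension mean of the cluster's members.
        centroid = [sum(axis) / len(members) for axis in zip(*members)]
        total += sum(
            sum((value - center) ** 2 for value, center in zip(point, centroid))
            for point in members
        )
    return total

# Hypothetical data: two tight groups far apart.
points = [(0, 0), (0, 2), (10, 10), (12, 10)]
compact = wcss(points, [0, 0, 1, 1])  # nearby points grouped together
mixed = wcss(points, [0, 1, 0, 1])    # distant points grouped together
```

Grouping nearby points gives a much smaller WCSS than grouping distant ones, which is why a lower value indicates tighter clusters.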

What assumption is implicit in minimizing WCSS?

SSE is similar for each group (B)

What is the primary goal of the k-means clustering algorithm?

To partition observations into specified clusters based on proximity to cluster means (D)

Which process is generally used to model decision-making in reinforcement learning?

Markov Decision Process (MDP) (B)

What is a significant weakness of the k-means clustering algorithm?

It can be challenging to determine the appropriate number of clusters, K (A)

In reinforcement learning, what is the role of the agent?

To explore the environment and make decisions (D)

What distinguishes reinforcement learning from supervised learning?

Reinforcement learning does not provide explicit guidance for actions. (C)

During which step of the k-means algorithm do cluster centroids get recalculated?

After assigning objects to the closest cluster (C)

What type of distance measure is primarily used in the k-means algorithm?

Squared Euclidean distance (C)

What does Q-learning primarily seek to determine?

The optimal policy to maximize long-term rewards (A)

What initial action does the Q-learning algorithm perform?

Randomly choose an initial state (B)
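
The core of tabular Q-learning is a single update applied after each action: Q(s, a) moves toward the reward plus the discounted best value of the next state. The sketch below shows only that update step, not the full loop (choosing an initial state, selecting actions, repeating over episodes). The state names, action names, learning rate `alpha`, and discount `gamma` are hypothetical choices for illustration.

```python
def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Apply one Q-learning update to the table q in place.

    q maps each state to a dict of action -> estimated long-term reward.
    """
    best_next = max(q[next_state].values())
    td_target = reward + gamma * best_next
    q[state][action] += alpha * (td_target - q[state][action])

# Hypothetical two-state table.
q = {
    "s0": {"left": 0.0, "right": 0.0},
    "s1": {"left": 0.0, "right": 2.0},
}
q_update(q, "s0", "right", reward=1.0, next_state="s1")

# A greedy policy picks the highest-valued action in each state.
greedy_action = max(q["s0"], key=q["s0"].get)
```

After the update, Q("s0", "right") is 0.5 * (1.0 + 0.9 * 2.0) = 1.4, so the greedy policy in "s0" now prefers "right".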

What could be a consequence of using the k-means algorithm without rescaling data?

Clusters may be misrepresented due to differing scales in dimensions (A)
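
To see why scale matters, compare distances before and after rescaling. The sketch below uses min-max scaling, which is one of several possible rescaling choices and is an assumption here; the (age, income) data are hypothetical.

```python
import math

def min_max_scale(points):
    """Rescale every dimension to the range [0, 1]."""
    columns = list(zip(*points))
    lows = [min(column) for column in columns]
    highs = [max(column) for column in columns]
    return [
        tuple((v - lo) / (hi - lo) if hi > lo else 0.0 for v, lo, hi in zip(p, lows, highs))
        for p in points
    ]

# Hypothetical people described by (age in years, income in dollars).
people = [(25, 30000), (60, 31000), (27, 90000)]
scaled = min_max_scale(people)

# Raw distances are dominated by income because its numeric range is far larger.
raw_to_older = math.dist(people[0], people[1])
raw_to_richer = math.dist(people[0], people[2])

# After scaling, a 35-year age gap counts about as much as a 60,000-dollar income gap.
scaled_to_older = math.dist(scaled[0], scaled[1])
scaled_to_richer = math.dist(scaled[0], scaled[2])
```

On the raw data the first person looks about sixty times closer to the second person than to the third; after scaling, the two distances are nearly equal.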

What is the initial purpose of selecting K in the k-means algorithm?

To determine the number of potential clusters to form (B)

What is indicated by the variable R in the context of reinforcement learning?

The reward received after executing an action (A)

At what point does the k-means algorithm determine that it has converged?

When there are no changes in cluster assignments (B)
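
The k-means steps covered by the preceding questions (assign each object to the closest centroid, recalculate centroids, stop when assignments no longer change) can be sketched as follows. To keep the example deterministic, the caller supplies the initial centroids; that is an assumption of this sketch, as are the data and function names.

```python
def squared_euclidean(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def k_means(points, initial_centroids, max_iter=100):
    """Return (assignments, centroids) once cluster assignments stop changing."""
    centroids = list(initial_centroids)
    assignments = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its closest centroid.
        new_assignments = [
            min(range(len(centroids)), key=lambda j: squared_euclidean(p, centroids[j]))
            for p in points
        ]
        # Convergence check: no assignment changed since the last pass.
        if new_assignments == assignments:
            break
        assignments = new_assignments
        # Update step: recalculate each centroid as the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:
                centroids[j] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return assignments, centroids

# Hypothetical data with K = 2.
points = [(1, 1), (1, 2), (8, 8), (9, 8)]
assignments, centroids = k_means(points, [(1, 1), (9, 8)])
```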

Which characteristic of k-means clustering could hinder its effectiveness with categorical data?

Its reliance on a distance measure that cannot be applied effectively to nominal data (C)

Which statement is true about decision tree learning models like ID3 and C4.5?

They are generally fast to train and easy to interpret. (B)

What assumption underlies the Naïve Bayes classifier?

Attributes are independent of each other. (B)

What is the primary purpose of clustering in unsupervised algorithms?

To group similar items together. (B)

Which of the following is a method for evaluating clustering effectiveness?

Manual inspection and distance measures. (D)

Which of the following statements is false regarding Naïve Bayes classifiers?

They are guaranteed to provide the highest accuracy. (C)

How is the distance between two items calculated in clustering with multiple numeric attributes?

Using the Euclidean distance formula. (A)
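
For items with several numeric attributes, the Euclidean distance is the square root of the summed squared differences across attributes. A short sketch with hypothetical three-attribute items, checked against Python's built-in `math.dist`:

```python
import math

def euclidean_distance(a, b):
    """Euclidean distance between two items with the same numeric attributes."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Hypothetical items with three numeric attributes each.
item_a = (1, 2, 3)
item_b = (4, 6, 3)

distance = euclidean_distance(item_a, item_b)  # sqrt(3**2 + 4**2 + 0**2)
builtin_distance = math.dist(item_a, item_b)   # standard-library equivalent (Python 3.8+)
```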

Which characteristic should clusters have in an effective clustering approach?

High similarity within clusters and low similarity across clusters. (B)

What is a limitation of decision tree models in classification tasks?

Their accuracy is often not state-of-the-art. (A)

Flashcards

Machine Learning

The study of algorithms that allow computers to learn from data.

Training Set

Data used to train a machine learning model.