Unsupervised Learning Overview

Questions and Answers

What does a lift of 1 indicate regarding the association of two items?

  • There is a high probability of purchasing both items together.
  • Both items are independent with no association. (correct)
  • Both items are equally popular among customers.
  • Both items have a strong positive correlation.

How is the confidence of an item being purchased calculated?

  • By dividing the number of transactions containing both items by the number containing one of the items. (correct)
  • By dividing the number of transactions with one item by the total number of transactions.
  • By dividing the total number of transactions by the number of transactions with both items.
  • By multiplying the number of transactions containing both items with the total transactions.

What is the maximum confidence achievable in a scenario where items are repeatedly purchased together?

  • No maximum limit
  • 100% (correct)
  • 75%
  • 50%

What does a lift value greater than 1 suggest about two items?

  • There is a significant association between the two items. (correct)

In the context of support, what does support measure?

  • The proportion of transactions that include a particular item or item set. (correct)

What characterizes unsupervised learning?

  • It finds patterns in unlabeled data without human intervention. (correct)

Which statement accurately describes K-means clustering?

  • Data points belong to exactly one cluster only. (correct)

What is one of the main tasks performed in unsupervised learning?

  • Finding groups or clusters within data. (correct)

What distinguishes agglomerative clustering from other clustering methods?

  • It begins with each data point as its own cluster. (correct)

In overlapping clustering, how do data points relate to clusters?

  • Points can belong to multiple clusters with varying degrees of membership. (correct)

Why is unsupervised learning ideal for exploratory data analysis?

  • It automatically discovers hidden patterns in data. (correct)

What is a common application of unsupervised learning?

  • Image recognition tasks. (correct)

Which of the following is NOT a task typically associated with unsupervised learning?

  • Supervised classification. (correct)

Which distance measure is commonly used in K-means clustering to find the distance between two points?

  • Euclidean distance measure (correct)

How is Manhattan distance calculated?

  • The sum of the absolute horizontal and vertical differences between the two points (correct)

What does the within-sum-of-squares (WSS) measure indicate in K-means clustering?

  • The total squared distance between each data point and its cluster centroid (correct)

What does the elbow point in the WSS-versus-number-of-clusters graph represent?

  • The point beyond which increasing the number of clusters has little effect on WSS (correct)

What is the first step in the K-means clustering process?

  • Randomly initialize two cluster centroids (correct)

Which step involves repositioning the randomly initialized centroid after calculating actual centroids?

  • Step 5 (correct)

What happens to the value of WSS as K increases beyond a certain point?

  • WSS stabilizes and changes minimally (correct)

Which of the following distance measures considers the angle between vectors?

  • Cosine distance measure (correct)

What is the primary purpose of K-Means clustering?

  • To divide objects into distinct clusters based on similarities (correct)

What is required before applying K-Means clustering to a dataset?

  • A defined distance metric over the variable space (correct)

How is K (the number of clusters) determined in K-Means clustering?

  • Through a systematic search for the optimal value based on data characteristics (correct)

What happens after the initial random allocation of centroids in K-Means clustering?

  • The actual centroid for each cluster is recalculated based on assigned data points (correct)

Which of the following describes a use case for K-Means clustering?

  • Summarizing properties of clusters for exploratory analysis (correct)

What is an important feature of the centroids used in K-Means clustering?

  • Centroids can be positioned randomly at the beginning of the process (correct)

What characteristic best describes the data input required by K-Means clustering?

  • Numerical data representing measurements of interest (correct)

Which of the following is NOT a step followed during K-Means clustering?

  • Creation of a linear regression model for centroid adjustment (correct)

What indicates that the k-means algorithm has converged?

  • The cluster assignments no longer change. (correct)

Which of the following is a caution related to k-means clustering?

  • The number of clusters must be decided a priori. (correct)

What property does the Apriori algorithm assume about itemsets?

  • All subsets of a frequent itemset must be frequent. (correct)

In the context of association rule learning, what does 'support' represent?

  • The frequency of an event or itemset occurring across all transactions. (correct)

What is a limitation of the k-means algorithm regarding cluster shapes?

  • It tends to create round, equi-sized clusters. (correct)

What happens if the first guess in k-means clustering is poor?

  • The results may be poor or suboptimal. (correct)

Which of the following accurately describes the 'lift' measure in association rule learning?

  • It indicates how much more likely an item is to be purchased together with another item. (correct)

What does the term 'K' represent in k-means clustering?

  • The number of clusters to be formed. (correct)

Flashcards

Conditional Probability

The probability of event A happening given that event B has already happened.

Lift

The ratio of the observed probability of two events occurring together to the expected probability if they were independent.

Confidence

For a rule X -> Y: the proportion of transactions containing both item X and item Y, divided by the proportion of transactions containing item X (the antecedent).

Support for {Cookie -> Cake}

The proportion of all transactions that contain both Cookie and Cake.

Association Rule Mining

A technique for discovering relationships between items in transactional data, typically by first generating frequent itemsets and then deriving rules from them.

Euclidean Distance

A straight-line distance between two points in Euclidean space.

Manhattan Distance

The sum of absolute differences between coordinates of two points.

Cosine Distance

Measures the angle between two vectors, indicating similarity or dissimilarity.

Elbow Method

A method for determining the optimal number of clusters in K-means clustering.

Within-Sum-of-Squares (WSS)

The sum of squared distances between each data point in a cluster and its centroid.

Centroid

A point that represents the center of a cluster in K-means clustering.

Cluster Assignment

The process of assigning each data point to the closest cluster based on its distance to the cluster's centroid.

Centroid Repositioning

The process of repositioning the cluster centroids to the actual mean of the assigned data points.

Fuzzy C-Means Clustering

An overlapping clustering technique in which each data point belongs to every cluster with a degree of membership between 0 and 1, rather than being assigned to exactly one cluster.

K-Means Clustering

A machine learning algorithm used for grouping data points into clusters, where each cluster represents a set of data points with similar characteristics.

Exploratory Data Analysis

The process of identifying hidden patterns and structures within datasets.

Classification

Utilizing insights gained from clustering analysis to build predictive models that classify data into predefined categories.

Distance Metric

A function that quantifies how far apart two data points are; Euclidean distance is the most common choice.

Lifetime Customer Value (LTV)

A metric used to assess the overall value of a customer to a business, considering factors like purchasing history, frequency, and average order value.

Finding the Optimal K

The process of determining the optimal number of clusters for a dataset, considering factors like minimizing within-cluster variance and maximizing between-cluster variance.

Unsupervised Learning

A type of machine learning where the computer learns from unlabeled data to discover patterns and insights without explicit instructions.

Clustering

A technique for grouping data points into clusters based on their similarity. Items in a cluster are more similar to one another than to items in other clusters.

Association Rules

Rules that describe relationships between items in a dataset, often used to find patterns or associations in sales data.

Exclusive (Partitioning) Clustering

This type of clustering allows a data point to belong to only one cluster. Think of it as putting items in separate, distinct boxes.

Agglomerative Clustering

This type of clustering starts with each data point as its own cluster, then iteratively merges the closest clusters together.

Overlapping Clustering

This type of clustering allows a data point to belong to multiple clusters with varying degrees of membership. Think of it like a fuzzy category where items can overlap.

K-means Convergence

K-means clustering is considered converged when the cluster assignments no longer change, indicating the algorithm has reached a stable state.

Easy Implementation (K-means)

One advantage of K-means is its ease of implementation. The algorithm involves straightforward steps that can be readily coded.

New Data Assignment (K-means)

K-means can easily assign new data points to existing clusters by measuring their proximity to the cluster centers. This is useful for categorizing new data.

K-means Limitation: Categorical Variables

Categorical variables represent distinct categories (e.g., colors, genders) and are not easily handled by K-means, which is primarily designed for numerical data.

Sensitivity to Initialization (K-means)

The initial placement of cluster centers can significantly influence the final clustering results in K-means. Choosing a good starting point is crucial.

Apriori Algorithm Principle

The Apriori algorithm uses a prior knowledge principle, assuming that all subsets of a frequent itemset must also be frequent. It leverages this knowledge to efficiently discover frequent itemsets in datasets.

Apriori Algorithm Iterative Approach

The Apriori algorithm follows an iterative approach, building on the knowledge of frequent itemsets of a given size (k) to find frequent itemsets of the next size (k+1).

Study Notes

Unsupervised Learning Definition

  • Unsupervised learning is a machine learning technique where users do not need to supervise the model
  • It allows the model to find patterns and information on its own, without prior knowledge
  • It primarily works with unlabeled data
  • It's more complex than supervised learning, allowing analysis and clustering of unlabeled datasets
  • It's useful for exploratory data analysis, cross-selling, customer segmentation, and image recognition

Unsupervised Learning Tasks

  • Finding groups or clusters of data
  • Reducing the dimensionality of data
  • Association mining
  • Anomaly detection

K-Means Clustering

  • Used for clustering numerical data, typically sets of measurements
  • Input: Numerical data and a distance metric (e.g., Euclidean distance) over the data
  • Output: Centers (centroids) of discovered clusters, and the assignment of each data point to a cluster
  • The k-means algorithm iteratively finds the best centroids based on distances between data points and those centroids.
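
For concreteness, here is a minimal sketch of these inputs and outputs using scikit-learn's KMeans (assuming scikit-learn is installed; the data values and k=2 are arbitrary choices for illustration):

    import numpy as np
    from sklearn.cluster import KMeans

    # Numerical input: each row is one observation (a set of measurements).
    X = np.array([[1.0, 2.0], [1.5, 1.8], [5.0, 8.0],
                  [8.0, 8.0], [1.0, 0.6], [9.0, 11.0]])

    # Fit k-means with k=2; Euclidean distance is used internally.
    kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

    print(kmeans.cluster_centers_)  # output: centroids of the discovered clusters
    print(kmeans.labels_)           # output: cluster assignment of each data point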

K-Means Clustering - Example

  • The first step is assigning random centroids (e.g., two centroids for k=2)
  • Calculate the distance from each data point to these random centroids
  • Assign each data point to the closest centroid
  • Reposition centroids to the actual centers of the newly formed clusters
  • Repeat calculation of distances, assignments, and centroid repositioning until convergence, i.e., clusters no longer change.
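
A minimal from-scratch sketch of the steps above, written in plain NumPy with hypothetical sample data (it ignores the rare empty-cluster case for simplicity):

    import numpy as np

    def kmeans(X, k=2, max_iter=100, seed=0):
        rng = np.random.default_rng(seed)
        # Step 1: pick k random data points as the initial centroids.
        centroids = X[rng.choice(len(X), size=k, replace=False)]
        for _ in range(max_iter):
            # Step 2: compute the distance from every point to every centroid.
            dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
            # Step 3: assign each point to its closest centroid.
            labels = dists.argmin(axis=1)
            # Step 4: reposition each centroid to the mean of its assigned points.
            new_centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
            # Step 5: stop when the centroids (and hence the clusters) no longer change.
            if np.allclose(new_centroids, centroids):
                break
            centroids = new_centroids
        return centroids, labels

    X = np.array([[1.0, 2.0], [1.5, 1.8], [5.0, 8.0],
                  [8.0, 8.0], [1.0, 0.6], [9.0, 11.0]])
    centroids, labels = kmeans(X, k=2)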

Clustering Types

  • Exclusive (partitioning): Each data point belongs to one and only one cluster (e.g., k-means)
  • Agglomerative: Every data point is initially considered its own cluster. Iterative union of nearest clusters reduces the number of clusters. (e.g., hierarchical clustering)
  • Overlapping: Fuzzy sets are used to cluster data. Data points can belong to multiple clusters with varying degrees of membership (e.g., fuzzy c-means)
  • Probabilistic: A probability distribution is used to determine cluster membership (e.g., grouping keywords such as "man's shoe" and "women's shoe")
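
To illustrate the difference between the first two types, here is a short sketch that runs exclusive (k-means) and agglomerative clustering on the same hypothetical data, assuming scikit-learn is available:

    import numpy as np
    from sklearn.cluster import KMeans, AgglomerativeClustering

    X = np.array([[1.0, 2.0], [1.5, 1.8], [5.0, 8.0],
                  [8.0, 8.0], [1.0, 0.6], [9.0, 11.0]])

    # Exclusive (partitioning): every point ends up in exactly one of k clusters.
    kmeans_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

    # Agglomerative: each point starts as its own cluster, and the nearest
    # clusters are merged until only n_clusters remain.
    agglo_labels = AgglomerativeClustering(n_clusters=2).fit_predict(X)

    print(kmeans_labels, agglo_labels)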

Distance Measures

  • K-Means clustering supports different distance measures
  • Euclidean distance: Commonly used, it's the shortest straight line distance between two points in a space.
  • Manhattan distance: Sum of the absolute differences in the coordinates between two points
  • Squared Euclidean distance: Euclidean distance squared
  • Cosine distance: Used for data where the direction of the vectors matters more than their magnitude
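
As a quick illustration, the measures above can be computed for two hypothetical points with NumPy:

    import numpy as np

    a = np.array([1.0, 2.0])
    b = np.array([4.0, 6.0])

    euclidean = np.linalg.norm(a - b)           # straight-line distance
    manhattan = np.abs(a - b).sum()             # sum of absolute coordinate differences
    squared_euclidean = euclidean ** 2          # Euclidean distance squared
    cosine = 1 - a.dot(b) / (np.linalg.norm(a) * np.linalg.norm(b))  # based on the angle

    print(euclidean, manhattan, squared_euclidean, cosine)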

How K-Means Clustering Works

  • The algorithm's steps, the calculations involved, and how convergence is detected
  • How to find the elbow point and why it is important for determining the ideal number of clusters (see the sketch below)
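
A minimal sketch of the elbow method, assuming scikit-learn is available (the fitted model exposes WSS as inertia_; the data is hypothetical):

    import numpy as np
    from sklearn.cluster import KMeans

    X = np.array([[1.0, 2.0], [1.5, 1.8], [5.0, 8.0],
                  [8.0, 8.0], [1.0, 0.6], [9.0, 11.0]])

    # Compute WSS for a range of K values; the "elbow" is the K beyond which
    # adding more clusters barely reduces WSS.
    for k in range(1, 6):
        wss = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X).inertia_
        print(k, wss)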

Apriori Algorithm

  • Uses prior knowledge of frequent itemset properties
  • Iterative: frequent itemsets of size k are used to generate candidate itemsets of size k+1
  • Apriori Property: All subsets of a frequent itemset must be frequent; if an itemset is infrequent, all of its supersets are infrequent
  • Candidate rules are then evaluated by calculating support, confidence, and lift
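
A simplified sketch of this level-wise idea in plain Python (the transactions and minimum support are hypothetical, and the candidate generation is kept deliberately simple):

    transactions = [
        {"cookie", "cake", "milk"},
        {"cookie", "cake"},
        {"cookie", "bread"},
        {"cake", "milk"},
    ]
    min_support = 0.5  # keep itemsets present in at least half of the transactions

    def support(itemset):
        # Fraction of transactions that contain every item in the itemset.
        return sum(itemset <= t for t in transactions) / len(transactions)

    # Level 1: frequent single-item itemsets.
    items = {item for t in transactions for item in t}
    level = {frozenset([i]) for i in items if support(frozenset([i])) >= min_support}
    frequent = set(level)

    # Level k+1: candidates are built only from frequent k-itemsets, because the
    # Apriori property guarantees an itemset with an infrequent subset cannot be frequent.
    k = 1
    while level:
        candidates = {a | b for a in level for b in level if len(a | b) == k + 1}
        level = {c for c in candidates if support(c) >= min_support}
        frequent |= level
        k += 1

    print(sorted(map(sorted, frequent)))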

Support

  • Probability of an itemset appearing in transactions
  • Measured as the number of transactions containing the itemset divided by the total number of transactions

Confidence

  • Conditional probability of a consequent item given an antecedent item
  • Measured by dividing the support of the combined antecedent-and-consequent itemset by the support of the antecedent itemset

Lift

  • Ratio of observed to expected support between items
  • A lift of 1 suggests independence between items; a value greater than 1 suggests a positive association
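
A small worked sketch tying the three measures together for a hypothetical rule {Cookie -> Cake} over made-up transactions:

    transactions = [
        {"cookie", "cake"},
        {"cookie", "cake", "milk"},
        {"cookie"},
        {"cake"},
        {"milk"},
    ]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions containing every item in the itemset.
        return sum(itemset <= t for t in transactions) / n

    antecedent, consequent = {"cookie"}, {"cake"}     # rule: cookie -> cake

    rule_support = support(antecedent | consequent)   # 2/5 = 0.4
    confidence = rule_support / support(antecedent)   # 0.4 / 0.6 ≈ 0.67
    lift = confidence / support(consequent)           # 0.67 / 0.6 ≈ 1.11 (> 1: positive association)

    print(rule_support, confidence, lift)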
