7 - Hierarchical Clustering
17 Questions
0 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is the purpose of combining rows and columns using Lance-Williams update equations?

To find the position of the minimum distance or maximum similarity.

What is the general form of Lance-Williams update equations?

Several (but not all) linkages can be expressed in this form.

Which linkages cannot be computed with Lance-Williams updates?

MiniMax, Medoid, Hausdorff linkages.

What is the purpose of extracting clusters from a dendrogram?

<p>To obtain the clusters from the hierarchical structure.</p> Signup and view all the answers

Why is it recommended to avoid computing directly in hierarchical clustering?

<p>Direct computation is expensive.</p> Signup and view all the answers

What is the basic idea behind Hierarchical Agglomerative Clustering?

<p>To initially consider every object as a cluster and then iteratively merge the two most similar clusters until only one cluster remains.</p> Signup and view all the answers

What are some variations in Hierarchical Agglomerative Clustering?

<p>Variations include different distance or similarity measures for objects, different distance measures for clusters after merging (Linkage), and various optimizations.</p> Signup and view all the answers

What is Single-linkage in terms of distances of clusters?

<p>Single-linkage considers the minimum distance between clusters as the basis for merging, representing maximum similarity.</p> Signup and view all the answers

Describe the Complete-linkage approach in Hierarchical Agglomerative Clustering.

<p>Complete-linkage involves considering the maximum distance between clusters, indicating minimum similarity, for merging.</p> Signup and view all the answers

What is the concept of Ward-linkage in Hierarchical Agglomerative Clustering?

<p>Ward-linkage focuses on the minimum increase of squared error when merging clusters.</p> Signup and view all the answers

Explain the process of AGNES (Agglomerative Nesting) in clustering.

<p>AGNES involves computing the pairwise distance matrix of objects and using the Lance-Williams equations for merging clusters.</p> Signup and view all the answers

What are some strategies to determine when to stop the hierarchical agglomerative clustering process?

<p>visually inspect the dendrogram, choose interesting branches; stop when clusters remain; stop at a certain distance; significant change in distance; significance via bootstrap resampling; change in cluster sizes or density; constraints satisfied (semi-supervised)</p> Signup and view all the answers

What are some benefits of Hierarchical Agglomerative Clustering (HAC)?

<p>very general: any distance / similarity; easy to understand and interpret; hierarchical result; dendrogram visualization often useful; number of clusters does not need to be fixed beforehand; many variants</p> Signup and view all the answers

What is the main limitation of Hierarchical Agglomerative Clustering (HAC) in terms of scalability?

<p>scalability is the main problem</p> Signup and view all the answers

What is a common issue that users face with Hierarchical Agglomerative Clustering (HAC) in terms of the desired clustering outcome?

<p>in many cases, users want a flat partitioning</p> Signup and view all the answers

What are some challenges or limitations of Hierarchical Agglomerative Clustering (HAC) related to cluster sizes and outliers?

<p>unbalanced cluster sizes; outliers/noise for some linkage strategies</p> Signup and view all the answers

What is one approach to incorporating supervision in Hierarchical Agglomerative Clustering (HAC)?

<p>constraints satisfied (semi-supervised): certain objects are labeled as 'must' or 'should not' be in the same clusters</p> Signup and view all the answers

More Like This

Introduction to Agglomerative Methods
13 questions
Introduction to Agglomerative Methods
13 questions
Use Quizgecko on...
Browser
Browser