Questions and Answers
What technique is used for grouping individuals or objects into unknown groups in cluster analysis?
- K-means clustering (correct)
- Hierarchical clustering
- Discriminant analysis
- Non-hierarchical clustering
Which diagram is recommended before performing clustering to display the main characteristics of underlying clusters?
- Profile diagram
- Bar chart
- Box plot
- Scatter diagram (correct)
What is a helpful technique for a moderate number of variables in cluster analysis?
- Pie chart
- Histogram
- Profile diagram (correct)
- Scatter plot
Which distance measure is commonly used in clustering based on distances between observations and clusters?
In cluster analysis, what type of data checking is advisable due to sensitivity to outliers?
What can yield different clusters in cluster analysis?
What is the purpose of the centroid method in clustering?
In clustering, what does a large increase in distance in the dendrogram indicate?
Why might it be desirable to perform principal components analysis or factor analysis before clustering?
What is a common approach to evaluate clustering results when different runs produce varying outputs?
Why is it recommended to split data and run clustering on both halves to see if they yield similar results?
What is a key aspect to consider when trying different clustering methods and number of clusters?
What is the purpose of standardizing variables in clustering when they have different units of measurement?
In hierarchical clustering, what happens in each successive step in agglomerative methods?
What is the key concept behind the centroid method in agglomerative hierarchical clustering?
What is the primary approach used to split clusters in divisive hierarchical methods?
Why might hierarchical procedures mislead investigators in cluster analysis?
What is considered an important issue when performing cluster analysis?