Podcast
Questions and Answers
What determines the clusters significantly when the number of data is not large?
What determines the clusters significantly when the number of data is not large?
- Number of clusters
- Objects clustering
- Initial grouping (correct)
- Iteration stability
Which factor must be determined before carrying out k-mean clustering?
Which factor must be determined before carrying out k-mean clustering?
- Number of clusters (K) (correct)
- Objects-Centroids distances
- Number of iterations
- Initial condition
What is a disadvantage of k-mean clustering according to the text?
What is a disadvantage of k-mean clustering according to the text?
- It is not sensitive to initial conditions
- It yields the same result with each run
- Resulting clusters depend on initial random assignments (correct)
- It guarantees the real cluster will be known
Why might the algorithm of k-mean clustering be trapped in a local optimum?
Why might the algorithm of k-mean clustering be trapped in a local optimum?
What indicates that the computation of k-mean clustering has reached stability?
What indicates that the computation of k-mean clustering has reached stability?
Why does k-mean clustering not yield the same result with each run according to the text?
Why does k-mean clustering not yield the same result with each run according to the text?
What may produce different cluster results if the same data is inputted in a different order?
What may produce different cluster results if the same data is inputted in a different order?
Why is big data considered challenging in terms of volume?
Why is big data considered challenging in terms of volume?
What kind of data feed is mentioned as contributing to the increase in data volume?
What kind of data feed is mentioned as contributing to the increase in data volume?
In what way does high-frequency stock trading impact the data landscape?
In what way does high-frequency stock trading impact the data landscape?
Which type of devices generate massive log data in real-time according to the text?
Which type of devices generate massive log data in real-time according to the text?
What is a characteristic of on-line gaming systems mentioned in relation to data volume?
What is a characteristic of on-line gaming systems mentioned in relation to data volume?
How has the scale of data storage evolved since 2000 based on the information provided?
How has the scale of data storage evolved since 2000 based on the information provided?
What is characteristic of sensors embedded into everyday objects that contribute to big data?
What is characteristic of sensors embedded into everyday objects that contribute to big data?
Why do clickstreams and ad impressions capture user behavior at high event rates?
Why do clickstreams and ad impressions capture user behavior at high event rates?
What is a key aspect of machine to machine processes mentioned in terms of data exchange?
What is a key aspect of machine to machine processes mentioned in terms of data exchange?
What impact do smart phones and sensors embedded into everyday objects have on data generation according to the text?
What impact do smart phones and sensors embedded into everyday objects have on data generation according to the text?