Podcast
Questions and Answers
best fit line
best fit line
finite number
finite number
Which clustering algorithm is used to generate a posteriori distribution
over the collection of all partitions of the data?
Which clustering algorithm is used to generate a posteriori distribution over the collection of all partitions of the data?
It is considered as one of the easiest statistical models in machine
learning.
It is considered as one of the easiest statistical models in machine learning.
Signup and view all the answers
A ______ variables are numeric variables that have an infinite number of
values between any two values.
A ______ variables are numeric variables that have an infinite number of values between any two values.
Signup and view all the answers
Which type of hierarchical clustering where the dendrogram is built from
the bottom level by merging the most similar (or nearest) pair of
clusters?
Which type of hierarchical clustering where the dendrogram is built from the bottom level by merging the most similar (or nearest) pair of clusters?
Signup and view all the answers
Which of the following does not belong to the group?
Which of the following does not belong to the group?
Signup and view all the answers
A _______ algorithm takes a known set of input data (the training set)
and known responses to the data (output), and trains a model to
generate reasonable predictions for the response to new input data.
A _______ algorithm takes a known set of input data (the training set) and known responses to the data (output), and trains a model to generate reasonable predictions for the response to new input data.
Signup and view all the answers
This refers to the organization of unlabeled data into similarity
groups called clusters.
This refers to the organization of unlabeled data into similarity groups called clusters.
Signup and view all the answers
He was a London physician who plotted the location of the cholera
deaths on a map during an outbreak in the 1850s.
He was a London physician who plotted the location of the cholera deaths on a map during an outbreak in the 1850s.
Signup and view all the answers
Below are the factors needed for clustering except for _?
Below are the factors needed for clustering except for _?
Signup and view all the answers
What are the two (2) types of Hierarchical Algorithms?
What are the two (2) types of Hierarchical Algorithms?
Signup and view all the answers
In machine learning, it is considered to be useful when you want to
explore your data but do not yet have a specific goal or are not sure
what information the data contains.
In machine learning, it is considered to be useful when you want to explore your data but do not yet have a specific goal or are not sure what information the data contains.
Signup and view all the answers
Each cluster has a cluster center, called ___.
Each cluster has a cluster center, called ___.
Signup and view all the answers
are data points that are very far away from other data points and
could be errors in the data recording or some special data points with
very different values.
are data points that are very far away from other data points and could be errors in the data recording or some special data points with very different values.
Signup and view all the answers
This refers to a type of centrality where node is the average length of
the shortest path from the node to all other nodes.
This refers to a type of centrality where node is the average length of the shortest path from the node to all other nodes.
Signup and view all the answers
Who gave the idea of complete graph and bipartite graph?
Who gave the idea of complete graph and bipartite graph?
Signup and view all the answers
This gives a measure of the ‘tightness' of the Graph and can be used
to understand how quickly/easily something flows in this Network.
This gives a measure of the ‘tightness' of the Graph and can be used to understand how quickly/easily something flows in this Network.
Signup and view all the answers
The line with the ______ value of the sum of square is the best-fit
regression line.
The line with the ______ value of the sum of square is the best-fit regression line.
Signup and view all the answers
This term refers to a collection of data items which are "similar"
between them, and "dissimilar" to data items in other cluster.
This term refers to a collection of data items which are "similar" between them, and "dissimilar" to data items in other cluster.
Signup and view all the answers