Questions and Answers
What is a drawback of the maximal margin classifier?
The support vector classifier aims to perfectly separate the two classes.
False
What is the primary role of the hyperplane in the support vector classifier?
What are observations that lie directly on the margin or on the wrong side of the margin for their class called?
If a slack variable $\epsilon_i$ is greater than 1, it indicates that the observation is on the wrong side of the margin.
In a support vector classifier, changing the position of an observation that lies strictly on the correct side of the margin will ___ the classifier.
What happens to the margin of a support vector classifier as the regularization parameter C increases?
A small C value leads to a classifier with high bias and low variance.
What does the acronym SVM stand for?
The maximal margin classifier is the most complex form of SVM.
What is the purpose of a hyperplane in SVM?
The vector β in the hyperplane equation β0 + β1 X1 + β2 X2 + ... + βp Xp = 0 is known as the ______.
What method is used in SVM when there are more than 2 classes?
Support Vector Machine (SVM) is more effective than Logistic Regression (LR) when classes are not separable.
What is the loss function used in support vector classifier optimization?
When $y_i(\beta_0 + \beta_1x_{i1} +...+ \beta_px_{ip})$ is greater than 1, the SVM loss is ______.
Match the following concepts with their descriptions:
What characterizes a support vector machine compared to a support vector classifier?
The radial kernel has a global behavior, meaning all training observations affect the predicted class label for a test observation.
What is the role of the parameter gamma (γ) in the radial basis kernel?
Support vector machines utilize kernels to compute the __________ needed for different dimensions.
Match the kernel types with their characteristics:
Which of the following best describes the polynomial kernel?
As the distance between a test observation and a training observation increases, the contribution of that training observation to the prediction increases.
What happens to the predicted class label when the training observations are far from the test observation?
Study Notes
Introduction to Machine Learning - AI 305: Support Vector Machines (SVM)
- Support Vector Machines (SVMs) are a classification approach developed in the 1990s, gaining popularity since.
- SVMs perform well in various settings and are considered strong "out-of-the-box" classifiers.
- The core concept is the maximal margin classifier.
- The support vector classifier extends the maximal margin classifier for broader datasets.
- Support Vector Machines (SVM) extend the support vector classifier further to accommodate non-linear class boundaries.
Contents
- Maximal Margin Classifier
- Support Vector Classifier
- Support Vector Machine
- SVM for Multiclass Problems
- SVM vs. Logistic Regression
Introduction - Continued
- Support Vector Machines (SVMs) are an approach for classification, originally developed in the computer science community during the 1990s.
- The popularity has grown since then.
- These approaches perform well across a range of contexts, frequently being regarded as one of the best "off-the-shelf" or pre-built classifiers.
- The approach handles two-class classification problems directly.
- Trying to find a plane that cleanly segregates the classes in feature space is the first step.
- If a separating plane can't be readily identified, two strategies are employed: refining the meaning of "separates", or expanding the feature space to enable separation.
What is a Hyperplane?
- A hyperplane in p-dimensions is an affine subspace of dimension p−1.
- The generic equation for a hyperplane is: β0 + β1X1 + β2X2 + ... + βpXp = 0
- In two dimensions, a hyperplane is a line.
- In three dimensions, it's a plane.
- If β0 = 0, the hyperplane passes through the origin; otherwise, it does not.
- The vector β = (β1, β2, ..., βp) is called the normal vector; it points orthogonal to the hyperplane's surface.
Hyperplanes - Example
- Let the hyperplane be represented as: 1 + 2X1 + 3X2 = 0.
- The blue region represents the points where 1 +2X1 + 3X2 > 0.
- The purple region represents the points where 1 + 2X1 + 3X2 < 0.
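As a minimal sketch of this example (NumPy-based; the three test points are chosen arbitrarily for illustration), the region a point falls in can be read off from the sign of f(x) = 1 + 2X1 + 3X2:

```python
import numpy as np

# Hyperplane from the example above: f(x) = 1 + 2*X1 + 3*X2 = 0
beta0 = 1.0
beta = np.array([2.0, 3.0])

# Three arbitrary test points (hypothetical, for illustration only)
points = np.array([
    [ 1.0,  1.0],   # f = 1 + 2 + 3 =  6 > 0  -> blue region
    [-1.0, -1.0],   # f = 1 - 2 - 3 = -4 < 0  -> purple region
    [ 1.0, -1.0],   # f = 1 + 2 - 3 =  0      -> exactly on the hyperplane
])

f = beta0 + points @ beta   # evaluate f(x) for each point
print(f)                    # [ 6. -4.  0.]
print(np.sign(f))           # [ 1. -1.  0.] : which side of the hyperplane
```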
Classification using a Separating Hyperplane
- Given an n × p data matrix X of n training observations in p-dimensional space, where these observations fall into two classes (y1, ..., yn ∈ {−1, +1}).
- The objective is to develop a classifier to categorize the test observation based on its feature measurements.
- A variety of techniques are used (logistic regression, classification trees, bagging, boosting).
- This approach introduces a novel method based on a separating hyperplane concept.
Separating Hyperplanes
- If f(X) = β0 + β1X1 + ... + βpXp, then f(x) > 0 for points on one side of the hyperplane and f(x) < 0 for points on the other side.
- If yi = +1 for blue points and yi = −1 for purple points, then yi f(xi) > 0 for all i.
- f(x) = 0 defines a separating hyperplane.
Maximal Margin Classifier
- Among all separating hyperplanes, it seeks the one maximizing the gap (margin) between the two classes.
- The maximal margin hyperplane is the solution of the optimization problem that minimizes ‖β‖² subject to a set of constraints.
- The constraints enforce that each observation must fall on the correct side of the hyperplane and maintain a distance at least M from it, with M being the margin width.
- This formulation can be resolved effectively as a convex quadratic program.
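For reference, one standard and equivalent way to write this optimization problem (assuming the training data are linearly separable) is:

$$
\begin{aligned}
\min_{\beta_0,\,\beta}\quad & \tfrac{1}{2}\,\lVert \beta \rVert^{2} \\
\text{subject to}\quad & y_i\bigl(\beta_0 + \beta_1 x_{i1} + \dots + \beta_p x_{ip}\bigr) \ge 1,
\qquad i = 1, \dots, n,
\end{aligned}
$$

with resulting margin width M = 1/‖β‖, which is why minimizing ‖β‖² maximizes the margin.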
Non-separable Data
- Data that cannot be separated by a linear boundary using the specified criterion.
- In that case there is no solution with a margin larger than zero; this is the typical situation unless the number of observations n is smaller than the dimensionality p.
- The generalization of the maximal margin classifier, accommodating non-separable cases is called a support vector classifier, employing a "soft margin".
Noisy Data
- Data that is separable but includes noise, potentially leading to a less desirable solution for maximal-margin classifiers.
- For this case the support vector classifier maximizes a soft margin.
Drawbacks of Maximal Margin Classifiers
- Classifiers based on separating hyperplanes invariably perfectly classify all training observations, leading to increased sensitivity towards individual observations.
- The addition of a single new observation can dramatically alter the maximal margin hyperplane.
- A hyperplane with a very narrow margin is undesirable: when observations lie close to the hyperplane, there is little confidence that they have been classified correctly.
Support Vector Classifiers
- Given the limitations of the maximal margin classifier, support vector classifiers (also called soft margin classifiers) are introduced; they tolerate misclassification of a few observations in order to classify the remaining data points better.
- They use less restrictive conditions on hyperplane selection, aiming to improve overall classification accuracy.
Support Vector Classifier - Continued
- The optimization problem is structured in such a way that only observations on or violating the margin affect the hyperplane.
- Points that lie directly on the margin, or on the "wrong" side are considered "support vectors" and control the margin boundaries.
- These “support vectors” significantly influence the SVM classifier.
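As an illustrative sketch (scikit-learn's SVC with a linear kernel is used here as a stand-in for the support vector classifier, and the toy data are made up), the fitted model exposes which observations act as support vectors:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Toy two-class data in two dimensions (hypothetical, for illustration only)
X = np.vstack([
    rng.normal(loc=[ 2.0,  2.0], scale=1.0, size=(20, 2)),
    rng.normal(loc=[-2.0, -2.0], scale=1.0, size=(20, 2)),
])
y = np.array([1] * 20 + [-1] * 20)

clf = SVC(kernel="linear", C=1.0).fit(X, y)

# Only observations on or violating the margin become support vectors;
# moving any point that lies strictly on the correct side of the margin
# leaves the fitted hyperplane unchanged.
print(clf.support_)               # indices of the support vectors
print(clf.support_vectors_)       # their coordinates
print(clf.coef_, clf.intercept_)  # the fitted beta and beta_0
```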
Support Vector Classifier- Continued
- Example illustrating how a support vector classifier is fit to a small dataset; dashed lines indicate the margins around the fitted hyperplane.
- The plots show how data points on or violating the margin affect the position of the hyperplane; the points near the margin in the sample dataset are the support vectors.
Details of the Support Vector Classifier
- SVM classifiers are based on the side of a hyperplane on which a test observation falls.
- The hyperplane is carefully selected to correctly categorize the majority of training observations while tolerating a few possible misclassifications.
- The solution rests on an optimization problem.
- The problem uses a tuning parameter C, the margin width M (equal to the inverse of the norm of the weight vector, M = 1/‖β‖), and slack variables that allow some observations to be on the wrong side of the margin.
Details of the Support Vector Classifier - Continued
- C is a non-negative model tuning parameter.
- M as related to maximizing margin width.
- Slack variables allow individual observations to be on the wrong side of the margin or hyperplane.
Slack Variable
- The slack variable εi reflects the position of the ith observation relative to the margin and the hyperplane.
- εi = 0 indicates the ith observation is on the correct side of the margin.
- εi > 0 indicates the ith observation is on the wrong side of the margin (in violation); εi > 1 implies the ith observation is on the wrong side of the hyperplane.
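In the budget form used in these notes, the soft-margin optimization problem can be written (one standard formulation, stated here for reference) as:

$$
\begin{aligned}
\max_{\beta_0,\,\beta,\,\epsilon_1,\dots,\epsilon_n}\quad & M \\
\text{subject to}\quad & \lVert \beta \rVert = 1, \\
& y_i\bigl(\beta_0 + \beta_1 x_{i1} + \dots + \beta_p x_{ip}\bigr) \ge M(1 - \epsilon_i), \\
& \epsilon_i \ge 0, \qquad \sum_{i=1}^{n} \epsilon_i \le C.
\end{aligned}
$$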
Regularization Parameter C
- C limits the total amount of violations made to the margin or hyperplane.
- It acts as a constraint against a high number of misclassifications on training data.
- C=0 indicates a strict adherence to the margin (no violations allowed).
- A higher C leads to a wider margin and a greater tolerance for margin violations, which affects how confidently observations are categorized; since an observation on the wrong side of the hyperplane has εi > 1, no more than C observations can be on the wrong side of the hyperplane.
The Regularization Parameter C - Continued
- Analyzing the effect of C on the support vector classifier's performance shows how varying C impacts the margin width and the number of support vectors.
- When C is large, the margin is wide and many observations violate it, so many observations influence the hyperplane; this yields a classifier with lower variance but potentially higher bias. Conversely, when C is small, the hyperplane is determined by only a few observations, giving a classifier with lower bias but higher variance.
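A small experiment can make this concrete. Note that in scikit-learn the parameter named C is a penalty on margin violations, so it behaves roughly inversely to the violation budget C used in these notes; the toy data below are hypothetical:

```python
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Toy overlapping two-class data (made up for illustration)
X, y = make_blobs(n_samples=200, centers=2, cluster_std=2.5, random_state=0)

# scikit-learn's C penalizes margin violations, so it acts roughly inversely
# to the violation budget C described above: a small penalty corresponds to a
# large budget (wide margin, many support vectors), and vice versa.
for penalty in [0.01, 1.0, 100.0]:
    clf = SVC(kernel="linear", C=penalty).fit(X, y)
    print(f"penalty C={penalty:>6}: {len(clf.support_)} support vectors")
```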
Robustness of Support Vector Classifiers
- The support vector classifier's decision rule depends only on a potentially small subset of the training observations; these observations are known as support vectors.
- This reliance on support vectors makes the decision rule robust: observations far from the hyperplane, including distant outliers, have little or no influence on it.
- Note the contrast to other classification approaches (for example, linear discriminant analysis).
Linear Boundary Failures
- A linear boundary may fail to separate the classes adequately in some cases, regardless of the value of C.
- Data patterns requiring non-linear decision boundaries could also be solved by employing non-linear transformations in the original feature space.
Feature Expansion
- Feature space is enlarged by introducing polynomial or other transformations.
- The support vector machine in this enlarged space may find a separating hyperplane that produces a non-linear decision boundary in the original input space (e.g., using quadratic, cubic, or higher-order polynomial expansions).
- The optimization problem will be altered to reflect the higher dimensionality space.
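A minimal sketch of this idea, assuming scikit-learn: the feature space is enlarged with PolynomialFeatures and a linear support vector classifier is then fit in the enlarged space (the circular toy data are made up; a kernel SVM would achieve the same effect implicitly):

```python
from sklearn.datasets import make_circles
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.svm import LinearSVC

# Two classes that no straight line can separate in the original 2-D space
X, y = make_circles(n_samples=200, factor=0.4, noise=0.05, random_state=0)

# Expand (X1, X2) with polynomial terms (X1^2, X1*X2, X2^2, ...), then fit a
# *linear* support vector classifier in the enlarged space; the resulting
# decision boundary is non-linear in the original two features.
model = make_pipeline(
    PolynomialFeatures(degree=3, include_bias=False),
    StandardScaler(),
    LinearSVC(C=1.0),
)
model.fit(X, y)
print(model.score(X, y))   # training accuracy; near 1.0 after the expansion
```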
Feature Expansion - Example
- This example demonstrates how enlarging feature space with specific transformations can produce a non-linear decision boundary.
- Illustrating practical application.
Cubic Polynomials
- Illustrates a cubic polynomial basis expansion, growing the feature space from 2 to 9 variables.
- Applying this transformation to a specific dataset (plotted sample) yields a support vector classifier solution to the non-linear separation problem.
SVMs: More Than Two Classes
- Classic Support Vector Machine implementations work for only two classes; this section discusses multi-class expansions.
- The "one-versus-all" (OVA) approach fits individual classifiers (one vs all other classes) resulting K classifiers.
- The class assignment is determined based on the maximum value amongst all these classifiers for a given observation.
- The "one-versus-one" (OVO) approach fits all pairwise combinations yielding K(K−1)/2 classifiers; the class with the most winning pairwise competitions is chosen for the input example.
SVM vs. Logistic Regression
- The optimization problem in SVMs can be rephrased using a "hinge" loss function that closely resembles the "loss" function used in logistic regression (negative log-likelihood).
- The loss functions of both approaches have notable similarities in their respective shapes.
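For comparison, writing f(x) = β0 + β1X1 + ... + βpXp, the two per-observation losses for yi ∈ {−1, +1} are:

$$
\text{hinge (SVM)}:\; L\bigl(y_i, f(x_i)\bigr) = \max\bigl[\,0,\; 1 - y_i f(x_i)\,\bigr],
\qquad
\text{logistic}:\; L\bigl(y_i, f(x_i)\bigr) = \log\bigl(1 + e^{-y_i f(x_i)}\bigr).
$$

The hinge loss is exactly zero whenever yi f(xi) > 1 (the observation is safely on the correct side of the margin), whereas the logistic loss is small there but never exactly zero.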
Which to Use: SVM or Logistic Regression?
- SVMs outperform logistic regression when the classes are clearly separable and a linear boundary can readily be identified.
- In cases where the classes are not well separated, logistic regression with a regularisation penalty and the support vector classifier generally yield similar outcomes.
- When estimating probabilities, logistic regression is the more appropriate choice.
- In cases where non-linear boundaries or high dimensionality are required, kernel SVMs may be prioritized due to their adaptability; however, they typically require more computations.
End
Description
Test your knowledge on support vector classifiers and their components. This quiz covers topics like maximal margin classifiers, hyperplanes, slack variables, and observations in relation to the margin. Challenge yourself with these essential concepts in machine learning.