Data Science Section A Quiz

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

What is the measure of central tendency that represents the most frequently occurring value in a dataset?

Median
Mode (correct)
Mean
Range

If a dataset has an even number of observations, how is the median determined?

The maximum value
The mean of the two middle values (correct)
The last middle value
The first middle value

Which of the following is not a measure of dispersion?

Range
Variance
Mode (correct)
Standard deviation

What is the range of a dataset?

The difference between the highest and lowest values (B) Signup and view all the answers

Which measure of central tendency is most sensitive to extreme values?

Mean (B) Signup and view all the answers

What is the formula for calculating the variance?

(sum of squared deviations) / (number of values) (A) Signup and view all the answers

Which measure of spread is equal to the square root of the variance?

Standard deviation (A) Signup and view all the answers

What is a significant impact of Data Science on businesses?

Improved decision-making and efficiency (D) Signup and view all the answers

What are the three key components of Data Science?

Data, Statistics, and Visualization (D) Signup and view all the answers

Which of the following is a supervised learning technique?

Linear Regression (A) Signup and view all the answers

What is the difference between precision and recall?

Precision measures the number of true positives, while recall measures the number of false negatives (A) Signup and view all the answers

Which of the following is a data visualization technique?

Box Plot (C) Signup and view all the answers

What is the goal of feature engineering?

To transform the features into a more suitable representation for a machine learning algorithm (B) Signup and view all the answers

What is the purpose of cross-validation?

To ensure that the model is not overfitting the data (D) Signup and view all the answers

What is the purpose of hypothesis testing in data science?

To determine if a sample statistic is significantly different from a population parameter (A) Signup and view all the answers

How can you define a function in Python that accepts an arbitrary number of positional arguments?

Using the *args parameter (A) Signup and view all the answers

Which data structure is primarily used in NumPy for handling arrays?

ndarray (A) Signup and view all the answers

Which method is used to create a NumPy array of integers ranging from 0 to 9?

np.arange(10) (B) Signup and view all the answers

What is the default data type of elements in a NumPy array?

Integer (C) Signup and view all the answers

What will be the result of the operation np.array([1, 2, 3]) + np.array([4, 5, 6])?

[5, 7, 9] (B) Signup and view all the answers

How can you access the element at the second row and third column of a NumPy array arr?

arr[2, 3] (B) Signup and view all the answers

What followed the equation of the regression line y = 2x + 3 when x is 5?

13 (A) Signup and view all the answers

Which of the following is NOT a common assumption of linear regression?

Multicollinearity (A) Signup and view all the answers

In logistic regression, what type of outcome does the dependent variable typically represent?

A binary outcome or category (B) Signup and view all the answers

What is the primary purpose of conducting residual analysis in regression models?

To identify outliers and assess the model's assumptions (C) Signup and view all the answers

When performing polynomial regression, what effect does increasing the degree of the polynomial generally have?

Overfitting the data (C) Signup and view all the answers

Which type of regression is typically preferred when dealing with multicollinearity among independent variables?

Lasso Regression (D) Signup and view all the answers

In the context of regression analysis, which of the following is an example of a dependent variable?

Sales (B) Signup and view all the answers

Given the function f(x) = x^3 + 3x^2 - 24*x + 7, what is true about x=2?

x=2 will give the minimum for f(x) (B) Signup and view all the answers

What distinguishes linear regression from logistic regression?

Linear regression produces a linear outcome, while logistic regression produces a binary outcome (C) Signup and view all the answers

Which of the following accurately describes the purpose of logistic regression?

To predict categorical variables (A) Signup and view all the answers

Which measure indicates how well the linear regression model fits the data?

R-squared (A) Signup and view all the answers

What does the correlation coefficient measure in regression analysis?

The strength and direction of the relationship (B) Signup and view all the answers

What is a primary objective of k-means clustering?

To minimize the distance within clusters (A) Signup and view all the answers

How do k-means clustering and hierarchical clustering primarily differ?

K-means uses centroids, while hierarchical uses distance measures (B) Signup and view all the answers

What is a limitation of using k-means clustering?

It requires a priori knowledge of the number of clusters (A) Signup and view all the answers

What is the function of a hyperparameter in the gradient descent algorithm?

To set the learning rate (D) Signup and view all the answers

What is a disadvantage of using a low learning rate in gradient descent?

The algorithm may converge slowly (B) Signup and view all the answers

What is the condition on a and b for which the given system of linear equations has no solution?

a ≠ 4, 2a + b − 6 = 0 (A) Signup and view all the answers

Which statement is true about the determinant of a matrix?

The determinant of a diagonal matrix is the product of its diagonal entries. (B) Signup and view all the answers

Using the provided confusion matrix for classification, how is accuracy calculated?

(True Positive + True Negative) / Total Predictions (C) Signup and view all the answers

What distinguishes simple linear regression from multiple regression?

Simple linear regression involves only one independent variable, while multiple involves more than one. (D) Signup and view all the answers

What is the goal of multivariate optimization?

To find the minimum or maximum of a function with multiple variables. (D) Signup and view all the answers

Which method can be used to find the minimum of a function with multiple variables without derivatives?

Gradient descent (A) Signup and view all the answers

What does pruning in decision trees achieve?

Reduces the complexity of the tree by removing unnecessary branches. (C) Signup and view all the answers

Flashcards

Key components of Data Science

Data, statistics, and visualization are the core elements of data science.

Supervised Learning Technique

A type of machine learning where the model learns from labeled data.