Data Mining and Machine Learning True or False Quiz

SnazzyForsythia avatar
SnazzyForsythia
·
·
Download

Start Quiz

Study Flashcards

12 Questions

Which machine learning technique would be the most promising to start with for predicting whether a person is likely to develop cancer based on medical history, diet, and heredity factors?

Classification

Which of the following is an example of an unsupervised learning algorithm?

K-Means

Which of the following is NOT a machine learning technique?

Linear Components Analytics

What does the term ETL stand for?

Extract, Transform, Load

What does the term OLAP stand for?

Online Analytical Processing

What is a database where all the values for a particular column are stored contiguously called?

Column-oriented storage

What is data mining?

A process designed to detect patterns in data sets

In unsupervised learning, what is the role of the outcome variable?

It is ignored by the learning algorithm

What is the primary goal of unsupervised learning?

To cluster data into groups with similar characteristics

Which type of learning is regression analysis an example of?

Supervised learning

If you want to predict future sales based on historical customer data, which technique would be most appropriate?

Regression

What is the primary difference between supervised and unsupervised learning?

Supervised learning uses labeled data, while unsupervised learning does not

Study Notes

Data Mining and Machine Learning

  • Data Mining is a process designed to detect patterns in data sets.
  • Unsupervised learning does not involve building a statistical model for predicting or estimating an output based on inputs.
  • In unsupervised learning, the learning algorithm is not trained using data attributes paired with an outcome variable.

Regression Analysis

  • Regression analysis involves developing a model where one or more inputs are used to predict an output variable.
  • Regression represents supervised learning.

Predicting Future Sales

  • To predict future sales using a dataset with sales data for every customer over several years, the most appropriate technique to investigate is Regression.

Determining Cancer Likelihood

  • To determine whether a person is likely to develop cancer using data including medical history, diet, and heredity factors, the most promising technique to start with is Classification.

Unsupervised Learning Algorithm

  • K-Means is an example of an unsupervised learning algorithm.

Prediction Outcome Variable

  • A prediction outcome variable does not have to be categorical.

Machine Learning Techniques

  • Linear Components Analytics is not a machine learning technique.
  • Regression, Clustering, and Neural Networks are machine learning techniques.

Bias in Supervised Learning

  • In a supervised learning model, Bias does not refer to the error introduced from the assumptions of the data analyst.

Data Mining Objective

  • The objective of Data Mining is to identify valid, novel, and potentially useful and understandable correlations and patterns in existing data.

NOSQL Analytics Database

  • Cassandra is an example of a NOSQL Analytics database.

ETL

  • ETL stands for Extract, Transform, Load.

Data Warehouse

  • Unidimensional data is not stored in a star schema format in a data warehouse.

OLAP

  • OLAP stands for Online Analytical Processing.

Column-oriented Storage

  • A database where all values for a particular column are stored contiguously is called Column-oriented storage.

Test your knowledge on data mining and machine learning with this True or False quiz. Questions cover topics such as detecting patterns in data sets and unsupervised learning algorithms.

Make Your Own Quizzes and Flashcards

Convert your notes into interactive study material.

Get started for free

More Quizzes Like This

Use Quizgecko on...
Browser
Browser