The Data Explosion

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

What is the estimated daily volume of data generated by NASA's current Earth observation satellites?

100 exabytes
1 terabyte (correct)
100 gigabytes
10 petabytes

Approximately how many users are there on Facebook?

100 million
900 million (correct)
500 million
1.5 billion

What is the estimated number of tweets sent daily on Twitter?

350 million (correct)
100 million
500 million
200 million

What is the estimated number of websites?

650 million (C) Signup and view all the answers

What type of data is recorded by CCTV recordings?

Non-symbolic data (B) Signup and view all the answers

What is the purpose of a Data Warehouse?

To store and analyze customer transactions (B) Signup and view all the answers

What is a consequence of the vast amounts of data being stored?

Most of the data is not examined in detail. (B) Signup and view all the answers

What is the potential of machine learning technology?

To solve the problem of the tidal wave of data. (A) Signup and view all the answers

What is the goal of knowledge discovery?

To extract implicit, previously unknown and potentially useful information from data. (B) Signup and view all the answers

What is the role of data mining in knowledge discovery?

It is a central part of the knowledge discovery process. (D) Signup and view all the answers

What is the outcome of the knowledge discovery process?

New and potentially useful knowledge. (C) Signup and view all the answers

What happens to most of the data that is stored?

It is merely stored and never examined. (D) Signup and view all the answers

What is the current state of the world in terms of data and knowledge?

Data rich but knowledge poor. (B) Signup and view all the answers

What is a potential application of knowledge discovery?

All of the above. (D) Signup and view all the answers

What is the primary goal of using labelled data in data mining?

To predict the value of a designated attribute for unseen instances (A) Signup and view all the answers

What is the term for data mining using unlabelled data?

Unsupervised learning (B) Signup and view all the answers

What is the task called when the designated attribute is categorical?

Classification (A) Signup and view all the answers

What is the term for a dataset of examples, each comprising the values of a number of variables?

Instances (D) Signup and view all the answers

What is the goal of data mining when using unlabelled data?

To extract the most information from the data available (C) Signup and view all the answers

What is the term for the process of predicting a numerical outcome?

Regression (D) Signup and view all the answers

What is the primary goal of classification in data mining?

To predict the value of a categorical attribute (A) Signup and view all the answers

What is the term for data that has a specially designated attribute?

Labelled data (B) Signup and view all the answers

What is the goal of the analysis in the given dataset?

To predict the degree classification for other students given their grade profiles (A) Signup and view all the answers

What method involves identifying the closest examples to an unclassified instance?

Nearest Neighbour Matching (A) Signup and view all the answers

What is the purpose of a classification tree?

To generate classification rules (C) Signup and view all the answers

What type of structure is used to generate classification rules?

Decision Tree (A), Classification Tree (C) Signup and view all the answers

What is the form of the dataset?

A table containing students' grades on five subjects (C) Signup and view all the answers

What is the purpose of the classification rules?

To predict the degree classification of an unseen instance (D) Signup and view all the answers

What is the result of applying the nearest neighbour matching method?

A predicted degree classification for an unseen instance (D) Signup and view all the answers

What is the relationship between the attributes in the dataset?

The attributes are used to predict the degree classification (A) Signup and view all the answers

What is the primary goal of market basket analysis?

To find relationships between product purchases (C) Signup and view all the answers

What is the purpose of stating association rules with additional information?

To indicate the reliability of the rules (B) Signup and view all the answers

What is the main difference between supervised and unsupervised learning?

The presence of labeled data (C) Signup and view all the answers

What is the purpose of clustering algorithms?

To find groups of similar items (C) Signup and view all the answers

What is an example of a clustering application?

Fault diagnosis (D) Signup and view all the answers

What is the concept of 'IF variable 1 > 85 and switch 6 = open THEN variable 23 < 47.5 and switch 8 = closed (probability = 0.8)' an example of?

Association rule (A) Signup and view all the answers

What is the term for the type of prediction where the value to be predicted is a label?

Classification (A) Signup and view all the answers

What is the term for the process of finding relationships between product purchases?

Market basket analysis (A) Signup and view all the answers

Flashcards are hidden until you start studying

Study Notes