16 Questions
What is a dataset?
A collection of data objects
What are the other names for data objects?
Records
Which term describes ordered data such as sequential, temporal, and spatial data?
Ordered data
What are the general characteristics of data sets?
Dimensionality, sparsity, resolution, and distribution
What type of attributes are qualitative and quantitative?
Nominal, binary, and ordinal attributes are qualitative; numeric (interval- and ratio-scaled) attributes are quantitative
Which term refers to the description of data objects by a number of attributes?
Data objects
What are other names for attributes in a data set?
Variables, Characteristics, Fields, Features, Dimensions
In a data set, what do the objects typically represent?
Records
What does each field (or column) correspond to in a data set?
An attribute
What type of data set would be represented by a numerical matrix or crosstabs?
Data matrix (a form of record data)
What type of data set would be represented by text documents with term-frequency vectors?
Document data
What type of data set would be represented by transaction sequences?
Sequential Data
What type of data set would represent spatial data like maps?
Spatial and multimedia
What does each TID (Transaction ID) item in a data set usually represent?
A sequence of items
In a data set, what do the fields of a record correspond to?
Attributes
What would represent image data in a data set?
Spatial and multimedia
Study Notes
What is a Dataset?
- A dataset is a collection of data objects.
Alternative Names for Data Objects
- Data objects can also be referred to as cases, samples, observations, or records.
Ordered Data
- Sequential, temporal, and spatial data are categorized as ordered data.
Characteristics of Data Sets
- Data sets typically consist of a collection of objects or cases, with each object described by a set of attributes or variables.
- Each object or case represents a single entity or observation.
- The attributes or variables describe the characteristics of each object or case.
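The object/attribute structure described above can be sketched in a few lines of Python. This is only an illustration: the toy records, the attribute names, and the helper functions `attribute_names` and `column` are hypothetical and not part of the course material.

```python
# Hypothetical toy data set: each dict is one data object (record),
# and each key is one attribute (field/column).
data_set = [
    {"name": "Alice", "age": 34, "city": "Amman"},
    {"name": "Omar", "age": 27, "city": "Irbid"},
    {"name": "Lina", "age": 41, "city": "Amman"},
]


def attribute_names(records):
    """Return every attribute name that appears in the records, in first-seen order."""
    names = []
    for record in records:
        for key in record:
            if key not in names:
                names.append(key)
    return names


def column(records, attribute):
    """Return the values of one attribute across all data objects."""
    return [record[attribute] for record in records]


print(attribute_names(data_set))  # the attributes (columns)
print(column(data_set, "age"))    # one attribute across all objects
```

Each dictionary plays the role of a row (one object), and each key plays the role of a column (one attribute).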
Types of Attributes
- Qualitative attributes are descriptive and categorical, while quantitative attributes are numerical and measurable.
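As a rough sketch of this distinction, the snippet below labels an attribute by inspecting its values. The rule used here (numbers are quantitative, everything else is qualitative) is a simplifying assumption, and `attribute_kind` is a hypothetical helper; numeric codes such as postal codes would be mislabeled by it.

```python
from numbers import Number


def attribute_kind(values):
    """Label an attribute as 'quantitative' or 'qualitative' (heuristic only).

    Assumption: an attribute is quantitative when every value is a number.
    Booleans are excluded because binary attributes are categorical.
    """
    if all(isinstance(v, Number) and not isinstance(v, bool) for v in values):
        return "quantitative"
    return "qualitative"


print(attribute_kind([34, 27, 41]))            # numeric measurements
print(attribute_kind(["red", "blue", "red"]))  # category labels
```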
Description of Data Objects
- The description of data objects by a number of attributes is referred to as a multivariate description.
Alternative Names for Attributes
- Attributes can also be referred to as variables, features, or fields.
Objects in a Data Set
- In a data set, each object typically represents a single entity or observation.
Fields or Columns in a Data Set
- Each field or column in a data set corresponds to a single attribute or variable.
Types of Data Sets
- A data matrix (a form of record data) can be represented by a numerical matrix or crosstabs.
- Document data can be represented by text documents with term-frequency vectors.
- Sequential data can be represented by transaction sequences.
- Spatial and multimedia data include spatial data such as maps, as well as image data.
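To make the term-frequency representation of document data concrete, here is a minimal sketch. The whitespace tokenization, the sample documents, and the function name `term_frequency_vectors` are assumptions chosen for illustration; real pipelines usually add steps such as punctuation handling and stop-word removal.

```python
from collections import Counter


def term_frequency_vectors(documents):
    """Turn documents into term-frequency vectors over a shared vocabulary.

    Assumption: terms are lowercase, whitespace-separated tokens.
    """
    tokenized = [doc.lower().split() for doc in documents]
    # Sorted vocabulary so every vector uses the same column order.
    vocabulary = sorted({term for tokens in tokenized for term in tokens})
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append([counts[term] for term in vocabulary])
    return vocabulary, vectors


# Hypothetical sample documents.
docs = ["team coach play ball", "ball score game ball"]
vocab, vectors = term_frequency_vectors(docs)
print(vocab)
for vector in vectors:
    print(vector)
```

Each document becomes one row of a numerical matrix whose columns are terms, which is why document data can be treated much like a data matrix.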
Transaction ID (TID)
- Each TID (Transaction ID) item in a data set usually represents a single transaction or event.
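A small sketch of transaction data keyed by TID follows. The item names are made up, and modeling each transaction as a Python set of items is an assumption made for simplicity; `item_counts` is a hypothetical helper.

```python
from collections import Counter

# Hypothetical transaction data: TID -> items in that transaction.
transactions = {
    1: {"bread", "coke", "milk"},
    2: {"beer", "bread"},
    3: {"beer", "coke", "diaper", "milk"},
}


def item_counts(transactions_by_tid):
    """Count in how many transactions each item appears."""
    counts = Counter()
    for items in transactions_by_tid.values():
        counts.update(items)
    return counts


counts = item_counts(transactions)
for item in sorted(counts):
    print(item, counts[item])
```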
Fields of a Record
- The fields of a record in a data set correspond to the attributes or variables that describe the object or case.
Image Data
- Image data falls under spatial and multimedia data; an image can be described by a set of numerical or categorical attributes that capture its characteristics.
Test your knowledge of data types, data quality, and processing with this quiz based on the DM&W course by May Al-Nashashibi, University of Petra, Amman, Jordan. Topics include introduction to data mining, machine learning, measures of similarities, data exploration, data warehousing representation, predictive modeling, classification, performance measures, and alternative techniques.