Indexes: Primary vs. Secondary Keys

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

What is the primary goal of an index in the context of databases?

To complicate the retrieval of data.
To reorganize the physical storage of data.
To slow down the retrieval of data.
To provide speedy retrieval of data. (correct)

What are search keys also referred to as?

Index Value
Search Attribute
Termed Value
Termed Key (correct)

What does a search key consist of?

Attributes from a different table
The file path to the table
Randomly generated values
Attributes from the schema of the underlying table (correct)

Which type of index has a search key that is not the primary key for the underlying table?

Secondary-key index (D) Signup and view all the answers

Which of the following scenarios exemplifies a secondary-key index?

Index on GPA over the students table. (B) Signup and view all the answers

What is a characteristic of a primary-key index?

Has only one tuple for a given search key. (C) Signup and view all the answers

Which of the following is NOT a choice of what to store in an index?

Links to external resources (A) Signup and view all the answers

In an index-based table, how many indexes can store the entire table?

Only one index (B) Signup and view all the answers

When the table is sorted by the same attribute as the Search Key, this produces a comb-like shape which is an indication of what?

Clustered index (B) Signup and view all the answers

What is a key characteristic of a clustered index?

Table is sorted based on the same attribute as that of the index (C) Signup and view all the answers

What is a key difference between dense and sparse indexes?

Dense index has an entry in the index per tuple, while sparse index has an entry per page (C) Signup and view all the answers

If an un-clustered index is used for a range search, what is likely to occur?

Tuples in the range will end up being in separate pages. (C) Signup and view all the answers

In the context of query processing with un-clustered indexes, what is the advantage of collecting TIDs?

It allows retrieving a page only once for multiple qualifying tuples and reduces cost. (B) Signup and view all the answers

What is a multi-dimensional index?

An index where the search key is more than one attribute. (D) Signup and view all the answers

What is the simple solution to realize a composite index?

Concatenate the two attribute values together and build a hash or tree-based index (A) Signup and view all the answers

What type of queries can a tree-based index on concatenated attributes A and B efficiently answer?

Equality on AB, Equality on A only, Range predicate on A and Range predicate on B (D) Signup and view all the answers

Assume a table Enrolled (sid, cid, grade) and Scenario 2: Build two single-attribute indexes; one on A and one on B related to composite indexes vs. One-attribute indexes; which of the following is a strategy to execute the query that use the index of SID and CID to find the grade?

Use the index of SID to find the qualifying TIDs and then retrieve the corresponding tuple and check the CID value on the fly (B) Signup and view all the answers

When building single-attribute indexes for a composite query, which factor determines the best strategy?

The one that ends up retrieving the least number of tuples (i.e., the one with high selectivity) (A) Signup and view all the answers

What is a characteristic of ISAM?

Extend to multiple levels and Good for static data (C) Signup and view all the answers

Which operation does ISAM handle well?

Range queries (B) Signup and view all the answers

In ISAM, what happens when data is skewed during inserts?

Index performance degrades to that of a linked list O(n) (B) Signup and view all the answers

What is a key feature of all nodes in a B-tree or B+-tree?

Each node is a disk page (C) Signup and view all the answers

What performance characteristic is associated with B-tree operations?

O(log N) (D) Signup and view all the answers

What is the minimum occupancy guarantee for a B-tree node (except for the root)?

At least 50% full (C) Signup and view all the answers

Which of the following is specific to B+-trees compared to B-trees?

Data only at the leaf level (A) Signup and view all the answers

Why are leaf nodes linked together in B+-trees?

To facilitate range search (C) Signup and view all the answers

How does B+-tree search begin?

At the root node (B) Signup and view all the answers

After locating the leaf page in a B+-tree, how is data within the leaf page typically accessed?

Binary search (A) Signup and view all the answers

In a B+-tree insert operation, what happens if the target leaf node is full?

The node is split evenly into two siblings (D) Signup and view all the answers

What is the first consideration when deleting from B+-tree?

Every node is guaranteed to continuously be at least 50% full (B) Signup and view all the answers

What happens when a B+-tree leaf node is exactly half full during a deletion?

The node borrows an item from a sibling node. (B) Signup and view all the answers

During B+-tree deletion, what initiates a merge of leaf nodes?

When adjacent sibling leaf nodes are both exactly half full (D) Signup and view all the answers

What is the first step in bulk loading data into a B+-tree?

Sort the data at the leaf level (D) Signup and view all the answers

What occupancy level should be used when bulk loading a B+-tree?

Any amount of desired occupancy (e.g., 80% full nodes) (C) Signup and view all the answers

What performance benefit does bulk loading provide over standard insertion into a B+-tree?

Allows page-level inserts and hence faster than tuple-level inserts. (C) Signup and view all the answers

What is Bulk Loading?

Propagate the key of each page in the leaf level to the Non-leaf level (B) Signup and view all the answers

Besides speeding up data retrieval, what other purposes can B+-tree indexes serve?

Testing for integrity constraints (C) Signup and view all the answers

Why are tree indexes popular?

The B-tree is both theoretically and practically appealing (A) Signup and view all the answers

For what usage scenarios is the B-tree particularly well-suited?

Indexing dynamic data sets (B) Signup and view all the answers

What is the overall time complexity for operations performed on a B-tree?

O(log N) (A) Signup and view all the answers

What is the expected output of an index, given one or more values, or a range of values, as input?

Tuples from the underlying table that match the given values or fall within the specified range. (C) Signup and view all the answers

If an index's search key consists of the primary key of the underlying table, how is the index classified?

Primary-key index. (B) Signup and view all the answers

Under what circumstance is the Search key of the index is NOT the primary key for the underlying table?

Secondary-key index (D) Signup and view all the answers

Which of the following is true when considering what an index stores?

Indexes can store the entire table, key-value tuple identifier pairs, or a key value set of tuple identifiers (D) Signup and view all the answers

In an index that contains (Key Value, Tuple-Identifier) pairs, what do the tuple identifiers do?

Point to the tuples in the underlying table. (C) Signup and view all the answers

What is indicated when comb-like connections are observed between the index and the table?

A clustered index is being used. (A) Signup and view all the answers

In the context of sparse indexes, what does an index entry typically point to?

The first tuple in each page of the table. (A) Signup and view all the answers

Which of the following is true about index-based tables?

A table can be conceptually indexed multiple times, but stored in only one index. (B) Signup and view all the answers

How does a clustered index affect the physical ordering of a table?

Sorts the table based on the same attribute as the index. (C) Signup and view all the answers

In the context of database indexes, what is the primary difference between a dense index and a sparse index?

A dense index has an entry for every record in the file, whereas a sparse index has entries for a subset of records. (C) Signup and view all the answers

What is a notable disadvantage of using un-clustered indexes?

Sorting the table by a different attribute can lead to non-contiguous range search results. (D) Signup and view all the answers

How can the cost of query processing for un-clustered indexes be addressed?

Collecting and sorting TIDs (Tuple Identifiers) to minimize page retrievals. (C) Signup and view all the answers

What is the result of concatenating sid and cid?

Multi-dimensional Index (D) Signup and view all the answers

If a database has a table 'Employees' with attributes 'department' and 'salary', which query would benefit most from a composite index on ('department', 'salary')?

SELECT * FROM Employees WHERE department = 'Sales' AND salary > 50000; (C) Signup and view all the answers

What benefit can be derived from using a secondary key index that stores a Key Value and Set of Tuple-identifiers?

It can save on storage space when multiple records have the same key value. (D) Signup and view all the answers

When using single-attribute indexes for a composite query, which strategy is the best?

The one that ends up retrieving the least tuples (i.e., the one with high selectivity) (C) Signup and view all the answers

How can building multiple one-attribute indexes be advantageous over building a composite index?

Offers multiple strategies to answer the query. (B) Signup and view all the answers

What happens to index performance in an ISAM (Indexed Sequential Access Method) system when data is skewed during inserts?

Index performance degrades to that of a linked list. (A) Signup and view all the answers

What is the consequence of inserting a large amount of data into an ISAM index, leading to long chains of overflow pages?

The index becomes fragmented, and search performance degrades. (D) Signup and view all the answers

What is a common feature exhibited by nodes in a B-tree and B+-tree?

Every node is a disk page. (B) Signup and view all the answers

How does performance scale with the number of disk pages at the leaf level in a B-tree?

Logarithmic. (D) Signup and view all the answers

Which aspect distinguishes a B+-tree from a B-tree?

Data is located only at the leaf level. (D) Signup and view all the answers

Why does a B+-tree link leaf nodes together?

To allow sequential data retrieval. (C) Signup and view all the answers

In a B+-tree, where does data exist?

Data only goes at the leaf level (D) Signup and view all the answers

What is one of the first considerations during the deletion of a B+-tree?

Whether Q is greater than half full (D) Signup and view all the answers

In the context of bulk loading a B+-tree, what dictates the order for adding data entries?

Data entries need to be sorted first. (A) Signup and view all the answers

What happens to the tree height if the root is full and root splits into two?

Tree height will increase by 1. (A) Signup and view all the answers

What happens to the rest of the nodes in a B+-tree after deletion?

Every node is guaranteed to at least 50% full (except root) (B) Signup and view all the answers

In B+-tree indexes: Testing for Integrity Constraints, is it true or false that you could check if key of a table is unique, system can construct a B+-tree index to check quickly for this integrity constraint condition?

True (B) Signup and view all the answers

Which of the following is true about ISAM?

ISAM predates the B-tree and mostly works well for indexing static data (C) Signup and view all the answers

What is the initial strategy for handling inserts?

Leave gaps in pages while building the index (D) Signup and view all the answers

In what usage scenarios is ISAM particularly well-suited?

Data that is primarily static and requires fast read access. (A) Signup and view all the answers

Of the options, which one is the main feature of the B-tree family?

Each node in the B-tree or B+-tree is a disk page (B) Signup and view all the answers

Which of the following is the correct Big O representation for the B-tree family of indexes insert, deletes, updates, and search?

O(Log N) (B) Signup and view all the answers

Assume one performed operations such that there were an odd number of children, what would be the correct way to determine which key to copy to the parent?

Pick the key closest to the middle (C) Signup and view all the answers

Flashcards

Target of an index

The target of an index is to provide speedy retrieval of data from an underlying table.

Primary-key index

The search key of the index is the primary key for the underlying table