Database Storage Organization

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

Which storage type provides the fastest data access for the CPU?

Primary storage (correct)
Secondary storage
Tertiary storage
External storage

Which of the following is characteristic of secondary storage?

Slower data access compared to primary storage (correct)
Directly accessed by the CPU
More expensive per unit of storage compared to tertiary storage
Smaller storage capacity compared to primary storage

What is the primary role of 'cache memory' in the memory hierarchy?

Serving as the least expensive end of the memory hierarchy
Providing the main work area for the CPU
Storing large permanent databases
Speeding up execution of program instructions (correct)

Which memory type is known as 'main memory'?

DRAM (D) Signup and view all the answers

What does a 'record type' or 'record format definition' consist of?

A collection of field names and their corresponding data types (D) Signup and view all the answers

What is the key characteristic of 'fixed-length records'?

Every record in the file has exactly the same size in bytes (D) Signup and view all the answers

When might a file have variable-length records?

When one or more fields have multiple values for individual records (C) Signup and view all the answers

In fixed-length records, if a field is optional, what is a common technique to handle the absence of a value?

Storing a special NULL value (A) Signup and view all the answers

What is the purpose of 'separator characters' in variable-length fields?

To determine the bytes that represent each field (D) Signup and view all the answers

If a file includes records of different types, what is typically included at the beginning of each record?

A record type indicator (B) Signup and view all the answers

What is 'blocking factor'?

The number of records that can fit into one block (D) Signup and view all the answers

In the context of database storage, what is a 'spanned record'?

A record that is larger than the block size. (C) Signup and view all the answers

When is it advantageous to use spanned records?

When the average record is large (D) Signup and view all the answers

In 'contiguous allocation', what is a disadvantage?

Makes expanding file blocks difficult (C) Signup and view all the answers

What is the role of a 'file header'?

Determine the disk addresses of the file blocks. (A) Signup and view all the answers

What is a primary drawback of using files of unordered records (heap files)?

Expensive linear search procedure (C) Signup and view all the answers

What is a common method for record deletion in unordered files (heap files) that avoids immediately rewriting the block?

Setting a deletion marker (C) Signup and view all the answers

In which file organization if the 'ordering field' also a 'key field'?

Sorted files (C) Signup and view all the answers

What is the primary disadvantage of using ordered files (sorted Files)?

Expensive insertions and deletions (B) Signup and view all the answers

What is a typical approach to handle insertions in ordered files to improve efficiency?

Creating an overflow file (D) Signup and view all the answers

In hashing, what is the component that generates a disk block address?

Hash function (B) Signup and view all the answers

What condition must be met in a hash file?

An equality condition (A) Signup and view all the answers

In hashing, what is 'folding'?

Using an arithmetic or logical function to combine portions of the hash field value (A) Signup and view all the answers

What happens during a 'collision' in hashing?

Two different records hash to the same address (C) Signup and view all the answers

In 'open addressing', how is a collision resolved?

By checking subsequent positions until an empty one is found (B) Signup and view all the answers

How does the 'chaining' method resolve collisions in hashing?

It allocates additional overflow positions and uses pointers. (D) Signup and view all the answers

What is the main goal of a good hashing function?

To minimize collisions and unused locations (C) Signup and view all the answers

What is the key difference between internal and external hashing?

External hashing is for disk files, while internal hashing is for memory. (D) Signup and view all the answers

In external hashing, what does a bucket typically consist of?

One disk block or a cluster of contiguous disk blocks (A) Signup and view all the answers

In external hashing, what is 'static hashing'?

A hashing scheme where the number of buckets is fixed. (B) Signup and view all the answers

Which of the following describes 'extendible hashing'?

A dynamic hashing that grows and shrinks efficiently. (D) Signup and view all the answers

What do index structures provide, in addition to the primary data file?

Alternative ways to access the records. (C) Signup and view all the answers

What is a primary index?

A index on the ordering key field of an ordered file (D) Signup and view all the answers

In a file using a primary index, what is referred to as the 'block anchor'?

First record in each block. (A) Signup and view all the answers

What characterizes a 'dense index'?

It has an index entry for every search key value in the data file. (B) Signup and view all the answers

What is the difference between a primary index and a clustering index?

A clustering has duplicate values (D) Signup and view all the answers

What is a key advantage of using a secondary index?

Improvement in search time for an arbitrary record (C) Signup and view all the answers

On what type of field can a secondary index be created?

It can be created on a candidate field with unique value or on a nonkey field with duplicate values. (C) Signup and view all the answers

What is the purpose of multilevel indexing?

Faster access, especially for large directories (D) Signup and view all the answers

What are the advantages of B-Trees?

Self-balancing, fast retrieval times (D) Signup and view all the answers

In a B+ tree, where are data pointers stored?

Only at the leaf nodes (C) Signup and view all the answers

Imagine a scenario where a database system uses spanned records with a block size $B$ of 4096 bytes. If the file contains a mix of variable-length and fixed-length records, and a particular variable-length record with internal fragmentation consumes significant portions block, without separators, and the number of records, $r = 10,000$, with an average file size of 2000 bytes, what would be the best way to approach optimization?

Implement a record clustering strategy to place related records contiguously, improving locality and reducing block boundary crossings. (D) Signup and view all the answers

In cases where memory read and write operations are significantly slower than CPU operations, which strategies would be the most effective in minimizing the impact of slow memory access in a database system?

Increase memory capacity and reduce frequency of memory read/write operations. (B) Signup and view all the answers

Suppose in a very specific database file format, each record starts with a 2-byte record type indicator, and the rest of the record's structure varies greatly based on this indicator. Given this setup, devise a precise algorithm to perform a binary search on this file by the `timestamp` field, considering the `timestamp` field may be located at vastly different offsets into each record. If not, where did the file read go wrong? The timestamp is guaranteed to exist but its location is unknown, and is not guaranteed, either. Return 'record not found'.

Use binary search algorithm. Perform linear record read. Use a key field to organize records to perform search. Maintain timecode and key fields that follow a common standard with a block size divisible by the size of the new record-header. (C) Signup and view all the answers

Signup and view all the answers

Flashcards

Primary storage

Storage media that can be operated on directly by the computer's CPU. Provides fast data access.

Secondary and tertiary storage

Storage including magnetic disks, optical disks, and tapes; data must be copied to primary storage to be processed by the CPU.