Distributed Databases (DDB) Explained

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

Which of the following accurately describes a distributed database (DDB)?

A single database stored in one central location for easy access and management.
A database managed by software that hides data distribution, requiring manual user configuration.
A database system where all nodes must be homogeneous to ensure data consistency.
A collection of logically interrelated databases distributed over a computer network. (correct)

Which of the following is NOT typically considered a direct benefit of using distributed databases?

Improved reliability through redundancy and fault tolerance.
Enhanced data security due to centralized control and monitoring. (correct)
Increased scalability to handle growing data volumes and user traffic.
Better support for geographically distributed organizational structures.

Which aspect of a DDBMS do users typically not need to be aware of, reflecting a key principle of transparency?

Data fragmentation. (correct)
Data replication methods.
Data language used for queries.
Network configuration.

How do replicated components in a distributed system contribute to reliability and availability?

They eliminate the single point of failure, so failure of one node does not impact the system. (C) Signup and view all the answers

Which of the following performance benefits is primarily associated with data localization in distributed databases?

Reduced contention for CPU and I/O services. (A) Signup and view all the answers

What is the primary difference between scaling up and scaling out a database system?

Scaling up enhances the capacity of a single server, while scaling out distributes the load across multiple servers. (B) Signup and view all the answers

Why is distribution still desirable when you want to manage the whole system?

It is more expensive to manage, although is still desirable for handling very large data volumes and distributed data accesses (B) Signup and view all the answers

In the context of DDB architecture, what is the role of the Global Conceptual Schema (GCS)?

To provide a unified, integrated view of the entire database. (D) Signup and view all the answers

Which of the following factors is LEAST critical when designing a distributed database?

The brand of database software used at each site. (D) Signup and view all the answers

What is the main advantage of the top-down approach to distributed database design?

Optimal data consistency can be achieved, starting with a comprehensive design. (C) Signup and view all the answers

In the context of a top-down design process for a distributed database, what does 'data fragmentation' primarily involve?

Dividing data into smaller, manageable units/fragments. (B) Signup and view all the answers

Which of the following is an example of the 'View Design' step in a top-down distributed database design process?

Defining what store managers can view inventory and sales data for their specific location. (B) Signup and view all the answers

What is the primary purpose of 'data allocation' in the context of distributed database design?

To assign database fragments to specific locations for optimal performance. (A) Signup and view all the answers

Which aspect of database design is primarily addressed during the 'physical design' phase?

Choosing hardware, storage solutions, indexing and clustering strategy. (B) Signup and view all the answers

What is the primary goal of fragmentation in a distributed database?

To allow for parallel processing by partitioning data and improve data locality. (C) Signup and view all the answers

What are the 'CDR properties' of fragmentation?

Completeness, Disjointness, Reconstruction. (A) Signup and view all the answers

Why is it important for data fragmentation to allow data reconstruction?

To enable complete recovery of the original data from fragments. (B) Signup and view all the answers

What is the main difference between horizontal and vertical fragmentation?

Horizontal divides into rows; vertical divides a table into columns. (C) Signup and view all the answers

Which of the following is an benefit of Round Robin data distribution for different queries?

Distribute data evenly (D) Signup and view all the answers

Which of the following is a drawback of data fragmentation?

Increased query overhead for global queries. (A) Signup and view all the answers

What is a primary characteristic of primary horizontal fragmentation?

It is defined using simple conditions on a single primary table. (B) Signup and view all the answers

In the context of fragmentation, what is the purpose of the 'minterm predicates approach'?

To automatically generate predicates with properties such as completeness and disjointness. (A) Signup and view all the answers

What is derived horizontal fragmentation?

Fragmentation that is being derived from primary relation (using predicates with joined foreign relations). (A) Signup and view all the answers

In the context of database fragmentation, what must derived fragmentation avoid?

Reconstructing or reconstructing tables (D) Signup and view all the answers

In derived horizontal fragmentation, if relation R is the owner and relation S is a member, how is the fragmentation defined?

Fragments of S are defined in terms of R. Semi-join operator is used to define the fragments (A) Signup and view all the answers

What must all fragments includes when using vertical fragmentation?

The primary key for reconstruction (D) Signup and view all the answers

When is fragmentation sometimes forced onto database administrators?

When sites may own data (D) Signup and view all the answers

What does a semi-join operation reduce when used in a centralized database?

Quantity of data from the hard disk into the memory (C) Signup and view all the answers

In the context of replication, what does updating replicated data mean?

All copies of replicated data must be updated in a single transaction to maintain atomicity and consistency (D) Signup and view all the answers

How can fragments be allocated to sites?

Finding an optimal mapping can be minimized with heuristics-based algorithms (C) Signup and view all the answers

What is the SSOT Property?

Single Source of Truth (C) Signup and view all the answers

Which of the following is a goal of allocating fragments in distributed database design?

Minimize query response time (D) Signup and view all the answers

Which database fragmentation property is best described as the following: '$∀ F₁, Fj ∈ F, i ≠ j ⇒ F¿ ∩ Fj = ¢$'?

Disjointness (B) Signup and view all the answers

In allocating data fragments to sites, how is total cost calculated?

$total_cost = total_local_processing_cost + total_data_exchange_cost + total_stoarge_cost$ (A) Signup and view all the answers

Which of the following must be maintained in order for all copies of replicated data to be updated?

Atomicity and Consistency. (A) Signup and view all the answers

What is NOT an example of a question to consider when using SSOT?

Where is the database cluster located? (D) Signup and view all the answers

What term refers to the number of tuples that need to be accessed to process a query?

Fragment Selectivity (B) Signup and view all the answers

Which of the following best describes what SSOT solves in DDB?

Duplication conflicts (D) Signup and view all the answers

Which of the following is NOT a reason that Semi-join operations are used?

Semi-Join operators can be used in semi-structure fragmentation (D) Signup and view all the answers

From the trade-off of Fragmentation with Replication image, does fragmentation or full replication have greater update problems?

Full replication (B) Signup and view all the answers

Flashcards

Distributed Database (DDB)

A database spread across multiple locations or nodes.

DDBMS

Software that manages a distributed database and hides distribution details.