Caching Concepts and CDN Essentials

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

What is a key reason a cache server is not ideal for persisting data?

Cache servers are only designed for small amounts of data.
Cache servers are more expensive to maintain.
Cache servers use more power than databases.
Cached data is stored in volatile memory. (correct)

What happens to cached data when it expires, according to an expiration policy?

It is compressed to save space.
It is removed from the cache. (correct)
It is automatically backed up to a persistent database.
It is moved to a slower tier of cache storage.

What does consistency in caching refer to?

Having a uniform expiration policy across all cached data.
Keeping the data store and the cache in sync. (correct)
Using the same type of memory for all cache servers.
Ensuring cached data is always encrypted.

What is a single point of failure (SPOF)?

A part of a system that, if it fails, will stop the entire system from working. (A) Signup and view all the answers

What is a recommended strategy to mitigate the risk of a single cache server being a SPOF?

Using multiple cache servers across different data centers. (A) Signup and view all the answers

What is cache eviction?

The process of removing existing items from the cache to make room for new items. (B) Signup and view all the answers

What type of content is typically delivered by a Content Delivery Network (CDN)?

Static content like images and videos. (D) Signup and view all the answers

What does overprovisioning memory for a cache achieve?

It provides a buffer as memory usage increases. (B) Signup and view all the answers

What is the primary function of a CDN?

To deliver static content from a server closest to the user. (D) Signup and view all the answers

What happens if a CDN server doesn't have a requested file in its cache?

It requests the file from the origin server. (D) Signup and view all the answers

What does TTL stand for in the context of CDNs?

Time To Live (A) Signup and view all the answers

What is a key consideration when using a CDN regarding infrequently used assets?

They may not provide significant benefits when cached and could be moved out of the CDN. (D) Signup and view all the answers

What should a website do if a CDN experiences a temporary outage?

Clients should detect the problem and request resources from the origin. (D) Signup and view all the answers

What is one way to immediately remove a file from a CDN before its TTL expires?

Invalidating the CDN object using APIs provided by CDN vendors. (A) Signup and view all the answers

What is the ultimate goal when answering system design questions?

To propose an architecture that meets the system design goals (C) Signup and view all the answers

What is object versioning in the context of CDNs?

Using different versions of an object by adding a parameter to the URL. (A) Signup and view all the answers

What should you understand well to shape the direction of the discussion?

System requirements, constraints, and potential bottlenecks (A) Signup and view all the answers

Why is setting an appropriate cache expiry time important for time-sensitive content?

To ensure content is neither too stale nor reloaded too frequently. (C) Signup and view all the answers

What is vital to the success of a system design interview?

The right strategy and knowledge (A) Signup and view all the answers

What is the initial setup described in the chapter 'Scale From Zero to Millions of Users'?

A single server setup (A) Signup and view all the answers

During the initial design blueprint phase, what should you do with your interviewer?

Treat them as a teammate and collaborate. (D) Signup and view all the answers

What is the purpose of drawing box diagrams during a system design interview?

To visually represent key components and their interactions. (A) Signup and view all the answers

In the single server setup, what components typically reside on the same server?

Web app, database, and cache (C) Signup and view all the answers

What is the primary reason for performing back-of-the-envelope calculations during a system design interview?

To quickly evaluate if the design fits the scale constraints. (D) Signup and view all the answers

What is the first step in the request flow when a user accesses a website?

The DNS service resolves the domain name to an IP address. (B) Signup and view all the answers

When is it appropriate to include API endpoints and database schema in a system design discussion?

Only for smaller, more focused problems. (D) Signup and view all the answers

What is the purpose of DNS in the request flow?

To translate domain names into IP addresses (A) Signup and view all the answers

Before diving into the details, what should you do regarding back-of-the-envelope calculations?

Communicate with your interviewer if they are necessary. (D) Signup and view all the answers

In the initial request flow (before scaling), who typically provides the Domain Name System (DNS) service?

Third-party providers (C) Signup and view all the answers

What benefit is gained by going through a few concrete use cases?

It helps you frame the high-level design and discover edge cases. (A) Signup and view all the answers

What should you and the interviewer have already agreed on before the design deep dive?

The overall goals and feature scope. (A) Signup and view all the answers

During a senior candidate interview, what might the discussion focus on?

System performance characteristics, bottlenecks, and resource estimations. (C) Signup and view all the answers

What is a disadvantage of the sliding window counter algorithm related to memory usage?

It consumes a lot of memory because even if a request is rejected, its timestamp might still be stored in memory. (D) Signup and view all the answers

The sliding window counter algorithm is a hybrid approach that combines which of the following?

Fixed window counter and sliding window log (A) Signup and view all the answers

In the sliding window counter algorithm, the number of requests in the rolling window is calculated using which formula?

Requests in current window + requests in the previous window * overlap percentage of the rolling window and previous window (A) Signup and view all the answers

What is a primary advantage of the sliding window counter algorithm?

It smooths out spikes in traffic. (A) Signup and view all the answers

For what type of look back window does the sliding window counter algorithm work?

It only works for not-so-strict look back window. (B) Signup and view all the answers

Why is a database generally not a good choice for storing rate limiting counters?

Due to slowness of disk access. (A) Signup and view all the answers

Which technology is a popular option to implement rate limiting?

Redis (D) Signup and view all the answers

Which commands does Redis offer that makes it suitable for rate limiting?

INCR and EXPIRE (A) Signup and view all the answers

Which CAP characteristic is sacrificed by CP systems?

Availability (B) Signup and view all the answers

Why are CA systems impractical in real-world distributed applications?

Network partitions are unavoidable. (D) Signup and view all the answers

In a distributed system with three replicas (n1, n2, n3), what happens when n3 goes down during a network partition if the system chooses consistency (CP)?

Writes to n1 and n2 are blocked to avoid data inconsistency. (B) Signup and view all the answers

In a distributed system, what is the primary concern for a bank system when choosing between consistency and availability?

Displaying up-to-date balance information (C) Signup and view all the answers

In an AP system, what is the typical behavior when a network partition occurs and a client attempts a write operation?

The write operation is accepted, and data is synced when the partition is resolved. (C) Signup and view all the answers

What is the main trade-off between CP and AP systems?

Consistency vs. Availability (D) Signup and view all the answers

What is a key consideration when choosing the appropriate CAP guarantees for a distributed key-value store?

The specific use case requirements (B) Signup and view all the answers

Flashcards

Cache Server Data Persistence

Data in cache servers is lost upon restart because it's stored in volatile memory.

Expiration Policy

A strategy to remove data from the cache after a set time period.