Caching Strategies in Software Applications

Podcast

Play an AI-generated podcast conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

In which caching strategy does the cache directly update the database whenever data is modified?

Write-through (correct)
Cache-aside
Write-behind
Read-through

Which caching strategy is best suited for applications where data is frequently updated and needs to be immediately available?

Write-behind
Cache-aside
Write-through (correct)
Read-through

Which caching strategy provides flexibility in managing cache population and eviction, but may require app-level logic for cache management?

Read-through
Write-behind
Cache-aside (correct)
Write-through

Which caching strategy is designed for applications with complex caching needs or irregular access patterns?

Cache-aside (D)

Signup and view all the answers

Which caching strategy is best for applications where the data is typically retrieved more frequently than it is updated?

Read-through (A)

Signup and view all the answers

Which caching strategy is particularly well-suited for applications that prioritize low write latency and can tolerate some data loss in the event of a cache failure?

Write-behind (A)

Signup and view all the answers

Which caching strategy centralizes control over cache management, thus reducing the risk of cache stampedes?

Read-through (B)

Signup and view all the answers

Which caching strategy typically involves the use of a separate cache layer that acts as a backup for the database?

Write-behind (B)

Signup and view all the answers

Which of the following is NOT a valid target destination for Kinesis Data Firehose?

Amazon Aurora (C)

Signup and view all the answers

What is the primary use case for Kinesis Data Firehose?

High-throughput data ingestion and delivery (D)

Signup and view all the answers

How does Kinesis Data Firehose ensure near real-time data delivery?

It buffers data and flushes it to the destination based on time and size rules. (D)

Signup and view all the answers

Which of these is a benefit of using Kinesis Data Firehose compared to Kinesis Data Streams (KDS)?

Kinesis Data Firehose handles scaling and resource allocation automatically. (B)

Signup and view all the answers

Which of the following is NOT a benefit of using Enhanced Fan Out consumers in Kinesis Data Streams?

Lower costs due to reduced resource utilization. (B)

Signup and view all the answers

What is the purpose of the Kinesis Client Library (KCL)?

To read and process records from Kinesis Data Streams. (C)

Signup and view all the answers

What is a record processor in the context of Kinesis Client Library (KCL)?

A component that reads and processes individual records from Kinesis Data Streams. (D)

Signup and view all the answers

How can a user prevent the ExpiredIterationException from occurring when using Kinesis Client Library (KCL)?

Increase the provisioned write capacity units (WCU) for DynamoDB table used for KCL coordination. (B)

Signup and view all the answers

Which of the following technologies CAN read data from Kinesis Data Firehose?

Amazon Lambda (B)

Signup and view all the answers

What is the primary difference between Enhanced Fan Out consumers and Standard Consumers in Kinesis Data Streams?

Enhanced Fan Out consumers are designed for higher data throughput and real-time processing. (C)

Signup and view all the answers

Which data formats are supported by Athena?

CSV, TSV, JSON, ORC, Parquet, Avro (B)

Signup and view all the answers

Which of the following is NOT a valid use case for Athena?

Creating reports and visualizations for data stored in S3 (B)

Signup and view all the answers

Which security features are available for Athena queries?

IAM, ACLs, S3 bucket policies, SSE-S3, SSE-KMS, CSE-KMS, TLS (B)

Signup and view all the answers

How does Athena handle data encryption when querying S3 files?

Athena can query encrypted S3 data without decrypting it. (A)

Signup and view all the answers

Which of the following is NOT a valid method for optimizing Athena performance?

Using a large number of partitions for the data (A)

Signup and view all the answers

What are the two ways to define the partition key of a DynamoDB table?

Partition key (HASH) and Sort key (RANGE) (D)

Signup and view all the answers

What is the maximum size of a DynamoDB item?

400 KB (A)

Signup and view all the answers

Which of the following data types are not supported by DynamoDB?

Datetime (D)

Signup and view all the answers

Which read capacity unit (RCU) consumption is correct, given 10 strong consistent reads (SCR) per second for an item of size 6 KB?

20 RCUs (A)

Signup and view all the answers

What kind of read capacity unit will you consume when you use the `ConsistentRead` parameter set to `True` in the API calls?

Strong consistent read (SCR) (B)

Signup and view all the answers

What is the consequence of exceeding the provisioned capacity for a DynamoDB table?

The table will be throttled, resulting in errors. (B)

Signup and view all the answers

Which of the following is not considered an 'anti-pattern' for DynamoDB?

Utilizing DynamoDB for managing user profiles and session data. (D)

Signup and view all the answers

What is the purpose of 'burst capacity' in DynamoDB?

It allows exceeding the provisioned capacity temporarily. (A)

Signup and view all the answers

What is the function of the 'partition keys' in DynamoDB?

They distribute data across multiple physical servers. (D)

Signup and view all the answers

Which of the following would be a suitable scenario for using DynamoDB?

Maintaining a database for a real-time analytics application. (D)

Signup and view all the answers

What is a primary feature of Workgroups in the context of user organization and query access?

They can provide separate query histories for each group. (C)

Signup and view all the answers

Which aspect of AWS Glue Data Catalog security is broader than data filters in Lake Formation?

IAM-based database and table level security. (D)

Signup and view all the answers

Which of the following is NOT a key feature of Athena Notebook?

Support for unstructured data only. (A)

Signup and view all the answers

What best describes the purpose of Spark in the context of big data analytics?

It processes data using a distributed computing framework. (A)

Signup and view all the answers

Which feature of Spark Streaming allows it to handle constantly growing datasets?

Structured streaming capabilities. (A)

Signup and view all the answers

What is the primary component responsible for managing memory and scheduling in Spark?

Spark Context. (C)

Signup and view all the answers

Which of the following operations can be restricted through IAM policies in relation to the AWS Glue Data Catalog?

Altering database structures. (B)

Signup and view all the answers

Which programming support is NOT provided by Spark Integration within the Athena console?

Java APIs. (A)

Signup and view all the answers

Which library within Spark is designed specifically for machine learning at a large scale?

MLLib. (B)

Signup and view all the answers

What type of data format does Spark NOT support?

XML. (A)

Signup and view all the answers

What is a crucial feature of Workgroups in terms of cost management?

They can track costs by workload. (C)

Signup and view all the answers

Which component of Spark is primarily responsible for fault recovery?

Spark core. (D)

Signup and view all the answers

Which operation is NOT part of the supported functionalities for Spark streaming?

Batch processing. (B)

Signup and view all the answers

What best describes the relationship between Spark and Athena?

Athena can run Jupyter notebooks with Spark, enabling enhanced data analysis. (B)

Signup and view all the answers

What is a key benefit of using EMRFS with S3?

Enables persistent storage after cluster termination (D)

Signup and view all the answers

Which of the following describes the nature of data stored in EBS for HDFS?

Data is deleted when the cluster is terminated (C)

Signup and view all the answers

What does the serverless feature of EMR do?

It decides the number of nodes required for tasks automatically (B)

Signup and view all the answers

Kinesis data streams utilize which of the following components?

Shards for ordered sequence of records (C)

Signup and view all the answers

What is a characteristic of on-demand mode in Kinesis?

Automatically adjusts capacity based on previous peak usage (C)

Signup and view all the answers

How does Kinesis ensure the immutability of data once it is inserted?

Records cannot be deleted after they are added to the stream (B)

Signup and view all the answers

What is the function of Kinesis' shard splitting?

It increases the overall stream capacity (D)

Signup and view all the answers

When merging shards in Kinesis, what happens to the old shards?

They are closed and deleted once data expires (D)

Signup and view all the answers

What is a security measure implemented by Kinesis for data in transit?

Encryption using HTTPS endpoints (B)

Signup and view all the answers

What happens if a consumer in Kinesis tries to read the same data twice?

It can occur due to retry mechanisms (B)

Signup and view all the answers

What should be done to prevent duplicate records caused by producer retries?

Embed unique record IDs in the data (B)

Signup and view all the answers

In what scenario would resharding limitations affect Kinesis streams?

When multiple resharding operations are needed simultaneously (D)

Signup and view all the answers

Which statement about local file storage in EMR is accurate?

Utilized for temporary data storage (B)

Signup and view all the answers

Flashcards

SQL interface for S3

A way to run SQL queries directly on data stored in S3 without loading it.

Supported data formats

Formats that can be queried directly including CSV, JSON, ORC, Parquet, and Avro.