AWS S3 Quiz
117 Questions
2 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is the primary benefit of using CloudFront edge locations with S3?

  • Enhanced data analytics capabilities
  • Improved security for sensitive data
  • Reduced latency for read-intensive requests (correct)
  • Increased storage capacity for buckets
  • What type of data store is S3 classified as?

  • Volatile data store
  • Persistent data store (correct)
  • Ephemeral data store
  • Transient data store
  • What is the default number of buckets allowed per AWS account?

  • Unlimited buckets
  • 100 buckets (correct)
  • 50 buckets
  • 200 buckets
  • What is the purpose of CORS headers in S3?

    <p>To allow requests to a different origin</p> Signup and view all the answers

    Which storage class stores objects redundantly within a single Availability Zone?

    <p>S3 One Zone-IA</p> Signup and view all the answers

    What is the format for specifying resources in a policy?

    <p>arn:partition:service:region:namespace:relative-id</p> Signup and view all the answers

    What is the purpose of access auditing in S3?

    <p>To capture IAM/user identity information in logs</p> Signup and view all the answers

    Who can grant cross-account permissions to another AWS account?

    <p>The bucket owner</p> Signup and view all the answers

    What type of policy defines access to resources and can be associated with resources and users?

    <p>Access policy</p> Signup and view all the answers

    What is the maximum size of an object that can be uploaded in a single PUT request?

    <p>5 gigabytes</p> Signup and view all the answers

    What is the consistency model used for overwrite PUTS and DELETES in Amazon S3?

    <p>Eventual consistency</p> Signup and view all the answers

    How are objects stored in Amazon S3?

    <p>On multiple devices across multiple facilities (AZs) in an Amazon S3 region</p> Signup and view all the answers

    What is the purpose of the Requester Pays function in Amazon S3?

    <p>To remove anonymous access and make the requester pay</p> Signup and view all the answers

    What is the minimum size of a file that can be stored in Amazon S3?

    <p>0 bytes</p> Signup and view all the answers

    What is the recommended way to access Amazon S3?

    <p>Through SDKs and APIs</p> Signup and view all the answers

    What is the purpose of Event notifications in Amazon S3?

    <p>To send alerts or trigger actions for specific events</p> Signup and view all the answers

    What is the scalability of Amazon S3?

    <p>Scalable to high request rates</p> Signup and view all the answers

    What is the maximum size of a bucket policy?

    <p>20 KB</p> Signup and view all the answers

    What is the recommended use case for bucket ACLs?

    <p>Granting write permissions to the S3 Log Delivery group</p> Signup and view all the answers

    Which of the following is NOT a valid use case for bucket policies?

    <p>Defining access control lists for objects</p> Signup and view all the answers

    What is the maximum number of permissions that can be granted per object ACL?

    <p>100</p> Signup and view all the answers

    What is the purpose of transfer acceleration?

    <p>To reduce latency for object uploads to S3 over long distances</p> Signup and view all the answers

    What is the URL format for S3 Transfer Acceleration?

    <p>s3-accelerate.amazonaws.com</p> Signup and view all the answers

    Which of the following is NOT a supported feature for S3 Transfer Acceleration?

    <p>Transfer acceleration can be disabled</p> Signup and view all the answers

    What is the purpose of pre-signed URLs?

    <p>To grant temporary access to specific objects to users without AWS credentials</p> Signup and view all the answers

    Which of the following statements about S3 Static Websites is FALSE?

    <p>S3 Static Websites can use dynamic content such as PHP and .Net</p> Signup and view all the answers

    Which of the following is a valid use case for multipart upload?

    <p>To upload objects larger than 5 GB</p> Signup and view all the answers

    What happens when an object is deleted in a bucket with versioning enabled?

    <p>A DELETE marker is placed on the object.</p> Signup and view all the answers

    Which option is NOT a server-side encryption method supported by Amazon S3?

    <p>SSE-IA</p> Signup and view all the answers

    What must be enabled for Cross Region Replication to function?

    <p>Versioning on both source and destination buckets</p> Signup and view all the answers

    When is a lifecycle action applied in Amazon S3?

    <p>Based on an XML file configuration.</p> Signup and view all the answers

    What does enabling versioning in S3 mean for previously existing objects?

    <p>They will be assigned a version ID of NULL.</p> Signup and view all the answers

    What action does Multi-factor authentication (MFA) delete apply to?

    <p>Deleting versioned objects permanently</p> Signup and view all the answers

    Which of the following statements about lifecycle management is false?

    <p>It applies only to current versions of an object.</p> Signup and view all the answers

    What must be done before transitioning objects to the ONEZONE_IA class?

    <p>Store them for at least 30 days in STANDARD_IA.</p> Signup and view all the answers

    What is the minimum permission required to use a KMS customer master key while uploading an encrypted object?

    <p>kms:Decrypt</p> Signup and view all the answers

    What is true about the versioning feature in Amazon S3?

    <p>It cannot be suspended once enabled.</p> Signup and view all the answers

    What is a necessary requirement for enabling Cross Region Replication (CRR) in S3?

    <p>Replication can take place only if versioning is enabled for the source bucket.</p> Signup and view all the answers

    What does S3 object tagging allow you to do?

    <p>Create IAM policies based on object attributes.</p> Signup and view all the answers

    How often are CloudWatch Storage Metrics reported for S3 buckets?

    <p>Once per day.</p> Signup and view all the answers

    What is a requirement for configuring notifications in Amazon S3?

    <p>You need to define event types and message destinations.</p> Signup and view all the answers

    Which of the following is NOT a trigger for S3 replication?

    <p>Changes made to bucket policies.</p> Signup and view all the answers

    Which permission is necessary for Amazon S3 to replicate objects into a destination bucket?

    <p>The bucket owner must grant S3 permission to replicate objects.</p> Signup and view all the answers

    What is the intended use of S3 Lifecycle policies?

    <p>To manage the storage duration of S3 objects.</p> Signup and view all the answers

    What does the replication feature of S3 ensure regarding the objects being replicated?

    <p>Replicas will be exact copies with matching key names and metadata.</p> Signup and view all the answers

    What type of objects cannot be replicated with Cross Region Replication (CRR)?

    <p>KMS-encrypted objects without a valid KMS key.</p> Signup and view all the answers

    What is a key characteristic of Same Region Replication (SRR)?

    <p>It is asynchronous and requires no specific permissions.</p> Signup and view all the answers

    What types of Amazon S3 storage classes can objects be replicated to?

    <p>Any Amazon S3 storage class, including S3 Glacier</p> Signup and view all the answers

    What does S3 Inventory primarily provide?

    <p>Daily or weekly reports on object replication and encryption status</p> Signup and view all the answers

    What is a critical factor to avoid when setting up server access logging?

    <p>Designating the same bucket for both logging and log storage</p> Signup and view all the answers

    Which strategy is recommended for optimizing performance of Amazon S3?

    <p>Use byte-range fetches in GET Object requests</p> Signup and view all the answers

    What happens when changes occur to an S3 object that has replication configured?

    <p>A new replication is triggered to the destination bucket.</p> Signup and view all the answers

    What is the recommended tool for logging bucket and object-level actions in Amazon S3?

    <p>AWS CloudTrail</p> Signup and view all the answers

    Which of the following best describes the method to achieve optimal performance when working with S3?

    <p>Scale storage connections horizontally by making multiple concurrent requests</p> Signup and view all the answers

    How can the performance of applications using Amazon S3 be understood and improved?

    <p>Through Amazon CloudWatch metrics</p> Signup and view all the answers

    What is one of the benefits of using Object Tags in relation to replication?

    <p>They ensure that only specific objects are replicated based on tags.</p> Signup and view all the answers

    What happens to retry requests for latency-sensitive applications in Amazon S3?

    <p>They may take a different path and could quickly succeed.</p> Signup and view all the answers

    What does SSE-S3 use to encrypt each object in Amazon S3?

    <p>A unique key for each object and rotates a master key</p> Signup and view all the answers

    Which encryption method allows for an audit trail of key usage?

    <p>SSE-KMS</p> Signup and view all the answers

    In which scenario must you specify the encryption algorithm during the request?

    <p>SSE-C</p> Signup and view all the answers

    What header is used to provide the encrypted key for SSE-C in Amazon S3?

    <p>x-amz-server-side-encryption-customer-key</p> Signup and view all the answers

    What type of encryption allows the user to manage the encryption keys directly?

    <p>SSE-C</p> Signup and view all the answers

    What feature does client-side encryption using AWS KMS provide when downloading an object?

    <p>A version of the encrypted data key embedded in the metadata</p> Signup and view all the answers

    Which option does NOT require any additional permissions for key usage?

    <p>SSE-S3</p> Signup and view all the answers

    Which encryption standard is used by SSE-S3 for encrypting data?

    <p>256-bit AES</p> Signup and view all the answers

    What kind of keys does SSE-KMS support for added protection?

    <p>Customer-managed CMKs and AWS managed CMKs</p> Signup and view all the answers

    What is a use case for Amazon S3 event notifications?

    <p>To run workflows in response to object changes</p> Signup and view all the answers

    What is the primary advantage of accessing an S3 bucket from EC2 instances in the same AWS Region?

    <p>It reduces network latency and data transfer costs.</p> Signup and view all the answers

    What is the maximum archive size that can be uploaded to Glacier in a single operation?

    <p>4 GB</p> Signup and view all the answers

    Which retrieval tier in Glacier offers the fastest access to archived data?

    <p>S3 Glacier Instant Retrieval</p> Signup and view all the answers

    What happens to archived data after a retrieval request in Glacier?

    <p>The data is copied to S3 and the archive remains in Glacier.</p> Signup and view all the answers

    What is the minimum retrieval time for data stored in the S3 Glacier Flexible Retrieval tier?

    <p>Minutes to hours</p> Signup and view all the answers

    What is one of the main features of Amazon S3 Transfer Acceleration?

    <p>It manages fast transfers of files over long distances.</p> Signup and view all the answers

    How long do you have to download data after retrieving it from Glacier?

    <p>24 hours</p> Signup and view all the answers

    Which type of objects cannot be modified after they are uploaded to Glacier?

    <p>Archive objects</p> Signup and view all the answers

    What additional data is added to each object when transitioning to Glacier from other classes using lifecycle policies?

    <p>Indexing and archive metadata</p> Signup and view all the answers

    Which of the following is NOT a storage tier offered by Amazon S3 Glacier?

    <p>Glacier Standard Storage</p> Signup and view all the answers

    What is the maximum size of an object that can be uploaded in a single PUT request?

    <p>5 gigabytes</p> Signup and view all the answers

    What is the consistency model used for overwrite PUTS and DELETES in Amazon S3?

    <p>Eventual consistency</p> Signup and view all the answers

    What is the purpose of the Requester Pays function in Amazon S3?

    <p>To ensure that the requester pays for the cost of accessing objects</p> Signup and view all the answers

    What is the recommended way to access Amazon S3?

    <p>Through SDKs and APIs</p> Signup and view all the answers

    What is the purpose of Event notifications in Amazon S3?

    <p>To send alerts or trigger actions for specific events</p> Signup and view all the answers

    What is the primary purpose of using S3 Transfer Acceleration?

    <p>To enhance transfer speeds over long distances</p> Signup and view all the answers

    Which of the following statements about pre-signed URLs is true?

    <p>They allow temporary access to specific objects for users without credentials</p> Signup and view all the answers

    What is the limitation of bucket policies regarding size?

    <p>They are limited to 20 KB in size</p> Signup and view all the answers

    When uploading objects using multipart upload, which of the following is not an advantage?

    <p>Allows modifications to the object metadata during upload</p> Signup and view all the answers

    Which statement about static website hosting on S3 is false?

    <p>S3 can host dynamic content such as PHP scripts</p> Signup and view all the answers

    What is the primary purpose of Amazon S3 Cross Region Replication (CRR)?

    <p>To automatically replicate data across AWS Regions for disaster recovery and data redundancy</p> Signup and view all the answers

    Which of the following is NOT a requirement for enabling Cross Region Replication (CRR) in S3?

    <p>The source and destination buckets must be in the same AWS Region.</p> Signup and view all the answers

    Which of the following actions are NOT replicated during Cross Region Replication (CRR)?

    <p>Object ACL changes</p> Signup and view all the answers

    What is the key difference between Cross Region Replication (CRR) and Same Region Replication (SRR)?

    <p>CRR replicates data across AWS Regions, while SRR replicates data within the same AWS Region.</p> Signup and view all the answers

    What is the purpose of enabling S3 object tagging?

    <p>To facilitate data analysis and filtering of objects based on tags using CloudWatch metrics.</p> Signup and view all the answers

    What role does versioning play in Amazon S3?

    <p>It protects against accidental data overwrites and deletions.</p> Signup and view all the answers

    What happens to a DELETE marker when you delete an object in S3 with versioning enabled?

    <p>The DELETE marker is placed on the object, making it unavailable.</p> Signup and view all the answers

    Which statements are true regarding Amazon S3 server-side encryption?

    <p>SSE-S3 encrypts data with AES-256 using unique keys for each object.</p> Signup and view all the answers

    What must be configured before transitioning objects to the ONEZONE_IA class in S3?

    <p>Objects must be stored at least 30 days in the STANDARD_IA storage class.</p> Signup and view all the answers

    When versioning is enabled, who can permanently delete objects in an S3 bucket?

    <p>Only the S3 bucket owner.</p> Signup and view all the answers

    How does Amazon S3 ensure that replicated objects maintain their integrity and associated information?

    <p>By copying the object data along with its metadata, ACLs, and object tags.</p> Signup and view all the answers

    What is the primary function of S3 Inventory, in the context of Amazon S3 analytics?

    <p>Analyzing and reporting on the replication and encryption status of objects.</p> Signup and view all the answers

    Which of these strategies is NOT recommended for optimizing performance when interacting with Amazon S3?

    <p>Using byte-range fetches to retrieve portions of large objects.</p> Signup and view all the answers

    How does Amazon S3 server access logging differ from AWS CloudTrail logs in terms of logging capabilities?

    <p>Server access logging records detailed request information, while CloudTrail logs record a more generalized view of actions.</p> Signup and view all the answers

    Why does Amazon S3 recommend using CloudTrail for logging bucket and object-level actions?

    <p>CloudTrail offers more granular logging capabilities, enabling detailed auditing of actions on S3 resources.</p> Signup and view all the answers

    Which of the following is NOT a valid sub-resource associated with an Amazon S3 object?

    <p>Replication</p> Signup and view all the answers

    Which of the following is NOT a mechanism for controlling access to Amazon S3 resources?

    <p>CORS Configuration</p> Signup and view all the answers

    What is the purpose of the 'Access-Control-Allow-Origin' header in CORS configuration for Amazon S3?

    <p>To allow requests from specific origins or domains</p> Signup and view all the answers

    What is the main purpose of Amazon S3 Transfer Acceleration?

    <p>To minimize latency caused by distance</p> Signup and view all the answers

    Which Amazon S3 storage class is designed for objects that are accessed infrequently and are stored redundantly within a single Availability Zone?

    <p>One Zone-IA</p> Signup and view all the answers

    What is the key difference between S3 Glacier Deep Archive and S3 Glacier?

    <p>Retrieval times</p> Signup and view all the answers

    Which of the following statements about Amazon S3 bucket naming is TRUE?

    <p>Bucket names are case-sensitive.</p> Signup and view all the answers

    What happens when you retrieve an archive from Glacier?

    <p>The archive is copied to S3 and remains in Glacier</p> Signup and view all the answers

    What is the maximum size of a file that can be archived in Glacier?

    <p>1 byte to 40 TB</p> Signup and view all the answers

    How do you upload data to Glacier?

    <p>Using the CLI, SDKs, or APIs</p> Signup and view all the answers

    What encryption method uses 256-bit AES to secure data stored in Amazon S3?

    <p>Server-Side Encryption with S3 Managed Keys (SSE-S3)</p> Signup and view all the answers

    Which of the following features is unique to Server-Side Encryption with AWS KMS Keys (SSE-KMS) compared to others?

    <p>Provides an audit trail of key usage</p> Signup and view all the answers

    What headers must be specified when using Server-Side Encryption with Customer-Provided Keys (SSE-C)?

    <p>x-amz-server-side-encryption-customer-algorithm and x-amz-server-side-encryption-customer-key</p> Signup and view all the answers

    Which option correctly describes client-side encryption in relation to AWS services?

    <p>Objects are encrypted before they are sent to Amazon S3.</p> Signup and view all the answers

    Which type of encryption allows users to manage their own encryption keys and also specify where keys are stored?

    <p>Server-Side Encryption with Customer-Provided Keys (SSE-C)</p> Signup and view all the answers

    Study Notes

    Amazon S3 Overview

    • Amazon S3 is an object storage service that allows storing and retrieving any amount of data from anywhere on the Internet.
    • It's a simple storage service that offers an extremely durable, highly available, and infinitely scalable data storage infrastructure at very low costs.

    Key Features

    • Distributed architecture with objects redundantly stored on multiple devices across multiple facilities (AZs) in an Amazon S3 region.
    • Simple key-based object store with keys that can be any string and can be constructed to mimic hierarchical attributes.
    • Supports S3 Object Tagging to organize data across all S3 buckets and/or prefixes.
    • Provides a simple, standards-based REST web services interface that is designed to work with any Internet-development toolkit.

    Storage Details

    • Files can be from 0 bytes to 5TB in size.
    • The largest object that can be uploaded in a single PUT is 5 gigabytes.
    • Objects larger than 100 megabytes require the Multipart Upload capability.
    • Updates to an object are atomic, ensuring either the new object or the old one is retrieved, never partial or corrupt data.

    Access Control

    • Access to buckets and objects can be granted to AWS accounts, IAM users, and federated users.
    • Supports four mechanisms for controlling access to Amazon S3 resources: IAM permissions, Bucket Policies, ACLs, and Query String Authentication.
    • Requester pays function causes the requester to pay (removes anonymous access) and provides time-limited access to objects.

    Data Consistency

    • Provides read after write consistency for PUTS of new objects.
    • Provides eventual consistency for overwrite PUTS and DELETES (takes time to propagate).

    Additional Capabilities

    • Supports unlimited storage with automatic scaling to high request rates.
    • Allows for event notifications for specific actions, sending alerts or triggering actions.
    • Supports cross-region replication with versioning enabled on both source and destination buckets.
    • Enables fast, easy, and secure transfers of files over long distances with Amazon S3 Transfer Acceleration.

    Storage Classes

    • Offers six storage classes: S3 Standard, S3 Infrequent Access, S3 One Zone-Infrequent Access, S3 Glacier, S3 Glacier Deep Archive, and S3 Intelligent-Tiering.

    Bucket Management

    • Buckets are region-specific and have a unique namespace.
    • Bucket names can be up to 63 characters and must be DNS-compliant.
    • Supports up to 100 buckets per account by default, and objects can be stored in folders within buckets.
    • Bucket ownership is not transferable, and bucket names cannot be changed after creation.

    Objects and Sub-Resources

    • Each object is stored and retrieved by a unique key (ID or name).
    • Sub-resources are subordinate to objects, including object tags, access control lists (ACLs), and metadata.

    Security and Encryption

    • Supports server-side encryption options: SSE-S3, SSE-KMS, and SSE-C.
    • Enables client-side encryption using customer-provided encryption keys.
    • Supports encryption and decryption of data in transit using SSL/TLS.

    Event Notifications

    • Supports event notifications for specific actions, such as PUTs, POSTs, COPYs, or DELETEs.
    • Enables notifications to be sent to Amazon SNS, Amazon SQS, or AWS Lambda.

    Metrics and Analytics

    • Supports Amazon S3 CloudWatch metrics for request and storage data.
    • Enables configuration of filters for metrics using a prefix or object tag.

    Cross-Region Replication

    • Automatically replicates data across AWS Regions with Cross-Region Replication (CRR).

    • Requires versioning to be enabled on both the source and destination buckets.

    • Supports automatic, asynchronous copying of objects between buckets in different regions.### Cross-Region Replication (CRR)

    • Source and destination buckets must be in different regions.

    • Replication is 1:1 (one source bucket to one destination bucket).

    • You can configure separate S3 Lifecycle rules on the source and destination buckets.

    • You can replicate KMS-encrypted objects by providing a destination KMS key in your replication configuration.

    • You can set up CRR across AWS accounts to store your replicated data in a different account in the target region.

    • To activate CRR, you need to configure the replication on the source bucket.

    • Replicas will be exact replicas and share the same key names and metadata.

    • You can specify a different storage class (by default, the source storage class will be used).

    • AWS S3 will encrypt data in-transit with SSL.

    Same Region Replication (SRR)

    • SRR allows you to replicate objects to a destination bucket within the same region as the source bucket.
    • Replication is automatic and asynchronous.
    • New objects uploaded to an Amazon S3 bucket are configured for replication at the bucket, prefix, or object tag levels.
    • Replicated objects can be owned by the same AWS account as the original copy or by different accounts, to protect from accidental deletion.
    • Replication can be to any Amazon S3 storage class, including S3 Glacier and S3 Glacier Deep Archive to create backups and long-term archives.

    S3 Analytics

    • S3 Analytics allows you to run analytics on data stored on Amazon S3.
    • This includes data lakes, IoT streaming data, machine learning, and artificial intelligence.
    • You can use S3 Inventory to audit and report on the replication and encryption status of your objects for business, compliance, and regulatory needs.

    S3 Inventory

    • S3 Inventory provides comma-separated values (CSV), Apache optimized row columnar (ORC) or Apache Parquet (Parquet) output files that list your objects and their corresponding metadata on a daily or weekly basis for an S3 bucket or a shared prefix.
    • You can use S3 Inventory to audit and report on the replication and encryption status of your objects for business, compliance, and regulatory needs.

    Monitoring and Reporting

    • Amazon CloudWatch metrics for Amazon S3 can help you understand and improve the performance of applications that use Amazon S3.
    • You can use CloudWatch with Amazon S3 to monitor and report on performance.

    Logging and Auditing

    • You can record the actions taken by users, roles, or AWS services on Amazon S3 resources and maintain log records for auditing and compliance purposes.
    • You can use Amazon S3 server access logging, AWS CloudTrail logs, or a combination of both to log and audit actions on Amazon S3 resources.

    S3 Performance Guidelines

    • Measure Performance: Look at network throughput, CPU, and DRAM requirements when optimizing performance.
    • Scale Storage Connections Horizontally: Issue multiple concurrent requests to Amazon S3 and spread them over separate connections to maximize the accessible bandwidth from Amazon S3.
    • Use Byte-Range Fetches: Fetch a byte-range from an object using the Range HTTP header in a GET Object request to achieve higher aggregate throughput.
    • Retry Requests for Latency-Sensitive Applications: Aggressive timeouts and retries help drive consistent latency.
    • Combine Amazon S3 (Storage) and Amazon EC2 (Compute) in the Same AWS Region: Access the bucket from Amazon EC2 instances in the same AWS Region when possible to reduce network latency and data transfer costs.
    • Use Amazon S3 Transfer Acceleration to Minimize Latency Caused by Distance: Use Amazon S3 Transfer Acceleration to minimize latency caused by distance between the client and an S3 bucket.

    Glacier

    • Glacier is an archiving storage solution for infrequently accessed data.
    • There are three storage tiers: S3 Glacier Instant Retrieval, S3 Glacier Flexible Retrieval (Formerly S3 Glacier), and Amazon S3 Glacier Deep Archive (S3 Glacier Deep Archive).
    • The key difference between the top tiers is that Deep Archive is lower cost, but retrieval times are much longer (12 hours).
    • Archived objects are not available for real-time access, and you need to submit a retrieval request.
    • Glacier must complete a job before you can get its output.
    • Requested archival data is copied to S3 One Zone-IA.
    • Following retrieval, you have 24 hours to download your data.
    • You cannot specify Glacier as the storage class at the time you create an object.
    • Glacier is designed to sustain the loss of two facilities.
    • Glacier automatically encrypts data at rest using AES 256 symmetric keys and supports secure transfer of data over SSL.
    • Glacier may not be available in all AWS regions.
    • Glacier objects are visible through S3 only (not Glacier directly).
    • Glacier does not archive object metadata; you need to maintain a client-side database to maintain this information.
    • Archives can be 1 byte up to 40TB.
    • Glacier file archives of 1 byte – 4 GB can be performed in a single operation.
    • Glacier file archives from 100MB up to 40TB can be uploaded to Glacier using the multipart upload API.
    • Uploading archives is synchronous, and downloading archives is asynchronous.
    • The contents of an archive that has been uploaded cannot be modified.
    • You can upload data to Glacier using the CLI, SDKs, or APIs – you cannot use the AWS Console.
    • Glacier adds 32-40KB (indexing and archive metadata) to each object when transitioning from other classes using lifecycle policies.
    • AWS recommends that if you have lots of small objects, they are combined in an archive (e.g., zip file) before uploading.
    • A description can be added to archives, but no other metadata can be added.
    • Glacier archive IDs are added upon upload and are unique for each upload.

    Archive Retrieval

    • You can retrieve parts of an archive.
    • When data is retrieved, it is copied to S3, and the archive remains in Glacier, and the storage class does not change.
    • AWS SNS can send notifications when retrieval jobs are complete.
    • Retrieved data is available for 24 hours by default (can be changed).
    • To retrieve specific objects within an archive, you can specify the byte range (Range) in the HTTP GET request (need to maintain a DB of byte ranges).

    Glacier Charges

    • There is no charge for data transfer between EC2 and Glacier in the same region.
    • There is a charge if you delete data within 90 days.
    • When you restore, you pay for:

    Amazon S3 Overview

    • Amazon S3 is an object storage service that allows storing and retrieving any amount of data from anywhere on the Internet.
    • It's a simple storage service that offers an extremely durable, highly available, and infinitely scalable data storage infrastructure at very low costs.

    Key Features

    • Distributed architecture with objects redundantly stored on multiple devices across multiple facilities (AZs) in an Amazon S3 region.
    • Simple key-based object store with keys that can be any string and can be constructed to mimic hierarchical attributes.
    • Supports S3 Object Tagging to organize data across all S3 buckets and/or prefixes.
    • Provides a simple, standards-based REST web services interface that is designed to work with any Internet-development toolkit.

    Storage Details

    • Files can be from 0 bytes to 5TB in size.
    • The largest object that can be uploaded in a single PUT is 5 gigabytes.
    • Objects larger than 100 megabytes require the Multipart Upload capability.
    • Updates to an object are atomic, ensuring either the new object or the old one is retrieved, never partial or corrupt data.

    Access Control

    • Access to buckets and objects can be granted to AWS accounts, IAM users, and federated users.
    • Supports four mechanisms for controlling access to Amazon S3 resources: IAM permissions, Bucket Policies, ACLs, and Query String Authentication.
    • Requester pays function causes the requester to pay (removes anonymous access) and provides time-limited access to objects.

    Data Consistency

    • Provides read after write consistency for PUTS of new objects.
    • Provides eventual consistency for overwrite PUTS and DELETES (takes time to propagate).

    Additional Capabilities

    • Supports unlimited storage with automatic scaling to high request rates.
    • Allows for event notifications for specific actions, sending alerts or triggering actions.
    • Supports cross-region replication with versioning enabled on both source and destination buckets.
    • Enables fast, easy, and secure transfers of files over long distances with Amazon S3 Transfer Acceleration.

    Storage Classes

    • Offers six storage classes: S3 Standard, S3 Infrequent Access, S3 One Zone-Infrequent Access, S3 Glacier, S3 Glacier Deep Archive, and S3 Intelligent-Tiering.

    Bucket Management

    • Buckets are region-specific and have a unique namespace.
    • Bucket names can be up to 63 characters and must be DNS-compliant.
    • Supports up to 100 buckets per account by default, and objects can be stored in folders within buckets.
    • Bucket ownership is not transferable, and bucket names cannot be changed after creation.

    Objects and Sub-Resources

    • Each object is stored and retrieved by a unique key (ID or name).
    • Sub-resources are subordinate to objects, including object tags, access control lists (ACLs), and metadata.

    Security and Encryption

    • Supports server-side encryption options: SSE-S3, SSE-KMS, and SSE-C.
    • Enables client-side encryption using customer-provided encryption keys.
    • Supports encryption and decryption of data in transit using SSL/TLS.

    Event Notifications

    • Supports event notifications for specific actions, such as PUTs, POSTs, COPYs, or DELETEs.
    • Enables notifications to be sent to Amazon SNS, Amazon SQS, or AWS Lambda.

    Metrics and Analytics

    • Supports Amazon S3 CloudWatch metrics for request and storage data.
    • Enables configuration of filters for metrics using a prefix or object tag.

    Cross-Region Replication

    • Automatically replicates data across AWS Regions with Cross-Region Replication (CRR).

    • Requires versioning to be enabled on both the source and destination buckets.

    • Supports automatic, asynchronous copying of objects between buckets in different regions.### Cross-Region Replication (CRR)

    • Source and destination buckets must be in different regions.

    • Replication is 1:1 (one source bucket to one destination bucket).

    • You can configure separate S3 Lifecycle rules on the source and destination buckets.

    • You can replicate KMS-encrypted objects by providing a destination KMS key in your replication configuration.

    • You can set up CRR across AWS accounts to store your replicated data in a different account in the target region.

    • To activate CRR, you need to configure the replication on the source bucket.

    • Replicas will be exact replicas and share the same key names and metadata.

    • You can specify a different storage class (by default, the source storage class will be used).

    • AWS S3 will encrypt data in-transit with SSL.

    Same Region Replication (SRR)

    • SRR allows you to replicate objects to a destination bucket within the same region as the source bucket.
    • Replication is automatic and asynchronous.
    • New objects uploaded to an Amazon S3 bucket are configured for replication at the bucket, prefix, or object tag levels.
    • Replicated objects can be owned by the same AWS account as the original copy or by different accounts, to protect from accidental deletion.
    • Replication can be to any Amazon S3 storage class, including S3 Glacier and S3 Glacier Deep Archive to create backups and long-term archives.

    S3 Analytics

    • S3 Analytics allows you to run analytics on data stored on Amazon S3.
    • This includes data lakes, IoT streaming data, machine learning, and artificial intelligence.
    • You can use S3 Inventory to audit and report on the replication and encryption status of your objects for business, compliance, and regulatory needs.

    S3 Inventory

    • S3 Inventory provides comma-separated values (CSV), Apache optimized row columnar (ORC) or Apache Parquet (Parquet) output files that list your objects and their corresponding metadata on a daily or weekly basis for an S3 bucket or a shared prefix.
    • You can use S3 Inventory to audit and report on the replication and encryption status of your objects for business, compliance, and regulatory needs.

    Monitoring and Reporting

    • Amazon CloudWatch metrics for Amazon S3 can help you understand and improve the performance of applications that use Amazon S3.
    • You can use CloudWatch with Amazon S3 to monitor and report on performance.

    Logging and Auditing

    • You can record the actions taken by users, roles, or AWS services on Amazon S3 resources and maintain log records for auditing and compliance purposes.
    • You can use Amazon S3 server access logging, AWS CloudTrail logs, or a combination of both to log and audit actions on Amazon S3 resources.

    S3 Performance Guidelines

    • Measure Performance: Look at network throughput, CPU, and DRAM requirements when optimizing performance.
    • Scale Storage Connections Horizontally: Issue multiple concurrent requests to Amazon S3 and spread them over separate connections to maximize the accessible bandwidth from Amazon S3.
    • Use Byte-Range Fetches: Fetch a byte-range from an object using the Range HTTP header in a GET Object request to achieve higher aggregate throughput.
    • Retry Requests for Latency-Sensitive Applications: Aggressive timeouts and retries help drive consistent latency.
    • Combine Amazon S3 (Storage) and Amazon EC2 (Compute) in the Same AWS Region: Access the bucket from Amazon EC2 instances in the same AWS Region when possible to reduce network latency and data transfer costs.
    • Use Amazon S3 Transfer Acceleration to Minimize Latency Caused by Distance: Use Amazon S3 Transfer Acceleration to minimize latency caused by distance between the client and an S3 bucket.

    Glacier

    • Glacier is an archiving storage solution for infrequently accessed data.
    • There are three storage tiers: S3 Glacier Instant Retrieval, S3 Glacier Flexible Retrieval (Formerly S3 Glacier), and Amazon S3 Glacier Deep Archive (S3 Glacier Deep Archive).
    • The key difference between the top tiers is that Deep Archive is lower cost, but retrieval times are much longer (12 hours).
    • Archived objects are not available for real-time access, and you need to submit a retrieval request.
    • Glacier must complete a job before you can get its output.
    • Requested archival data is copied to S3 One Zone-IA.
    • Following retrieval, you have 24 hours to download your data.
    • You cannot specify Glacier as the storage class at the time you create an object.
    • Glacier is designed to sustain the loss of two facilities.
    • Glacier automatically encrypts data at rest using AES 256 symmetric keys and supports secure transfer of data over SSL.
    • Glacier may not be available in all AWS regions.
    • Glacier objects are visible through S3 only (not Glacier directly).
    • Glacier does not archive object metadata; you need to maintain a client-side database to maintain this information.
    • Archives can be 1 byte up to 40TB.
    • Glacier file archives of 1 byte – 4 GB can be performed in a single operation.
    • Glacier file archives from 100MB up to 40TB can be uploaded to Glacier using the multipart upload API.
    • Uploading archives is synchronous, and downloading archives is asynchronous.
    • The contents of an archive that has been uploaded cannot be modified.
    • You can upload data to Glacier using the CLI, SDKs, or APIs – you cannot use the AWS Console.
    • Glacier adds 32-40KB (indexing and archive metadata) to each object when transitioning from other classes using lifecycle policies.
    • AWS recommends that if you have lots of small objects, they are combined in an archive (e.g., zip file) before uploading.
    • A description can be added to archives, but no other metadata can be added.
    • Glacier archive IDs are added upon upload and are unique for each upload.

    Archive Retrieval

    • You can retrieve parts of an archive.
    • When data is retrieved, it is copied to S3, and the archive remains in Glacier, and the storage class does not change.
    • AWS SNS can send notifications when retrieval jobs are complete.
    • Retrieved data is available for 24 hours by default (can be changed).
    • To retrieve specific objects within an archive, you can specify the byte range (Range) in the HTTP GET request (need to maintain a DB of byte ranges).

    Glacier Charges

    • There is no charge for data transfer between EC2 and Glacier in the same region.
    • There is a charge if you delete data within 90 days.
    • When you restore, you pay for:

    Studying That Suits You

    Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

    Quiz Team

    Description

    Assess your knowledge of Amazon S3 features and concepts, including CloudFront edge locations, data storage, bucket limits, CORS headers, and storage classes.

    More Like This

    Use Quizgecko on...
    Browser
    Browser