Cloud Services: Cost, Pricing, and Architecture
53 Questions

Questions and Answers

Which factor most significantly influences the cost-effectiveness of cloud services compared to on-premise IT solutions?

  • The color scheme of the cloud provider's user interface.
  • The geographical location of the data centers.
  • The extent to which administration can be automated and costs can be amortized. (correct)
  • The brand reputation of the cloud vendor.

What is a key implication of cloud pricing structures on software architecture?

  • The frequency of software updates.
  • The best software architecture to leverage cloud services effectively. (correct)
  • The user interface design.
  • The selection of the programming language.

Why is understanding cloud service pricing crucial for organizations utilizing cloud resources?

  • To impress stakeholders with technical jargon.
  • To gain access to exclusive cloud vendor merchandise.
  • To ensure compliance with industry-standard color palettes.
  • To accurately budget and optimize cloud expenditures, avoiding unexpected costs. (correct)

A company is planning to migrate its services to the cloud. Which of the following strategies would be most effective in managing and minimizing cloud costs?

  • Automating resource scaling and leveraging cost amortization strategies. (B)

A development team is designing a new application to be hosted on EC2. Which of the following factors should they consider least when selecting an EC2 instance type?

  • The proximity of the instance to the equator. (B)

Which of the following strategies can be used within the AWS EC2 environment to minimize service disruption when resizing an instance?

  • Implement a load balancer to distribute traffic across multiple instances, allowing one to be stopped and resized without affecting overall service availability. (A)

An application requires low-latency network connectivity. Based on the provided data, which region would be the most suitable for deploying resources if the primary region is eu-central-1?

  • eu-south-1 (Milan) (A)

How does EC2 handle bandwidth allocation for smaller instance slices on the same machine?

  • Bandwidth is dynamically allocated, providing unused bandwidth from neighbors to other instances on the same machine for a limited time. (D)

What happens to the extra bandwidth allocated to an EC2 instance in steady state?

  • The instance is forced back to its baseline bandwidth, approximating the paid-for fraction. (B)

What is a key consideration when selecting an AWS region for deploying resources, besides network latency?

  • Pricing variations, as some regions can be more expensive than others. (A)

Which Amazon S3 storage class offers the lowest storage cost per TB per month?

  • Glacier Deep Archive (D)

What is a primary benefit for customers using the AWS Marketplace from a purchasing perspective?

  • Consolidating third-party software spending on their AWS bill. (D)

A company requires immediate consistency for listing objects after writes in their cloud storage solution. Which Amazon S3 feature directly supports this requirement?

  • Strong Consistency for List Operations (A)

What role does the seller fulfill in AWS Marketplace subscription-based pricing?

  • Metering and reporting resource consumption via the AWS Marketplace Metering API. (A)

A development team needs a PaaS solution on AWS. Which service should they consider?

  • Beanstalk (B)

A company wants to implement a content delivery network (CDN) to improve the speed and availability of their web content. Which AWS service should they use?

  • CloudFront (C)

What is the function of the Amazon Simple Notification Service (SNS)?

  • Facilitating publish/subscribe messaging. (D)

Which AWS service is specifically designed for big data batch processing using frameworks like Spark and MapReduce?

  • EMR (D)

Consider a scenario where an application needs to ensure that updates to an S3 object are made only if specific conditions are met. Which S3 feature would best support these requirements?

  • Conditional Writes (D)

A company wants to monitor the performance of their AWS resources and receive alerts for potential issues. Which AWS service should they use?

  • CloudWatch (B)

What is the primary benefit of AWS Nitro Cards in the context of virtualization?

  • They reduce virtualization overhead by offloading tasks to dedicated hardware. (D)

Compared to CPUs, what is a significant advantage GPUs offer for machine learning tasks?

  • Significantly more floating point operations per second (FLOPS). (C)

Why might a company consider switching cloud providers specifically for machine learning training?

  • Because the cost of specialized GPU instances can vary significantly between providers. (C)

What is the main advantage of using AWS Trainium and Inferentia2 accelerators over general-purpose GPUs for machine learning?

  • Optimized performance and cost-effectiveness for specific machine learning workloads. (B)

In what scenario would FPGAs be a more suitable choice than CPUs, GPUs, or ML accelerators?

  • When the workload can benefit from pipeline parallelism and customized hardware logic. (C)

What is the primary reason that cloud computing, if not carefully monitored, can become very expensive?

  • The ease of provisioning resources without fully understanding the pricing structure. (B)

What is the purpose of custom AWS hardware?

  • To improve performance, security, and efficiency of AWS services. (D)

Which of the following accurately describes the trade-off between GPUs and CPUs?

  • GPUs typically offer more FLOPS but less memory capacity than CPUs. (A)

Which of the following characteristics makes Function as a Service (FaaS) like AWS Lambda most suitable for specific types of workloads?

  • Its automatic scaling and stateless nature, ideal for short invocations and tasks that can be broken into stateless computations. (B)

Which of the following is NOT a benefit of using FPGAs?

  • They have extremely low upfront costs. (C)

Compute, storage, and network resources can all be cost bottlenecks. What is the most important consideration when handling these bottlenecks?

  • Pricing structure. (B)

What is the primary factor that determines the cost of Lambda function execution, besides the number of requests?

  • The duration of the function's execution and the amount of memory (RAM) allocated to it. (A)

Considering the limitations of AWS Lambda, which scenario would be least suitable for its implementation?

  • Running a web server that requires persistent connections. (D)

How does the pricing model of Hetzner's dedicated servers differ from their shared/dedicated vCPU offerings?

  • Dedicated servers have a monthly fee, plus a one-time setup fee, while vCPUs are billed hourly. (D)

If an application requires guaranteed CPU performance and is sensitive to resource contention, which of the following AWS EC2 instance types would be the most suitable?

  • c6g.medium (Compute Optimized) (C)

What is a key disadvantage of using burstable performance instances like t4g.medium compared to general-purpose instances like m6g.2xlarge for demanding applications?

  • Burstable instances have unpredictable CPU performance due to resource oversubscription. (B)

A company is deciding between using AWS Lambda and provisioning their own servers. They need to process a high volume of unpredictable, short-lived requests. What advantage does AWS Lambda offer in this scenario?

  • Automatic scaling and no operational overhead for server management. (C)

A developer wants to optimize the cost of running an AWS Lambda function that processes data from S3. The function's execution time varies significantly depending on the input data size. Which strategy would be most effective for cost optimization?

  • Profile the function's performance with different memory allocations and choose the configuration that minimizes the product of execution time and memory usage. (A)

What is the primary factor influencing network latency within an Availability Zone (AZ)?

  • Network infrastructure and proximity of data centers. (C)

Which of the following outbound data transfer scenarios would typically be free of charge in AWS, as of the information provided?

  • Data transfer to the internet when closing an AWS account. (B)

What is the cost associated with data transfer between two EC2 instances located within the same Availability Zone (AZ) using private IP addresses?

  • Free. (A)

A company needs to transfer 2TB of data between two Availability Zones within the same AWS region. What would be the total cost of this transfer?

  • $40 (D)

Which Placement Group strategy is best suited for distributing instances across different physical racks to minimize correlated failures?

  • Spread. (A)

A company is deploying a new application and must comply with data sovereignty laws requiring data to reside within a specific geographic boundary. Which AWS Region/AZ choice consideration is most relevant in this scenario?

  • Legal Reasons. (B)

Which EC2 pricing model offers the most flexibility but is also typically the most expensive?

  • On Demand. (A)

A company has a consistent, predictable workload and wants to minimize EC2 costs. Which pricing model would be the most cost-effective choice?

  • Reserved. (A)

What is the primary purpose of the Reserved Instance Marketplace?

  • To buy and sell unused Reserved Instances with remaining contract duration. (A)

A company commits to spending $80 per hour on AWS compute resources through a Savings Plan. In one particular hour, their actual spending is only $60. What happens with the remaining $20?

  • The company receives a discount on the $60 of resources used, and the remaining $20 is lost. (D)

What is the key difference between Compute Savings Plans and EC2 Instance Savings Plans?

  • Compute Savings Plans can be used for any EC2 instance, while EC2 Instance Savings Plans are specific to an instance family and region. (A)

A company is running a large batch processing job that is not time-sensitive and can tolerate interruptions. Which EC2 pricing model would be the most cost-effective?

  • Spot. (B)

What advance notice does AWS typically provide before terminating a Spot Instance?

  • 2 minutes. (A)

A company has a base load of critical applications that must always be available, as well as some large, non-critical jobs that can be interrupted. What is the recommended strategy for EC2 pricing models?

  • Use 3-year Reserved Instances for the base load and Spot Instances for the non-critical jobs. (C)

What is a good rule of thumb for average EC2 costs when using a mix of pricing models?

  • Pay half the On-Demand cost. (A)

Flashcards

Cloud Costs

Expenses incurred while using cloud services, often significant for software products.

Public Cloud Vendors

Companies that offer cloud services like AWS, which have changed the hardware landscape.

Elastic Compute Cloud (EC2)

Amazon's service for providing virtual machines of varying types for different computing needs.

Instance Types

Different configurations of virtual machines in EC2, optimized for various tasks like memory or compute.

Pricing Structures

How cloud services charge for usage, impacting software architecture decisions.

Bandwidth Allocation

Unused bandwidth from one instance can be allocated to others temporarily.

EC2 Instance Change

To change an EC2 instance's configuration (e.g., its size), the instance must be stopped and restarted, so the change should be planned to avoid service disruption.

AWS Regions

Physical locations of AWS data centers, with varying prices across regions.

Network Latency

The time it takes for data to travel between data centers, measured as round-trip time.

Live Migration Support

EC2 does not support live migration; instances must be stopped for size changes.

Lambda: Function as a Service

A cloud computing service that automatically assigns code to compute hardware for short invocations.

Automatic Scaling

The ability to adjust resource usage dynamically based on demand, often seen in serverless architectures.

Stateless Tasks

Tasks that do not maintain a persistent state across executions, allowing for easier management in Lambda.

Lambda Pricing - Duration

Cost of Lambda functions is based on memory size and execution time, measured in GB-seconds.

Request Cost

The charge incurred for executing requests in AWS Lambda, set at $0.20 per 1 million requests.

Lambda Limitations

Maximum constraints of Lambda include 15-minute execution time, 10,240 MB memory, and network communication limits.

CPU Proportionality

In AWS Lambda, the number of vCPUs is proportional to the allocated main memory.

Networking Constraints

Lambda functions cannot communicate directly with each other, which impacts design of workflows.

S3 Strong Consistency

S3 ensures immediate visibility of changes after a write operation when listing objects.

S3 Variants

Different types of storage classes in S3 with varying costs and access times.

Glacier Storage Classes

S3 Glacier offers options for long-term storage with different retrieval times and costs.

Intelligent Tiering

A storage option that automatically moves data between frequent and infrequent access tiers.

AWS Marketplace

Platform for buying/selling third-party software and SaaS services on AWS.

Subscription-based Pricing

Pricing model where customers are charged based on usage, with no upfront costs.

Conditional Writes

Allows specific conditions to be set for data writing operations in S3.

Identity and Access Management (IAM)

Service for managing user permissions and access to AWS resources.

AWS CloudWatch

Monitoring service for logging, tracking, and alarming AWS resources.

Simple Notification Service (SNS)

Service used for messaging and notifications between decoupled applications.

Availability Zones (AZ)

Isolated clusters of nearby data centers within a region, with very low latency inside each AZ.

Internet Traffic Cost

Charges for data leaving a cloud service, typically ranging from $0.05-$0.09/GB.

Intra-AWS Transfer Cost

Cost structure for data transfer within AWS, free in the same AZ but charged across regions.

Placement Groups

Groupings of EC2 instances to control where they are deployed physically.

On Demand Pricing

Flexible cloud pricing model based on per-second usage without long-term commitment.

Reserved Instances

EC2 instances reserved for 1 or 3 years in exchange for significant savings of 50-70%.

Savings Plans

A commitment to a fixed hourly spend over 1 or 3 years in exchange for discounted usage.

Spot Instances

EC2 instances sold at a discount of 60-70% based on surplus capacity.

Interruption Notice

A 2-minute warning received before a spot instance is forcibly stopped.

EC2 Pricing Models Summary

Best practice involves using a mix of EC2 pricing models.

Fault Tolerance

The ability of a system to continue operating despite failures.

Data Transfer via S3

A potentially cheaper method for transferring data between AZs in the same region.

Network Bandwidth Cost

Charges related to data transfer speeds and volumes across the network.

Legal Reasons for Region Choice

Legal compliance can dictate data handling location decisions.

AWS Nitro Cards

Specialized hardware cards that enhance performance for VPC, EBS, and storage.

Floating Point Operations Per Second (FLOPS)

A measure of a computer's performance, especially in machine learning tasks.

Graphics Processing Unit (GPU)

A hardware component designed for graphics that also excels at machine learning computations.

Machine Learning Accelerators

Specialized hardware designed to speed up machine learning tasks, like AWS Trainium and Inferentia.

Field Programmable Gate Array (FPGA)

Programmable hardware used for specific tasks; good for customization and parallelism.

Application-Specific Integrated Circuit (ASIC)

Custom hardware designed for a specific use case, often costly.

AWS Trainium

An AWS machine learning accelerator offering high performance for ML tasks.

AWS Inferentia2

An AWS chip designed specifically for inference in machine learning.

GPU instance types

EC2 types that utilize GPUs for enhanced computational tasks, like g4dn and p3.

Machine Learning Training Cost

The high cost of training ML models, which often leads companies to consider alternative clouds.

Study Notes

Cloud Information Systems

  • Presented by Prof. Dr. Viktor Leis, Chair for Decentralized Information Systems and Data Management
  • Focuses on Cloud Information Systems, specifically Amazon Web Services (AWS)

Case Study: Amazon Web Services

  • Amazon's extreme growth led to sophisticated in-house IT infrastructure
  • AWS launched EC2 and S3 in 2006, triggering the modern era of cloud computing
  • Microsoft Azure followed in 2010, Google Cloud in 2012

Modern Cloud Era

  • Highlighting Amazon's decision to sell its internal IT services
  • Cloud costs are a significant factor in software product development and administration

Hyperscalers Market Share

  • Showing market share trend of hyperscalers like Amazon, Microsoft, Google, Alibaba, and IBM
  • Data from Synergy Research Group.
  • Market share data from Q4 2017 to Q2 2022, showing global revenue trends.

Market Share

  • Graph depicts market cap and market share of major cloud providers (EU & US)
  • Figures in EUR billion and percentage points respectively.

AWS Deep Dive

  • Detailed AWS Services and Pricing Models will be explored
  • Costs of cloud computing resources are a significant consideration.
  • Public cloud providers' offerings are affecting hardware landscapes
  • Minor changes can significantly impact costs
  • Cloud pricing plays a critical role in software architecture and effective cloud usage.

EC2

  • A large set of heterogeneous virtual machines (instances).
  • Types include general purpose, compute optimized, memory optimized, accelerated computing, storage optimized, and HPC optimized.
  • Previously primarily based on 2-socket Intel servers; now also includes AMD and ARM chips (Graviton).
  • Includes extensive details on available instance types.

Instance Sizes

  • Tabular data on mainstream instance families with Intel CPUs, featuring vCPUs, DRAM, instance SSD, network bandwidth & cost per hour
  • Data is outdated.

Instance Slices

  • Virtual machines created by slicing physical servers
  • Uses Intel and ARM/Graviton processors
  • Costs usually scale linearly with resources.
  • Offers fine-grained RAM/CPU configuration using various API names.

Burst Bandwidth

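  • Network bandwidth is shared among the instance slices on the same physical machine.
  • Unused bandwidth is temporarily allocated to other instances on that machine; in steady state, each instance is forced back to its baseline bandwidth.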
  • Referencing specific AWS documentation links.

Changing Instances

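  • EC2 instances need to be stopped and restarted to change their configuration (e.g., size).
  • Such changes must be planned to prevent service disruption, for example by placing a load balancer in front of multiple instances so that one can be stopped and resized at a time.
  • Some alternative cloud providers offer live migration.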

Regions, AZs, Networking

  • AWS has 27 physical locations (data centers) globally.
  • Price variations exist based on region, e.g., Frankfurt vs. US east coast.
  • Region sizes, presented with a pie chart of IP address counts for AWS regions in 2022
  • Network latency data (round-trip latency) from eu-central-1 to other regions (e.g., Milan, Ireland, US East, US West, Singapore).

Availability Zones (AZs)

  • Clusters of geographically close data centers, but not directly adjacent.
  • Latency within an AZ is very low (down to 0.03ms)

Internet Traffic Cost

  • Inbound data transfer is free; outbound transfer to the internet is charged, typically $0.05-$0.09/GB.
  • Pricing depends on the data volume.
  • Since March 2024, outbound transfer is free for customers leaving AWS, a change possibly triggered by the European Data Act.

Intra-AWS Transfer Cost

  • Intra-AZ transfer cost is free (private IP).
  • Inter-region transfer is priced at $0.01 - $0.02/GB.
  • S3 can sometimes be a cheaper alternative for inter-AZ transfer.
  • Microsoft Azure made a similar change.
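
The transfer rules above lend themselves to a quick estimate. The following is a minimal sketch, assuming flat per-GB rates and decimal terabytes (1 TB = 1000 GB). The $0.02/GB inter-AZ rate is inferred from the quiz answer ($40 for 2 TB) rather than taken from a price list, and the function and dictionary names are illustrative.

```python
# Illustrative per-GB rates in USD; assumptions for this sketch, not official prices.
RATES_PER_GB = {
    "same_az_private_ip": 0.00,  # free according to the notes
    "inter_az": 0.02,            # inferred from the quiz: 2 TB -> $40
    "inter_region": 0.02,        # upper end of the $0.01-$0.02/GB range in the notes
}


def transfer_cost(terabytes: float, scope: str) -> float:
    """Return the estimated transfer cost in USD for a volume given in decimal TB."""
    gigabytes = terabytes * 1000  # 1 TB = 1000 GB
    return round(gigabytes * RATES_PER_GB[scope], 2)


print(transfer_cost(2, "inter_az"))
print(transfer_cost(2, "same_az_private_ip"))
```

Keeping the rates in one table makes it easy to swap in current prices for a specific region before relying on the numbers.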

Placement Groups

  • Users have limited control over the allocation of their physical machines in racks.
  • Placing compute resources in the same rack may result in correlated failures.
  • Placement groups give some control over placement (e.g., cluster in the same rack, spread across racks, or partition groups).

Region/AZ Choice Considerations

  • Legal requirements and other factors can influence the best regions for deploying services.
  • Network bandwidth costs, speed, tolerance to failures, and overall availability are factored in.

EC2 Pricing Models

  • Models include on-demand, spot, reserved instances, and savings plans.
  • On-demand: per-second pricing, most flexible.
  • Spot: significantly lower price, possible interruption.
  • Reserved: long-term commitment.
  • Savings Plans: discount in exchange for an hourly spending commitment.

On Demand

  • Most flexible, but also most expensive pricing option in EC2.
  • Billed per second.
  • No guarantee of instance availability.

Reserved Instance, Reserved Instance Market

  • Long-term reservations (1 or 3 years), offering discounts.
  • Partial upfront or monthly payment possible.
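
Whether a reservation pays off depends on how much of the term the instance actually runs. Here is a rough sketch, assuming a reserved instance is billed for every hour of the term at a discounted rate while on-demand is billed only for the hours used. The function name is illustrative, and the sample discounts are the ends of the 50-70% savings range quoted in the flashcards.

```python
def break_even_utilization(reserved_discount: float) -> float:
    """Fraction of the term an instance must run for a reservation to match on-demand cost.

    Reserved cost per term hour:  (1 - discount) * on_demand_rate, paid for every hour.
    On-demand cost per term hour: utilization * on_demand_rate.
    Setting both equal gives utilization = 1 - discount.
    """
    if not 0.0 <= reserved_discount < 1.0:
        raise ValueError("discount must be in [0, 1)")
    return 1.0 - reserved_discount


for discount in (0.5, 0.7):
    share = break_even_utilization(discount)
    print(f"{discount:.0%} discount -> break-even at {share:.0%} utilization")
```

Under these assumptions, a workload that runs less than the break-even share of the term is cheaper on demand.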

Reserved Instance Example

  • Example of pricing for c5.24xlarge on demand.
  • Includes hourly rate.

Savings Plans

  • Committing to a fixed hourly spend for one or three years earns discounts.
  • Discounted rates apply to usage up to the committed hourly amount; commitment left unused in an hour is lost.
  • Two variants: Compute Savings Plans and EC2 Instance Savings Plans.
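
The hourly accounting can be sketched as follows. This is a simplified model, assuming usage is valued at the plan's discounted rates, the commitment is always paid in full, and usage beyond it is billed at on-demand rates. The 50% default discount and all names are hypothetical.

```python
def savings_plan_hour(commitment: float, usage_at_plan_rates: float,
                      discount: float = 0.5) -> tuple[float, float]:
    """Return (amount billed, unused commitment) in USD for one hour.

    usage_at_plan_rates is the hour's usage valued at the discounted plan rates.
    """
    covered = min(usage_at_plan_rates, commitment)
    unused = commitment - covered  # lost; it does not roll over to later hours
    # Usage beyond the commitment is converted back to on-demand prices.
    overflow_on_demand = (usage_at_plan_rates - covered) / (1.0 - discount)
    billed = commitment + overflow_on_demand
    return billed, unused


print(savings_plan_hour(80.0, 60.0))  # the quiz scenario: $20 of commitment goes unused
print(savings_plan_hour(80.0, 90.0))  # usage above the commitment spills into on-demand
```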

Spot

  • Offers significant discounts (often 60-70%) on unused capacity.
  • Customers submit requests at a maximum price.
  • Price and availability vary across AZs.

Spot Instance Interruption

  • Spot instances may be forcefully terminated.
  • AWS provides a 2-minute notice before termination.
  • In practice, interruptions happen less often than one might expect and cause few disruptions in operation.
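
A workload can react to the interruption notice by checking how much time remains. The sketch below only parses a notice; it assumes the JSON shape (`action` and `time` fields) that the EC2 instance metadata service returns for Spot interruptions, and the sample payload is made up. Fetching the URL (including any metadata token handling) is left out.

```python
import json
from datetime import datetime, timezone

# Instance metadata path for Spot interruption notices (assumption: no notice means HTTP 404).
SPOT_ACTION_URL = "http://169.254.169.254/latest/meta-data/spot/instance-action"


def seconds_until_interruption(body: str, now: datetime) -> float:
    """Parse an interruption notice and return the seconds left until the action time."""
    notice = json.loads(body)
    deadline = datetime.strptime(notice["time"], "%Y-%m-%dT%H:%M:%SZ")
    deadline = deadline.replace(tzinfo=timezone.utc)  # the timestamp is in UTC
    return (deadline - now).total_seconds()


# Made-up sample notice, issued two minutes before the termination time.
sample = '{"action": "terminate", "time": "2024-01-01T12:02:00Z"}'
now = datetime(2024, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
print(seconds_until_interruption(sample, now))  # 120.0
```

A job would typically poll this periodically and checkpoint its state once a notice appears.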

EC2 Pricing Models: Summary

  • A mix of pricing models is usually the best strategy for running EC2 services.
  • Reserved instances for the base load of mission-critical services.
  • Spot instances for large, interruptible jobs that are not latency-sensitive.
  • On-demand for short-term or filling gaps.
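
The rule of thumb from the quiz (pay about half the On-Demand cost) can be checked with a small blended-cost calculation. This is a sketch under stated assumptions: the discounts are midpoints of the 50-70% (reserved) and 60-70% (spot) ranges from the flashcards, and the workload mix is hypothetical.

```python
def blended_cost_fraction(mix: dict[str, float], discounts: dict[str, float]) -> float:
    """Return the average cost as a fraction of the pure on-demand cost.

    mix maps each pricing model to its share of instance-hours (shares sum to 1).
    discounts maps each pricing model to its fractional discount versus on-demand.
    """
    if abs(sum(mix.values()) - 1.0) > 1e-9:
        raise ValueError("shares must sum to 1")
    return sum(share * (1.0 - discounts[model]) for model, share in mix.items())


discounts = {"on_demand": 0.0, "reserved": 0.60, "spot": 0.65}  # midpoints of the quoted ranges
mix = {"reserved": 0.5, "spot": 0.3, "on_demand": 0.2}          # hypothetical workload mix
print(round(blended_cost_fraction(mix, discounts), 3))
```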

Why Are We Talking About Cost So Much?

  • Cloud-native systems could automatically pick the best instances and pricing models.
  • Today these choices are mostly made manually; automating them is an open research question.

Outages and SLAs

  • Provider outages are possible in the cloud market.
  • Service Level Agreements (SLAs) describe a standard of quality for a service, like uptime.
  • AWS gives refunds based on uptime.
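
To make uptime percentages tangible, they can be converted into allowed downtime. This is a small sketch; the uptime targets below are generic examples, not AWS's actual SLA tiers, and the 30-day month is an assumption.

```python
def allowed_downtime_minutes(uptime_percent: float, days: int = 30) -> float:
    """Return the minutes of downtime permitted in a period at the given uptime level."""
    total_minutes = days * 24 * 60
    return total_minutes * (1.0 - uptime_percent / 100.0)


for target in (99.0, 99.9, 99.99):  # example targets only
    minutes = allowed_downtime_minutes(target)
    print(f"{target}% uptime allows {minutes:.1f} minutes of downtime per 30 days")
```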

EC2 SLAs

  • Covers the instance-level SLA: clients may receive refunds in case of availability issues. Region-level issues affecting multiple AZs can be equally problematic.
  • Refunds are issued if availability falls below the stated standard.

AWS Outage History

  • Hyperscalers occasionally have outages.
  • Outages happen less frequently than in other market segments; instances are usually available.

Burstable Instances

  • Intended for workloads with low average utilization and occasional spikes.
  • Savings can be substantial for such workload patterns.

Burstable Instances Implementation

  • CPU oversubscription to utilize one physical server across multiple virtual machines for seamless handling of spikes.
  • Implemented using a hypervisor, which enables allocating VMs more CPUs than physically available.

Burstable Instances Pricing Example

  • Each instance type has a base load and an hourly cost.
  • Credits accumulate while an instance runs below its base load.
  • Beyond the base load, credits are spent; once they run out, standard mode throttles the instance to its base load, while unlimited mode charges for the extra usage.
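
The credit mechanism can be illustrated with a small simulation. This is a simplified sketch, assuming one credit equals one vCPU running at 100% for one minute; the default parameters (2 vCPUs, 20% base load, 576-credit cap) are meant to resemble a t4g.medium but should be treated as assumptions, and throttling is modeled only by clamping the balance at zero.

```python
def simulate_credits(utilization_per_hour: list[float], vcpus: int = 2,
                     baseline: float = 0.20, max_credits: float = 576.0,
                     start: float = 0.0) -> list[float]:
    """Return the credit balance after each hour for the given average utilizations (0..1)."""
    earned_per_hour = vcpus * baseline * 60  # credits earned each hour at the base load
    balance = start
    history = []
    for utilization in utilization_per_hour:
        spent = vcpus * utilization * 60  # credits consumed by the actual load
        # Clamp to [0, max]: an empty balance means the instance would be throttled.
        balance = min(max(balance + earned_per_hour - spent, 0.0), max_credits)
        history.append(balance)
    return history


# Four quiet hours at 5% load build up credits, then one busy hour at 50% load spends them.
print(simulate_credits([0.05, 0.05, 0.05, 0.05, 0.50]))
```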

Steady-State Comparison

  • Steady-state cost comparisons of Graviton2 instances with 32GB RAM, along with pricing models for different vCPU use cases.
  • Examples include t4g.2xlarge (low cost), m6g.2xlarge, and r6g.xlarge (medium cost).

Steady-State Comparison (4 GB RAM)

  • Cost comparisons for instances with 4 GB RAM (trn1, c6g.large, m6g.medium).

Burstable Instance Analysis

  • Resource savings in the cloud ecosystem come from smoothing load across different customers using CPU core fractions.
  • Practical savings are substantial for small instance sizes, and less so for larger ones due to inherent oversubscription.
  • Performance may be unpredictable.

Hetzner (Hourly pricing including 19% VAT)

  • Includes hourly vCPU pricing, shared and dedicated vCPUs, and dedicated server monthly and one-time setup fees
  • Data for different vCPU/RAM configurations

Lambda

  • Functions as a service (serverless).
  • Code automatically assigned to compute hardware, with scaling available.
  • Intended for small invocations and stateless tasks.
  • Complex jobs can be broken down into graphs of stateless computations.

Lambda Pricing

  • Duration cost depends on the allocated RAM and is proportional to execution time (GB-seconds).
  • Request cost for 1 million requests is $0.20.
  • Pricing has millisecond granularity.
  • Savings plans are available with a 17% maximum discount.
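
The pricing rules above can be combined into a cost estimate, together with the quiz's advice to pick the memory size that minimizes execution time times memory. This is a minimal sketch: the request price comes from the notes, while the GB-second price is an assumption not stated in the notes and should be checked against the current price list. The profile measurements are hypothetical.

```python
PRICE_PER_MILLION_REQUESTS = 0.20   # USD, from the notes
PRICE_PER_GB_SECOND = 0.0000166667  # USD, assumption: not stated in the notes


def lambda_cost(invocations: int, avg_duration_ms: float, memory_mb: int) -> float:
    """Estimate the cost in USD of a number of invocations at a given memory size."""
    gb_seconds = invocations * (avg_duration_ms / 1000.0) * (memory_mb / 1024.0)
    request_cost = invocations / 1_000_000 * PRICE_PER_MILLION_REQUESTS
    return gb_seconds * PRICE_PER_GB_SECOND + request_cost


def cheapest_memory(profile: dict[int, float]) -> int:
    """Pick the memory size (MB) that minimizes memory x duration.

    profile maps memory size in MB to the measured average duration in ms.
    """
    return min(profile, key=lambda mb: mb * profile[mb])


profile = {512: 1200.0, 1024: 500.0, 2048: 300.0}  # hypothetical measurements
best = cheapest_memory(profile)
print(best, round(lambda_cost(1_000_000, profile[best], best), 2))
```

Because CPU share grows with memory, a larger allocation can finish faster and end up cheaper, which is why profiling several sizes is worthwhile.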

Limitations

  • Maximum duration of 15 minutes, 10,240 MB RAM, and 5.8 vCPUs.
  • No network communication with other Lambdas.

Lambda vs. EC2

  • Lambda offers automatic elasticity and scalability as well as fine-grained pricing, but long-running tasks need EC2 resources.
  • Lambda costs more overall than EC2 due to run-time overhead, but is more cost-effective for short tasks.
  • Limited hardware customization flexibility.

Compute vs. Space Tradeoff

  • There are various ways to implement the same job function.
  • Fargate prices suggest that one CPU core costs about as much as 10 GB of RAM.
  • Computation, storage, and RAM/CPU tradeoffs
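
The core-to-RAM exchange rate gives a simple way to compare two implementations of the same job. A rough sketch, assuming cost scales linearly with cores, RAM, and runtime; the two job configurations are invented for illustration.

```python
GB_RAM_PER_CORE = 10.0  # from the notes: one core is priced like roughly 10 GB of RAM


def job_cost_core_seconds(cores: float, ram_gb: float, runtime_s: float) -> float:
    """Return the job cost in core-second equivalents, converting RAM into cores."""
    return (cores + ram_gb / GB_RAM_PER_CORE) * runtime_s


# Hypothetical variants of one job: a memory-hungry one and a compute-hungry one.
memory_heavy = job_cost_core_seconds(cores=1, ram_gb=40, runtime_s=100)
compute_heavy = job_cost_core_seconds(cores=4, ram_gb=2, runtime_s=60)
print(memory_heavy, compute_heavy)
```

In this invented example the compute-heavy variant is cheaper even though it uses four times as many cores, because the memory-heavy variant pays for 40 GB of RAM over a longer runtime.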

Compute vs. Space Tradeoff (2)

  • A comparison of algorithms assuming perfect scalability.

Storage

  • Including examples like c5d.24xlarge (900 GB NVMe SSD), i3en.24xlarge (8 * 7500 GB NVMe SSD), d3en.12xlarge (24 * 13980 GB HDD)
  • Persistent and transient data can be managed.
  • Instance storage is not persistent; its durability is similar to that of the EC2 instance itself.

Amazon Elastic Block Store (EBS)

  • Virtual disks (not a standard file system).
  • Used for root volumes and additional volumes.
  • Capacity, bandwidth, throughput.
  • Storage only accessible to one instance at a time.
  • SAN network, replicated for fault tolerance.

EBS General Purpose Storage (gp3)

  • Volume sizes from 1 GB to 16 TB.
  • Up to 16,000 I/Os per second and 1 GB/s throughput.
  • Volume durability of 99.8-99.9%; pricing: $80 per TB per month, including 3,000 free IOPS.

EBS Variants

  • EBS variants differ in pricing, IOPS, performance, and durability.
  • Examples include io1, io2, st1, and sc1.

Simple Storage Service (S3)

  • Foundational service for storage.
  • Redundancy (across AZs within a region).
  • Key terminology (objects, prefixes, buckets).
  • API access for operations and bucket management.
  • Public or private access to buckets.

S3 Standard

  • Storage: $21-25 per TB/month
  • Request prices: PUT/LIST ($5/million), GET ($0.4/million), DELETE (free).
  • Data transfer within a region is free.
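
These prices can be combined into a monthly estimate. A minimal sketch, assuming a storage price of $23 per TB (a point inside the $21-25 range above) and the request prices from the list; the function name and the sample workload are illustrative.

```python
def s3_standard_monthly_cost(stored_tb: float, put_list_requests: int,
                             get_requests: int,
                             storage_price_per_tb: float = 23.0) -> float:
    """Estimate the monthly S3 Standard cost in USD for storage plus requests."""
    storage = stored_tb * storage_price_per_tb
    writes = put_list_requests / 1_000_000 * 5.0  # PUT/LIST: $5 per million
    reads = get_requests / 1_000_000 * 0.4        # GET: $0.4 per million
    return storage + writes + reads               # DELETE requests are free


# Hypothetical workload: 10 TB stored, 2 million PUTs, 50 million GETs per month.
print(round(s3_standard_monthly_cost(10, 2_000_000, 50_000_000), 2))
```

For workloads with many small objects, the request charges can rival the storage charge, so both terms are worth estimating.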

S3 Standard Durability and Availability SLAs

  • High availability and durability standards.
  • Data loss likelihood is low across typical usage patterns.

S3 Consistency

  • Strong consistency has been offered since 2020, replacing the earlier weak consistency.

S3 Variants

  • Variants differ in access time, cost, and durability: Standard, Standard-IA, One Zone-IA, Express One Zone, Glacier, and Intelligent-Tiering.

Other Storage Alternatives

  • Includes Amazon Elastic File System (EFS), DynamoDB, Relational Database Service (RDS), and other options in AWS.

Other Services

  • Lists various AWS services, including Marketplace for third-party software and SaaS, IAM, CloudWatch, DNS (Route 53), VPC, CDN (CloudFront), SQS, SNS, Step Functions, SageMaker, Beanstalk, and EMR (for Spark and MapReduce).
  • EC2 on-demand prices rarely change
  • New instances are regularly introduced
  • Older instances are deprecated (removed).
  • The costs across different tiers evolve over the years, especially given the introduction or removal of specific hardware services, such as NVMe based arrays

EC2 Hardware Landscape

  • Includes discussion of on-demand prices, new instances, and deprecated instances.

EC2 CPU Cost Evolution (Intel & AMD)

  • Graph of how CPU costs evolved over the years for different CPU types (Intel and AMD).

EC2 DRAM Cost Evolution

  • Graph of how DRAM (memory) costs evolved over the years across different instance types.

EC2 I/O Bandwidth Cost Evolution

  • Graph of how I/O bandwidth costs evolved over the years across different instance types.

EC2 Storage Capacity Cost Evolution

  • Graph of how storage capacity costs evolved over the years across different instance types.

EC2 Network Bandwidth Cost Evolution

  • Graph of how network bandwidth costs evolved over the years across different instance types.

Specialized Hardware and Accelerators

  • Moore's Law is slowing for CPUs and DRAM, so specialized processors are becoming more attractive and more important for specialized use cases.
  • The cloud environment simplifies using specialized hardware components.

CPU Stagnation

  • Recent trends show a notable pause in Moore's Law, making specialized hardware more appealing. Hardware developments span CPUs, GPUs, and ML and ASIC-based accelerators.
  • Graphs showing trends and costs for leading edge semiconductor manufacturing.
  • Data collected and shown from different perspectives, including time trends, costs and capabilities of hardware fabrication.

CPU Stagnation (Graphs)

  • Graphs showing microprocessor performance improvements, and improvements relative to dollar cost, across different time periods.

AMD CPU Cost Stagnation

  • Graph of the cost per core for different AMD CPU families; pricing has remained stable.

When Does Specialized Hardware Pay Off?

  • Choosing between universal (general-purpose) and specialized processors is a cost and performance tradeoff that depends on the usage pattern and scenario.
  • The graphs in the slides show optimal ranges to determine when to choose a specific piece of hardware.
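
One common way to reason about this tradeoff is to compare cost per unit of work instead of cost per hour. The sketch below is an illustration under stated assumptions, not the model used in the slides: the prices and throughputs are hypothetical, and both function names are made up for the example.

```python
def cost_per_job(hourly_price_usd: float, jobs_per_hour: float) -> float:
    """Cost of one unit of work on a given instance."""
    return hourly_price_usd / jobs_per_hour


def accelerator_pays_off(cpu_price, cpu_jobs_per_hour, acc_price, acc_jobs_per_hour):
    """True when the accelerator finishes a job for less money than the CPU."""
    return cost_per_job(acc_price, acc_jobs_per_hour) < cost_per_job(
        cpu_price, cpu_jobs_per_hour
    )


# Hypothetical numbers: the accelerator instance costs 4x as much per hour.
print(accelerator_pays_off(1.0, 100, 4.0, 1000))  # 10x speedup on this workload
print(accelerator_pays_off(1.0, 100, 4.0, 300))   # only 3x speedup
```

Under these assumptions the accelerator wins only when its speedup on the workload exceeds its price ratio: the 10x case pays off, the 3x case does not.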

Custom AWS Hardware

  • Custom hardware from hyperscalers isn't commercially available.
  • AWS Nitro technology.
  • Recent progress in language and image recognition capabilities in AI compared to human standards.
  • Trends show AI rapidly improving.
  • Performance developments and the requirements needed.

Machine Learning Training FLOPS

  • Shows training FLOPS improvement over time and its implications.

Graphics Processing Unit (GPU)

  • GPUs were originally designed for graphics and are increasingly used as compute accelerators, especially for ML workloads, where they can deliver a large performance boost.

Machine Learning Accelerators

  • Details on AWS Trainium and Inferentia2 accelerators, and Google's TPUs.
  • A comparison of these accelerators' characteristics.

Field Programmable Gate Array (FPGA)

  • FPGAs are well suited to implementing application-specific operations and computations. They allow computations to be tailored to a specific purpose without a full ASIC design, but at the cost of increased complexity and high upfront development costs.
  • EC2 instances with FPGAs (such as f1.2xlarge, f1.4xlarge, f1.16xlarge) using Xilinx UltraScale+ VU9P FPGAs.
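
The high upfront development cost mentioned above can be treated as an investment that is repaid by cheaper per-job execution. The following break-even sketch is only an illustration: the function name is hypothetical, and every dollar figure is an assumption, not data from the notes.

```python
import math


def break_even_jobs(upfront_dev_usd, generic_cost_per_job, fpga_cost_per_job):
    """Number of jobs after which the FPGA's lower per-job cost repays the upfront work.

    Returns None if the FPGA is not cheaper per job, since it then never pays off.
    """
    saving = generic_cost_per_job - fpga_cost_per_job
    if saving <= 0:
        return None
    return math.ceil(upfront_dev_usd / saving)


# Hypothetical: $200,000 of development, $0.010 per job on CPUs, $0.002 on FPGAs.
print(break_even_jobs(200_000, 0.010, 0.002))
```

With these assumed numbers the break-even point lies in the tens of millions of jobs, which is why FPGA designs tend to suit stable, high-volume workloads.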

FPGA Example

  • Illustrates an FPGA-based architecture for analytic query processing (AQUA).
  • Shows an example implementation, including its layers and the AWS services used, and highlights its scalability.

Summary

  • Covers basic infrastructure services (EC2, EBS, S3).
  • Pricing complexity, cost bottlenecks, and importance of specialized configurations.


Description

Explore cloud service cost-effectiveness compared to on-premise IT, the implications of cloud pricing on software architecture, and strategies for minimizing cloud costs during migration. Understand EC2 instance selection and minimizing service disruption during resizing.
