Podcast
Questions and Answers
Which factor most significantly influences the cost-effectiveness of cloud services compared to on-premise IT solutions?
Which factor most significantly influences the cost-effectiveness of cloud services compared to on-premise IT solutions?
- The color scheme of the cloud provider's user interface.
- The geographical location of the data centers.
- The extent to which administration can be automated and costs can be amortized. (correct)
- The brand reputation of the cloud vendor.
What is a key implication of cloud pricing structures on software architecture?
What is a key implication of cloud pricing structures on software architecture?
- The frequency of software updates.
- The best software architecture to leverage cloud services effectively. (correct)
- The user interface design.
- The selection of the programming language.
Why is understanding cloud service pricing crucial for organizations utilizing cloud resources?
Why is understanding cloud service pricing crucial for organizations utilizing cloud resources?
- To impress stakeholders with technical jargon.
- To gain access to exclusive cloud vendor merchandise.
- To ensure compliance with industry-standard color palettes.
- To accurately budget and optimize cloud expenditures, avoiding unexpected costs. (correct)
A company is planning to migrate its services to the cloud. Which of the following strategies would be most effective in managing and minimizing cloud costs?
A company is planning to migrate its services to the cloud. Which of the following strategies would be most effective in managing and minimizing cloud costs?
A development team is designing a new application to be hosted on EC2. Which of the following factors should they consider least when selecting an EC2 instance type?
A development team is designing a new application to be hosted on EC2. Which of the following factors should they consider least when selecting an EC2 instance type?
Which of the following strategies can be used within the AWS EC2 environment to minimize service disruption when resizing an instance?
Which of the following strategies can be used within the AWS EC2 environment to minimize service disruption when resizing an instance?
An application requires low-latency network connectivity. Based on the provided data, which region would be the most suitable for deploying resources if the primary region is eu-central-1
?
An application requires low-latency network connectivity. Based on the provided data, which region would be the most suitable for deploying resources if the primary region is eu-central-1
?
How does EC2 handle bandwidth allocation for smaller instance slices on the same machine?
How does EC2 handle bandwidth allocation for smaller instance slices on the same machine?
What happens to the extra bandwidth allocated to an EC2 instance in steady state?
What happens to the extra bandwidth allocated to an EC2 instance in steady state?
What is a key consideration when selecting an AWS region for deploying resources, besides network latency?
What is a key consideration when selecting an AWS region for deploying resources, besides network latency?
Which Amazon S3 storage class offers the lowest storage cost per TB per month?
Which Amazon S3 storage class offers the lowest storage cost per TB per month?
What is a primary benefit for customers using the AWS Marketplace from a purchasing perspective?
What is a primary benefit for customers using the AWS Marketplace from a purchasing perspective?
A company requires immediate consistency for listing objects after writes in their cloud storage solution. Which Amazon S3 feature directly supports this requirement?
A company requires immediate consistency for listing objects after writes in their cloud storage solution. Which Amazon S3 feature directly supports this requirement?
What role does the seller fulfill in AWS Marketplace subscription-based pricing?
What role does the seller fulfill in AWS Marketplace subscription-based pricing?
A development team needs a PaaS solution on AWS. Which service should they consider?
A development team needs a PaaS solution on AWS. Which service should they consider?
A company wants to implement a content delivery network (CDN) to improve the speed and availability of their web content. Which AWS service should they use?
A company wants to implement a content delivery network (CDN) to improve the speed and availability of their web content. Which AWS service should they use?
What is the function of the Amazon Simple Notification Service (SNS)?
What is the function of the Amazon Simple Notification Service (SNS)?
Which AWS service is specifically designed for big data batch processing using frameworks like Spark and MapReduce?
Which AWS service is specifically designed for big data batch processing using frameworks like Spark and MapReduce?
Consider a scenario where an application needs to ensure that updates to an S3 object are made only if specific conditions are met. Which S3 feature would best support these requirements?
Consider a scenario where an application needs to ensure that updates to an S3 object are made only if specific conditions are met. Which S3 feature would best support these requirements?
A company wants to monitor the performance of their AWS resources and receive alerts for potential issues. Which AWS service should they use?
A company wants to monitor the performance of their AWS resources and receive alerts for potential issues. Which AWS service should they use?
What is the primary benefit of AWS Nitro Cards in the context of virtualization?
What is the primary benefit of AWS Nitro Cards in the context of virtualization?
Compared to CPUs, what is a significant advantage GPUs offer for machine learning tasks?
Compared to CPUs, what is a significant advantage GPUs offer for machine learning tasks?
Why might a company consider switching cloud providers specifically for machine learning training?
Why might a company consider switching cloud providers specifically for machine learning training?
What is the main advantage of using AWS Trainium and Inferentia2 accelerators over general-purpose GPUs for machine learning?
What is the main advantage of using AWS Trainium and Inferentia2 accelerators over general-purpose GPUs for machine learning?
In what scenario would FPGAs be a more suitable choice than CPUs, GPUs, or ML accelerators?
In what scenario would FPGAs be a more suitable choice than CPUs, GPUs, or ML accelerators?
What is the primary reason that cloud computing, if not carefully monitored, can become very expensive?
What is the primary reason that cloud computing, if not carefully monitored, can become very expensive?
What is the purpose of custom AWS hardware?
What is the purpose of custom AWS hardware?
Which of the following accurately describes the trade-off between GPUs and CPUs?
Which of the following accurately describes the trade-off between GPUs and CPUs?
Which of the following characteristics makes Function as a Service (FaaS) like AWS Lambda most suitable for specific types of workloads?
Which of the following characteristics makes Function as a Service (FaaS) like AWS Lambda most suitable for specific types of workloads?
Which of the following is NOT a benefit of using FPGAs?
Which of the following is NOT a benefit of using FPGAs?
Compute, storage, and network resources can all be cost bottlenecks. What is the most important consideration when handling these bottlenecks?
Compute, storage, and network resources can all be cost bottlenecks. What is the most important consideration when handling these bottlenecks?
What is the primary factor that determines the cost of Lambda function execution, besides the number of requests?
What is the primary factor that determines the cost of Lambda function execution, besides the number of requests?
Considering the limitations of AWS Lambda, which scenario would be least suitable for its implementation?
Considering the limitations of AWS Lambda, which scenario would be least suitable for its implementation?
How does the pricing model of Hetzner's dedicated servers differ from their shared/dedicated vCPU offerings?
How does the pricing model of Hetzner's dedicated servers differ from their shared/dedicated vCPU offerings?
If an application requires guaranteed CPU performance and is sensitive to resource contention, which of the following AWS EC2 instance types would be the most suitable?
If an application requires guaranteed CPU performance and is sensitive to resource contention, which of the following AWS EC2 instance types would be the most suitable?
What is a key disadvantage of using burstable performance instances like t4g.medium
compared to general-purpose instances like m6g.2xlarge
for demanding applications?
What is a key disadvantage of using burstable performance instances like t4g.medium
compared to general-purpose instances like m6g.2xlarge
for demanding applications?
A company is deciding between using AWS Lambda and provisioning their own servers. They need to process a high volume of unpredictable, short-lived requests. What advantage does AWS Lambda offer in this scenario?
A company is deciding between using AWS Lambda and provisioning their own servers. They need to process a high volume of unpredictable, short-lived requests. What advantage does AWS Lambda offer in this scenario?
A developer wants to optimize the cost of running an AWS Lambda function that processes data from S3. The function's execution time varies significantly depending on the input data size. Which strategy would be most effective for cost optimization?
A developer wants to optimize the cost of running an AWS Lambda function that processes data from S3. The function's execution time varies significantly depending on the input data size. Which strategy would be most effective for cost optimization?
What is the primary factor influencing network latency within an Availability Zone (AZ)?
What is the primary factor influencing network latency within an Availability Zone (AZ)?
Which of the following outbound data transfer scenarios would typically be free of charge in AWS, as of the information provided?
Which of the following outbound data transfer scenarios would typically be free of charge in AWS, as of the information provided?
What is the cost associated with data transfer between two EC2 instances located within the same Availability Zone (AZ) using private IP addresses?
What is the cost associated with data transfer between two EC2 instances located within the same Availability Zone (AZ) using private IP addresses?
A company needs to transfer 2TB of data between two Availability Zones within the same AWS region. What would be the total cost of this transfer?
A company needs to transfer 2TB of data between two Availability Zones within the same AWS region. What would be the total cost of this transfer?
Which Placement Group strategy is best suited for distributing instances across different physical racks to minimize correlated failures?
Which Placement Group strategy is best suited for distributing instances across different physical racks to minimize correlated failures?
A company is deploying a new application and must comply with data sovereignty laws requiring data to reside within a specific geographic boundary. Which AWS Region/AZ choice consideration is most relevant in this scenario?
A company is deploying a new application and must comply with data sovereignty laws requiring data to reside within a specific geographic boundary. Which AWS Region/AZ choice consideration is most relevant in this scenario?
Which EC2 pricing model offers the most flexibility but is also typically the most expensive?
Which EC2 pricing model offers the most flexibility but is also typically the most expensive?
A company has a consistent, predictable workload and wants to minimize EC2 costs. Which pricing model would be the most cost-effective choice?
A company has a consistent, predictable workload and wants to minimize EC2 costs. Which pricing model would be the most cost-effective choice?
What is the primary purpose of the Reserved Instance Marketplace?
What is the primary purpose of the Reserved Instance Marketplace?
A company commits to spending $80 per hour on AWS compute resources through a Savings Plan. In one particular hour, their actual spending is only $60. What happens with the remaining $20?
A company commits to spending $80 per hour on AWS compute resources through a Savings Plan. In one particular hour, their actual spending is only $60. What happens with the remaining $20?
What is the key difference between Compute Savings Plans and EC2 Instance Savings Plans?
What is the key difference between Compute Savings Plans and EC2 Instance Savings Plans?
A company is running a large batch processing job that is not time-sensitive and can tolerate interruptions. Which EC2 pricing model would be the most cost-effective?
A company is running a large batch processing job that is not time-sensitive and can tolerate interruptions. Which EC2 pricing model would be the most cost-effective?
What advance notice does AWS typically provide before terminating a Spot Instance?
What advance notice does AWS typically provide before terminating a Spot Instance?
A company has a base load of critical applications that must always be available, as well as some large, non-critical jobs that can be interrupted. What is the recommended strategy for EC2 pricing models?
A company has a base load of critical applications that must always be available, as well as some large, non-critical jobs that can be interrupted. What is the recommended strategy for EC2 pricing models?
What is a good rule of thumb for average EC2 costs when using a mix of pricing models?
What is a good rule of thumb for average EC2 costs when using a mix of pricing models?
Flashcards
Cloud Costs
Cloud Costs
Expenses incurred while using cloud services, often significant for software products.
Public Cloud Vendors
Public Cloud Vendors
Companies that offer cloud services like AWS, which have changed the hardware landscape.
Elastic Compute Cloud (EC2)
Elastic Compute Cloud (EC2)
Amazon's service for providing virtual machines of varying types for different computing needs.
Instance Types
Instance Types
Signup and view all the flashcards
Pricing Structures
Pricing Structures
Signup and view all the flashcards
Bandwidth Allocation
Bandwidth Allocation
Signup and view all the flashcards
EC2 Instance Change
EC2 Instance Change
Signup and view all the flashcards
AWS Regions
AWS Regions
Signup and view all the flashcards
Network Latency
Network Latency
Signup and view all the flashcards
Live Migration Support
Live Migration Support
Signup and view all the flashcards
Lambda: Function as a Service
Lambda: Function as a Service
Signup and view all the flashcards
Automatic Scaling
Automatic Scaling
Signup and view all the flashcards
Stateless Tasks
Stateless Tasks
Signup and view all the flashcards
Lambda Pricing - Duration
Lambda Pricing - Duration
Signup and view all the flashcards
Request Cost
Request Cost
Signup and view all the flashcards
Lambda Limitations
Lambda Limitations
Signup and view all the flashcards
CPU Proportionality
CPU Proportionality
Signup and view all the flashcards
Networking Constraints
Networking Constraints
Signup and view all the flashcards
S3 Strong Consistency
S3 Strong Consistency
Signup and view all the flashcards
S3 Variants
S3 Variants
Signup and view all the flashcards
Glacier Storage Classes
Glacier Storage Classes
Signup and view all the flashcards
Intelligent Tiering
Intelligent Tiering
Signup and view all the flashcards
AWS Marketplace
AWS Marketplace
Signup and view all the flashcards
Subscription-based Pricing
Subscription-based Pricing
Signup and view all the flashcards
Conditional Writes
Conditional Writes
Signup and view all the flashcards
Identity and Access Management (IAM)
Identity and Access Management (IAM)
Signup and view all the flashcards
AWS CloudWatch
AWS CloudWatch
Signup and view all the flashcards
Simple Notification Service (SNS)
Simple Notification Service (SNS)
Signup and view all the flashcards
Availability Zones (AZ)
Availability Zones (AZ)
Signup and view all the flashcards
Internet Traffic Cost
Internet Traffic Cost
Signup and view all the flashcards
Intra-AWS Transfer Cost
Intra-AWS Transfer Cost
Signup and view all the flashcards
Placement Groups
Placement Groups
Signup and view all the flashcards
On Demand Pricing
On Demand Pricing
Signup and view all the flashcards
Reserved Instances
Reserved Instances
Signup and view all the flashcards
Savings Plans
Savings Plans
Signup and view all the flashcards
Spot Instances
Spot Instances
Signup and view all the flashcards
Interruption Notice
Interruption Notice
Signup and view all the flashcards
EC2 Pricing Models Summary
EC2 Pricing Models Summary
Signup and view all the flashcards
Fault Tolerance
Fault Tolerance
Signup and view all the flashcards
Data Transfer via S3
Data Transfer via S3
Signup and view all the flashcards
Network Bandwidth Cost
Network Bandwidth Cost
Signup and view all the flashcards
Legal Reasons for Region Choice
Legal Reasons for Region Choice
Signup and view all the flashcards
AWS Nitro Cards
AWS Nitro Cards
Signup and view all the flashcards
Floating Point Operations Per Second (FLOPS)
Floating Point Operations Per Second (FLOPS)
Signup and view all the flashcards
Graphics Processing Unit (GPU)
Graphics Processing Unit (GPU)
Signup and view all the flashcards
Machine Learning Accelerators
Machine Learning Accelerators
Signup and view all the flashcards
Field Programmable Gate Array (FPGA)
Field Programmable Gate Array (FPGA)
Signup and view all the flashcards
Application-Specific Integrated Circuit (ASIC)
Application-Specific Integrated Circuit (ASIC)
Signup and view all the flashcards
AWS Trainium
AWS Trainium
Signup and view all the flashcards
AWS Inferentia2
AWS Inferentia2
Signup and view all the flashcards
GPU instance types
GPU instance types
Signup and view all the flashcards
Machine Learning Training Cost
Machine Learning Training Cost
Signup and view all the flashcards
Study Notes
Cloud Information Systems
- Presented by Prof. Dr. Viktor Leis, Chair for Decentralized Information Systems and Data Management
- Focuses on Cloud Information Systems, specifically Amazon Web Services (AWS)
Case Study: Amazon Web Services
- Amazon's extreme growth led to sophisticated in-house IT infrastructure
- AWS launched EC2 and S3 in 2006, triggering the modern era of cloud computing
- Microsoft Azure followed in 2010, Google Cloud in 2012
Modern Cloud Era
- Highlighting Amazon's decision to sell its internal IT services
- Cloud costs are a significant factor in software product development and administration
Hyperscalers Market Share
- Showing market share trend of hyperscalers like Amazon, Microsoft, Google, Alibaba, and IBM
- Data from Synergy Research Group.
- Market share data from Q4 2017 to Q2 2022, showing global revenue trends.
Market Share
- Graph depicts market cap and market share of major cloud providers (EU & US)
- Figures in EUR billion and percentage points respectively.
AWS Deep Dive
- Detailed AWS Services and Pricing Models will be explored
- Costs of cloud computing resources are a significant consideration.
- Public cloud providers' offerings are affecting hardware landscapes
- Minor changes can significantly impact costs
- Cloud pricing plays critical roles for software architecture and effective cloud usage.
EC2
- A large set of heterogeneous virtual machines (instances).
- Types include general purpose, compute optimized, memory optimized, accelerated computing, storage optimized, and HPC optimized.
- Previously primarily based on 2-socket Intel servers; now also includes AMD and ARM chips (Graviton).
- Includes extensive details on available instance types.
Instance Sizes
- Tabular data on mainstream instance families with Intel CPUs, featuring vCPUs, DRAM, instance SSD, network bandwidth & cost per hour
- Data is outdated.
Instance Slices
- Virtual machines created by slicing physical servers
- Uses Intel and ARM/Graviton processors
- Costs usually scale linearly with resources.
- Offers fine-grained RAM/CPU configuration using various API names.
Burst Bandwidth
- Bandwidth shared among multiple machines.
- Unused bandwidth is allocated to other instances.
- Referencing specific AWS documentation links.
Changing Instances
- EC2 instances need to be stopped and restarted to change configurations (e.g., size)
- Important considerations to prevent service disruption.
- Alternative cloud service providers offer live migration
Regions, AZs, Networking
- AWS has 27 physical locations (data centers) globally.
- Price variations exist based on region, e.g., Frankfurt vs. US east coast.
- Region sizes, presented with a pie chart of IP address counts for AWS regions in 2022
- Network latency data (round-trip latency) from eu-central-1 to other regions (e.g., Milan, Ireland, US East, US West, Singapore).
Availability Zones (AZs)
- Clusters of geographically close data centers, but not directly adjacent.
- Latency within an AZ is very low (down to 0.03ms)
Internet Traffic Cost
- Inbound data transfer is free; outbound transfer costs varied to have become free.
- Pricing depends on the data volume.
- This was due to March 2024 free transfer changes, which were possibly triggered by European Data Act.
Intra-AWS Transfer Cost
- Intra-AZ transfer cost is free (private IP).
- Inter-region transfer is priced at $0.01 - $0.02/GB.
- S3 can sometimes be a cheaper alternative for inter-AZ transfer.
- Microsoft Azure made a similar change.
Placement Groups
- Users have limited control over the allocation of their physical machines in racks.
- Placing compute resources in the same rack may result in related failures.
- Using placement groups helps with certain control over placement. (e.g. cluster in the same rack, spread across racks, or partition group).
Region/AZ Choice Considerations
- Legal requirements and other factors can influence the best regions for deploying services.
- Network bandwidth costs, speed, tolerance to failures, and overall availability are factored in.
EC2 Pricing Models
- Services include on-demand, spot, reserved instances, and savings plans.
- On-demand: per-second pricing, most flexible.
- Spot: significant lower price, possible interruption
- Reserved: long-term commitment
- Savings Plans: hourly spending discount
On Demand
- Most flexible, but also most expensive pricing option in EC2..
- Measured per minute.
- No guarantee of instance availability.
Reserved Instance, Reserved Instance Market
- Long-term reservations (1 or 3 years), offering discounts.
- Partial upfront or monthly payment possible.
Reserved Instance Example
- Example of pricing for c5.24xlarge on demand.
- Includes hourly rate.
Savings Plans
- Fixed spending commitments over a year or three years can get discounts.
- Discounts are calculated on hourly spending below the committed amount.
- Two variants: Compute Savings Plans and EC2 Instance Savings Plans.
Spot
- Offering significant discounts (often 60-70%) on unused capacity.
- Customers submit requests at a maximum price.
- Price and availability vary across AZs..
Spot Instance Interruption
- Spot instances may be forcefully terminated.
- AWS provides notice before termination
- Interruptions happen much less often and may have very few disruptions in operation.
EC2 Pricing Models: Summary
- A mix of pricing models is usually the best strategy for running EC2 services.
- Reserved instances for critical mission-critical services.
- Spot instances for large jobs (less latency).
- On-demand for short-term or filling gaps.
Why Are We Talking About Cost So Much?
- Cloud-native systems could automatically pick the best instances and pricing models.
- Current models are mainly done manually; this is an open research question.
Outages and SLAs
- Provider outages are possible in the cloud market.
- Service Level Agreements (SLAs) describe a standard of quality for a service, like uptime.
- AWS gives refunds based on uptime.
EC2 SLAs
- Instance level SLA, the pricing model, refunds may be given to clients in case of availability issues. Regional level issues for multiple AZs can be equally problematic.
- Refunds will be made if below the given standards.
AWS Outage History
- Hyperscalers occasionally have outages.
- Outages frequently happen less than in other market segments; instances are commonly available.
Burstable Instances
- Utilization is low, spikes happen occasionally.
- Savings can be substantial for such workload patterns.
Burstable Instances Implementation
- CPU oversubscription to utilize one physical server across multiple virtual machines for seamless handling of spikes.
- Implemented using a hypervisor, which enables allocating VMs more CPUs than physically available.
Burstable Instances Pricing Example
- Base load for particular instance types, with hourly costs
- Credits accumulate for instances below base load..
- If beyond base load, standard mode or unlimited mode charges or discounts apply.
Steady-State Comparison
- Steady-state cost comparisons of Graviton2 instances with 32GB RAM, along with pricing models for different vCPU use cases.
- Examples include t4g.2xlarge(low cost), m6g.2xlarge ,r6g.xlarge(medium cost).
Steady-State Comparison (4 GB RAM)
- Cost comparisons for instances with 4 GB RAM. (trn1, c6g.large, m6g.medium)
Burstable Instance Analysis
- Resource savings in the cloud ecosystem, by load smoothing across different customers using CPU core fractions..
- Practical savings are substantial at the small instances size, and less so at larger ones due to inherent oversubscription
- Performance prediction may be unpredictability.
Hetzner (Hourly pricing including 19% VAT)
- Includes hourly vCPU pricing, shared and dedicated vCPUs, and dedicated server monthly and one-time setup fees
- Data for different vCPU/RAM configurations
Lambda
- Functions as a service (serverless).
- Code automatically assigned to compute hardware, with scaling available.
- Intended for small invocations and stateless tasks.
- Complex jobs can be broken down into graphs of stateless computations.
Lambda Pricing
- Duration cost depends on RAM capacity and proportional to CPU time.
- Request cost for 1 million requests is $0.20.
- Pricing is in millisecond granularity..
- Savings plans are available with a 17% maximum discount.
Limitations
- Maximum duration of 15 minutes, and 10240 MB RAM and 5.8 vCPUs.
- No network communication with other Lambdas.
Lambda vs. EC2
- Lambda has automatic elasticity and scalability, and fine-grained pricing but needs to utilize EC2 resources for long tasks.
- Lambda is higher cost overall than EC2, due to run-time overhead, however, more cost-effective for short tasks
- Limited hardware customization flexibility.
Compute vs. Space Tradeoff
- There are various ways to implement the same job function.
- Prices for Fargate suggest that a CPU core is comparable to 10 GB of RAM, based on prices for Fargate.
- Computation, storage, and RAM/CPU tradeoffs
Compute vs. Space Tradeoff (2)
- A comparison of algorithms assuming perfect scalability.
Storage
- Including examples like c5d.24xlarge (900 GB NVMe SSD), i3en.24xlarge (8 * 7500 GB NVMe SSD), d3en.12xlarge (24 * 13980 GB HDD)
- Persistent and transient data can be managed.
- Instance storage is not persistent; durability is similar to EC2.
Amazon Elastic Block Store (EBS)
- Virtual disks (not a standard file system).
- Used for root volumes and additional volumes.
- Capacity, bandwidth, throughput.
- Storage only accessible to one instance at a time.
- SAN network, replicated for fault tolerance.
EBS General Purpose Storage (gp3)
- Volume sizes from 1 GB to 16 TB.
- 16.000 I/Os per second & 1 GBps throughput.
- Volume durability (99.8-99.9%), and pricing details: $80 per TB per month, and free 3000 IOPS
EBS Variants
- Different variant types of EBS provide varying pricing, IOPS, performance, and durability.
- Examples include io1, io2, st1, and sc1,
Simple Storage Service (S3)
- Foundational service for storage.
- Redundancy (across AZs within a region).
- Key terminology (objects, prefixes, buckets).
- API access for operations and bucket management.
- Public or private access to buckets.
S3 Standard
- Storage: $21-25 per TB/month
- Request prices: PUT/LIST ($5/million), GET ($0.4/million), DELETE (free).
- Bandwidth charges within a region are free (within a region).
S3 Standard Durability and Availability SLAs
- High availability and durability standards.
- Data loss likelihood is low across typical usage patterns.
S3 Consistency
- Strong consistency offered after 2020; no longer weak consistency offering.
S3 Variants
- Different variants based on access time, cost, and durability, such as Standard, Standard-IA, One Zone-IA, One Zone-Express. Glaciers, and Intelligent-Tiering.
Other Storage Alternatives
- Includes Amazon Elastic File System (EFS), DynamoDB, Relational Database Service (RDS), and other options in AWS.
Other Services
- Listing various AWS services; includes Marketplace for third-party software and SaaS, and others such as IAM, CloudWatch, DNS (Route 53), VPC, CDN (CloudFront), SQS, SNS, Step Functions, SageMaker, Beanstalk, EMR, Spark, and MapReduce.
EC2 Trends
- EC2 on-demand prices rarely change
- New instances are regularly introduced
- Older instances are deprecated (removed).
- The costs across different tiers evolve over the years, especially given the introduction or removal of specific hardware services, such as NVMe based arrays
EC2 Hardware Landscape
- Includes discussion of on-demand prices, new instances, and deprecated instances.
EC2 CPU Cost Evolution (Intel & AMD)
- Graph for the evolution of CPU costs across different types of CPU types (Intel and AMD) over various years.
EC2 DRAM Cost Evolution
- Graph for the evolution of DRAM(Memory) costs over the years, across different instance types
EC2 I/O Bandwidth Cost Evolution
- Graph for the evolution of I/O Bandwidth costs over the years, across diffferent instance types
EC2 Storage Capacity Cost Evolution
- Graph for the evolution of Storage capacity costs over the years, across different instance types.
EC2 Network Bandwidth Cost Evolution
- Graph for the evolution of Network Bandwidth costs across different instance types over the years
Specialized Hardware and Accelerators
- Moore's law is slowing for CPUs and DRAMS; specialized hardware is becoming more attractive for specialized use cases. Increasing importance of specialized processors.
- The cloud environment simplifies using specialized hardware components.
CPU Stagnation
- Recent trends show a notable pause in Moore's Law, making specialized hardware more appealing.. Hardware developments, including CPUs, GPU, ML and ASIC based accelerators, etc...
Semiconductor Fabrication Trends
- Graphs showing trends and costs for leading edge semiconductor manufacturing.
- Data collected and shown from different perspectives, including time trends, costs and capabilities of hardware fabrication.
CPU Stagnation (Graphs)
- Showing microprocessor performance improvements and improvements over dollar costs across different time periods.
AMD CPU Cost Stagnation
- Graph showing the cost per core for different AMD CPU families, showing stability of the pricing.
When Does Specialized Hardware Pay Off?
- The cost and performance tradeoff for choosing between universal and specialized processors across different usage patterns & scenarios.
- The graphs in the slides show optimal ranges to determine when to choose a specific piece of hardware.
Custom AWS Hardware
- Custom hardware from hyperscalers isn't commercially available..
- AWS Nitro technology
Machine Learning trends
- Recent progress in Language and Image Recognition capabilities in Al compared to human standards.
- Trends show Al rapidly improving
- Performance developments, and requirements needed
Machine Learning Training FLOPS
- Shows training FLOPS improvement over time and its implications.
Graphics Processing Unit (GPU)
- GPUs were originally designed for graphics and are being more often used as compute accelerators, especially within the ML workload domain where a great performance boost can be achieved.
Machine Learning Accelerators
- Details on AWS Trainium and Inferentia2 accelerators, and Google's TPUs.
- A comparison of these accelerators' characteristics.
Field Programmable Gate Array (FPGA)
- FPGAs are well-suited for implementing application-specific operations and computations. This allows tailoring computations for specific purposes without requiring a full ASIC design but at the cost of increased complexity and high development upfront costs.
- EC2 instances with FPGAs (such as f1.2xlarge, f1.4xlarge, f1.16xlarge) using Xilinx UltraScale+ VU9P FPGAs.
FPGA Example
- Illustrates an FPGA-based architecture for analytic query processing (AQUA).
- An example of such an implementation, including various layers and AWS services used.. Highlights the scalability.
Summary
- Covers basic infrastructure services (EC2, EBS, S3).
- Pricing complexity, cost bottlenecks, and importance of specialized configurations.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
Explore cloud service cost-effectiveness compared to on-premise IT, the implications of cloud pricing on software architecture, and strategies for minimizing cloud costs during migration. Understand EC2 instance selection and minimizing service disruption during resizing.