Arie Bregman 48db2d4664 Update

2022-08-25 09:17:29 +03:00

AWS

Note: Some of the exercises cost $$$ and can't be performed using the free tier/resources

2nd Note: Provided solutions use the AWS console. It's recommended that you use IaC technologies to solve the exercises (e.g. Terraform, Pulumi).

AWS
- Exercises
  - IAM
  - EC2
  - S3
  - ELB
  - Auto Scaling Groups
  - VPC
  - Databases
  - DNS
  - Containers
  - Lambda
  - Elastic Beanstalk
  - CodePipeline
  - Misc
- Questions
  - Global Infrastructure
  - IAM
  - EC2
    - AMI
    - EBS
    - Instance Store
    - EFS
    - Pricing Models
    - Launch Template
    - ENI
    - Placement Groups
  - Lambda
  - Containers
    - ECS
    - Fargate
  - S3
    - Basics
    - Buckets 101
    - Objects
    - Security
    - Misc
  - Disaster Recovery
  - CloudFront
  - ELB
    - ALB
  - Auto Scaling Group
  - Security
  - Databases
    - RDS
    - Aurora
    - DynamoDB
    - ElastiCache
    - RedShift
  - VPC
  - Identify the Service
  - DNS (Route 53)
  - Monitoring and Logging
  - Billing and Support
  - Automation
  - Misc
  - High Availability
  - Production Operations and Migrations
  - Scenarios
  - Architecture Design
  - Misc

Exercises

IAM

Name	Topic	Objective & Instructions	Solution
Create a User	IAM	Exercise	Solution
Password Policy	IAM	Exercise	Solution
Create a role	IAM	Exercise	Solution
Credential Report	IAM	Exercise	Solution
Access Advisor	IAM	Exercise	Solution

EC2

Name	Topic	Objective & Instructions	Solution
Launch EC2 web instance	EC2	Exercise	Solution
Security Groups	EC2	Exercise	Solution
IAM Roles	EC2, IAM	Exercise	Solution
Spot Instances	EC2	Exercise	Solution
Elastic IP	EC2, Networking	Exercise	Solution
Placement Groups Creation	EC2, Placement Groups	Exercise	Solution
Elastic Network Interfaces	EC2, ENI	Exercise	Solution
Hibernate an Instance	EC2	Exercise	Solution
Volume Creation	EC2, EBS	Exercise	Solution
Snapshots	EC2, EBS	Exercise	Solution
Create an AMI	EC2, AMI	Exercise	Solution
Create EFS	EC2, EFS	Exercise	Solution

S3

Name	Topic	Objective & Instructions	Solution	Comments
Create buckets	S3	Exercise	Solution

ELB

Name	Topic	Objective & Instructions	Solution
Application Load Balancer	ELB, ALB	Exercise	Solution
Multiple Target Groups	ELB, ALB	Exercise	Solution
Network Load Balancer	ELB, NLB	Exercise	Solution

Auto Scaling Groups

Name	Topic	Objective & Instructions	Solution	Comments
Auto Scaling Groups Basics	ASG	Exercise	Solution
Dynamic Scaling Policy	ASG, Policies	Exercise	Solution

VPC

Name	Topic	Objective & Instructions	Solution	Comments
My First VPC	VPC	Exercise	Solution
Subnets	VPC	Exercise	Solution

Databases

Name	Topic	Objective & Instructions	Solution
MySQL DB	RDS	Exercise	Solution
Aurora DB	RDS	Exercise	Solution
ElastiCache	ElastiCache	Exercise	Solution

DNS

Name	Topic	Objective & Instructions	Solution
Register Domain	Route 53	Exercise	Solution
Creating Records	Route 53	Exercise	Solution
Health Checks	Route 53	Exercise	Solution
Failover	Route 53	Exercise	Solution

Containers

Name	Topic	Objective & Instructions	Solution	Comments
ECS Task	ECS, Fargate	Exercise	Solution

Lambda

Name	Topic	Objective & Instructions	Solution	Comments
Hello Function	Lambda	Exercise	Solution
URL Function	Lambda	Exercise	Solution

Elastic Beanstalk

Name	Topic	Objective & Instructions	Solution	Comments
Simple Elastic Beanstalk Node.js app	Elastic Beanstalk	Exercise	Solution

CodePipeline

Name	Topic	Objective & Instructions	Solution	Comments
Basic CI with S3	CodePipeline & S3	Exercise	Solution

Misc

Name	Topic	Objective & Instructions	Solution	Comments
Budget Setup	Budget	Exercise	Solution
No Application :'(	Troubleshooting	Exercise	Solution

Questions

Global Infrastructure

Explain the following

Availability zone
Region
Edge location

AWS regions are data centers hosted across different geographical locations worldwide.

Within each region, there are multiple isolated locations known as Availability Zones. Each availability zone is one or more data-centers with redundant network and connectivity and power supply. Multiple availability zones ensure high availability in case one of them goes down.

Edge locations are essentially content delivery network (CDN) endpoints that cache data and ensure lower latency and faster delivery to users in any location. They are located in major cities around the world.

True or False? Each AWS region is designed to be completely isolated from the other AWS regions

True.

True or False? Each region has a minimum number of 1 availability zones and the maximum is 4

False. The minimum is 2 while the maximum is 6.

What considerations to take when choosing an AWS region for running a new application?

Service availability: not all services (and not all of their features) are available in every region

Reduced latency: deploy the application in a region that is close to its customers

Compliance: some countries have stricter rules and requirements, such as making sure the data stays within the borders of the country or the region. In that case, only specific regions can be used for running the application

Pricing: pricing is not consistent across regions, so the same service might cost more in one region than in another.

IAM

What is IAM? What are some of its features?

In short, it's used for managing users, groups, access policies & roles. Full explanation can be found here

True or False? IAM configuration is defined globally and not per region

True

True or False? When creating an AWS account, root account is created by default. This is the recommended account to use and share in your organization

False. Instead of using the root account, you should create IAM users and use them.

True or False? Groups in AWS IAM, can contain only users and not other groups

True

True or False? Users in AWS IAM, can belong only to a single group

False. Users can belong to multiple groups.

What are some best practices regarding IAM in AWS?

Delete root account access keys and don't use root account regularly

Create IAM user for any physical user. Don't share users.

Apply "least privilege principle": give users only the permissions they need, nothing more than that.

Set up MFA and consider enforcing using it

Make use of groups to assign permissions ( user -> group -> permissions )

What permissions does a new user have?

Only login access. A new IAM user has no permissions until a policy is attached, directly or through a group.

True or False? If a user in AWS is using a password for authentication, they don't need to enable MFA

False(!). MFA is a great additional security layer to use for authentication.

What ways are there to access AWS?

AWS Management Console

AWS CLI

AWS SDK

What are Roles?

AWS docs: "An IAM role is an IAM identity that you can create in your account that has specific permissions...it is an AWS identity with permission policies that determine what the identity can and cannot do in AWS." For example, you can make use of a role which allows EC2 service to access s3 buckets (read and write).

What are Policies?

Policies are documents used to grant permissions that define what a user, group or role is able to do. Their format is JSON.

A user is unable to access an s3 bucket. What might be the problem?

There can be several reasons for that. One of them is a missing policy. To solve that, the admin has to attach to the user a policy that allows access to the s3 bucket.

What should you use to:

Grant access between two services/resources?
Grant user access to resources/services?

Grant access between two services/resources: Role

Grant user access to resources/services: Policy

What do AWS IAM policy statements consist of?

Sid: identifier of the statement (optional)

Effect: allow or deny access

Action: list of actions (to deny or allow)

Resource: a list of resources to which the actions are applied

Principal: role or account or user to which to apply the policy

Condition: conditions to determine when the policy is applied (optional)
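As a rough illustration of how these elements fit together, here is a minimal Python sketch that assembles a policy document as plain data. The helper names (`make_statement`, `make_policy`) and the bucket name `example-bucket` are made up for this example. `Principal` is left out on purpose, since it is used in resource-based policies rather than in policies attached to a user, group or role.

```python
import json

ALLOWED_EFFECTS = {"Allow", "Deny"}


def make_statement(effect, actions, resources, sid=None, condition=None):
    """Build one policy statement as a plain dict (hypothetical helper)."""
    if effect not in ALLOWED_EFFECTS:
        raise ValueError(f"Effect must be one of {sorted(ALLOWED_EFFECTS)}")
    statement = {"Effect": effect, "Action": actions, "Resource": resources}
    # Sid and Condition are optional, so they are only added when given.
    if sid is not None:
        statement["Sid"] = sid
    if condition is not None:
        statement["Condition"] = condition
    return statement


def make_policy(statements):
    """Wrap statements in a policy document with the usual version string."""
    return {"Version": "2012-10-17", "Statement": list(statements)}


# Example: read-only access to a single (hypothetical) bucket.
read_only = make_policy([
    make_statement(
        effect="Allow",
        actions=["s3:GetObject", "s3:ListBucket"],
        resources=[
            "arn:aws:s3:::example-bucket",
            "arn:aws:s3:::example-bucket/*",
        ],
        sid="ReadExampleBucket",
    )
])

print(json.dumps(read_only, indent=2))
```

The printed JSON can be pasted into the policy editor of the console, which is a convenient way to check that the element names are spelled correctly.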

Explain the following policy:

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": "*",
            "Resource": "*"
        }
    ]
}

This policy permits any action on any resource. It happens to be the "AdministratorAccess" policy.
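To make the effect of such a policy more concrete, the following Python sketch evaluates a request against a policy using a heavily simplified version of the IAM rules: everything is denied by default, an `Allow` grants access, and an explicit `Deny` always wins. The function `is_allowed` is hypothetical, wildcard matching is approximated with `fnmatchcase`, and conditions, principals and multiple policy types are ignored, so treat it as a teaching aid rather than a description of the real evaluation engine.

```python
from fnmatch import fnmatchcase


def _as_list(value):
    """Policy elements may be a single string or a list of strings."""
    return value if isinstance(value, list) else [value]


def _matches(patterns, value):
    """True if value matches any pattern ('*' matches any run of characters)."""
    return any(fnmatchcase(value, pattern) for pattern in _as_list(patterns))


def is_allowed(policy, action, resource):
    """Simplified evaluation: default deny, explicit Deny overrides Allow."""
    allowed = False
    for statement in policy["Statement"]:
        applies = (_matches(statement["Action"], action)
                   and _matches(statement["Resource"], resource))
        if not applies:
            continue
        if statement["Effect"] == "Deny":
            return False  # an explicit deny ends the evaluation immediately
        allowed = True
    return allowed


admin_policy = {
    "Version": "2012-10-17",
    "Statement": [{"Effect": "Allow", "Action": "*", "Resource": "*"}],
}

print(is_allowed(admin_policy, "ec2:TerminateInstances", "arn:aws:ec2:::instance/example"))
```

With the policy above every request matches the single `Allow` statement, which is why it grants administrator-level access.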

What security tools does AWS IAM provide?

IAM Credentials Report: lists all the account users and the status of their credentials

IAM Access Advisor: Shows service permissions granted to a user and information on when he accessed these services the last time

Which tool would you use to optimize user permissions by identifying which services he doesn't regularly (or at all) access?

IAM Access Advisor

EC2

What is EC2?

"a web service that provides secure, resizable compute capacity in the cloud". Read more here

True or False? EC2 is a regional service

True. As opposed to IAM for example, which is a global service, EC2 is a regional service.

What are some of the properties/configuration options of EC2 instances that can be set or modified?

OS (Linux, Windows)

RAM and CPU

Networking - IP, Card properties like speed

Storage Space - (EBS, EFS, EC2 Instance Store)

EC2 User Data

Security groups

What would you use for customizing EC2 instances? As in software installation, OS configuration, etc.

AMI. With AMI (Amazon Machine Image) you can customize EC2 instances by specifying which software to install, what OS changes should be applied, etc.

AMI

What is AMI?

AWS Docs: "An Amazon Machine Image (AMI) provides the information required to launch an instance". Read more here

What are the different sources for AMIs?

Personal AMIs - AMIs you create

AWS Marketplace for AMIs - AMIs made by others, mostly sold for some price

Public AMIs - Provided by AWS

True or False? AMIs are built for a specific region

True (but they can be copied from one region to another).

Describe in high-level the process of creating AMIs

Start an EC2 instance

Customize the EC2 instance (install packages, change OS configuration, etc.)

Stop the instance (for avoiding data integrity issues)

Create EBS snapshot and build an AMI

To verify and test the AMI, launch an instance from the AMI

What is an instance type?

"the instance type that you specify determines the hardware of the host computer used for your instance" Read more about instance types here

Explain the instance type naming convention

Let's take for example the following instance type: m5.large

m is the instance class

5 is the generation

large is the size of the instance (affects the spec properties like vCPUs and RAM)
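The convention can be expressed as a small parser. This is a sketch only: the regular expression and the `InstanceType` tuple are assumptions made for this example, and the extra `attributes` field covers the additional letters some instance types carry after the generation (as in `c5n.xlarge`). A few unusual names will not fit this simple pattern.

```python
import re
from typing import NamedTuple


class InstanceType(NamedTuple):
    instance_class: str  # e.g. "m"
    generation: int      # e.g. 5
    attributes: str      # extra letters after the generation, often empty
    size: str            # e.g. "large"


# class letters, generation digits, optional attribute letters, ".", size
_PATTERN = re.compile(r"^([a-z]+?)(\d+)([a-z-]*)\.([a-z0-9]+)$")


def parse_instance_type(name):
    """Split an instance type name such as 'm5.large' into its parts."""
    match = _PATTERN.match(name)
    if match is None:
        raise ValueError(f"Unrecognized instance type name: {name!r}")
    instance_class, generation, attributes, size = match.groups()
    return InstanceType(instance_class, int(generation), attributes, size)


print(parse_instance_type("m5.large"))
print(parse_instance_type("c5n.xlarge"))
```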

True or False? The following are instance types available for a user in AWS:

Compute optimized
Network optimized
Web optimized

False. From the above list only compute optimized is available.

Explain each of the following instance types:

"Compute Optimized"
"Memory Optimized"
"Storage Optimized"

Compute Optimized:

Used for compute-intensive tasks

It has high performance processors

Use cases vary: gaming servers, machine learning, batch processing, etc.

Memory Optimized:

Used for processing large data sets in memory

Other use cases: high-performance databases, distributed cache stores

Storage Optimized:

Used for storage intensive tasks - high read and write access to large data sets

Use cases: databases, OLTP systems, distributed file systems

What can you attach to an EC2 instance in order to store data?

EBS

EBS

Explain Amazon EBS

AWS Docs: "provides block level storage volumes for use with EC2 instances. EBS volumes behave like raw, unformatted block devices."

What happens to EBS volumes when the instance is terminated?

By default, the root volume is marked for deletion, while other volumes will still remain.
You can control what will happen to every volume upon termination.

What happens to the EC2 disk (EBS) when the instance is stopped?

Disk is intact and can be used when the instance starts.

True or False? EBS volumes are locked to a specific availability zone

True

Explain EBS Snapshots

EBS snapshots are used for making a backup of an EBS volume at a point in time.

What are the use cases for using EBS snapshots?

Backups of the data

Moving the data between AZs

Is it possible to attach the same EBS volume to multiple EC2 instances?

Yes, with multi-attach it's possible to attach a single EBS volume to multiple instances (supported on io1 and io2 volumes, within the same availability zone).

True or False? EBS is a network drive hence, it requires network connectivity

True

What EBS volume types are there?

HDD (st1, sc1): Low cost HDD volumes

SSD
- gp2, gp3: General purpose SSD
- io1, io2: Provisioned IOPS SSD

If you need an EBS volume for low latency workloads, which volume type would you use?

SSD - io1, io2

If you need an EBS volume for workloads that require good performance but the cost is also an important aspect for you, which volume type would you use?

SSD - gp2, gp3

If you need an EBS volume for high-throughput, which volume type would you use?

SSD - io1, io2

If you need an EBS volume for infrequent data access, which volume type would you use?

HDD - sc1

Which EBS volume types can be used as boot volumes for EC2 instances?

SSD: gp2, gp3, io1, io2

True or False? In EBS gp2 volume type, IOPS will increase if the disk size increases

True.
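The relationship between gp2 size and baseline performance can be sketched in a few lines of Python. The constants below (3 IOPS per GiB, a floor of 100 IOPS and a ceiling of 16,000 IOPS) reflect the gp2 figures published by AWS at the time of writing, as far as I know; treat them as assumptions and verify them against the current EBS documentation. The function name is made up for this example.

```python
# Assumed gp2 figures; verify against the current EBS documentation.
GP2_IOPS_PER_GIB = 3
GP2_MIN_IOPS = 100
GP2_MAX_IOPS = 16_000


def gp2_baseline_iops(size_gib):
    """Baseline IOPS of a gp2 volume: scales with size between a floor and a cap."""
    if size_gib <= 0:
        raise ValueError("size_gib must be positive")
    scaled = GP2_IOPS_PER_GIB * size_gib
    return max(GP2_MIN_IOPS, min(GP2_MAX_IOPS, scaled))


for size in (10, 100, 1_000, 10_000):
    print(f"{size} GiB -> {gp2_baseline_iops(size)} baseline IOPS")
```

Unlike gp2, gp3 lets you provision IOPS independently of the volume size, so this scaling rule does not apply there.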

Instance Store

If you would like to have a hardware disk attached to your EC2 instance instead of a network one (EBS), what would you use?

EC2 Instance Store.

Explain EC2 Instance Store. Why would someone choose to use it over other options?

EC2 instance store provides better I/O performance when compared to EBS.
It is mostly used for cache and temporary data purposes.

Are there any disadvantages in using instance store over EBS?

Yes, the data on an instance store is lost when the instance is stopped or terminated.

EFS

What is Amazon EFS?

AWS Docs: "Amazon Elastic File System (Amazon EFS) provides a simple, scalable, fully managed elastic NFS file system for use with AWS Cloud services and on-premises resources."

In simpler words, it's a network file system you can mount on one or more EC2 instances.

True or False? EFS is locked into a single availability zone

False. EFS can be mounted across multiple availability zones.

What are some use cases for using EFS?

Data sharing (e.g. developers working on the same source control)

Web serving

Content management

True or False? EFS is only compatible with Linux-based AMIs

True

True or False? EFS requires the user to perform capacity planning as it doesn't scale automatically

False. EFS scales automatically and you pay-per-use.

What EFS modes are there?

Performance mode

General purpose: used mainly for CMS, web serving, ... as it's optimal for latency sensitive applications

Max I/O: great for scaling to high levels of throughput and I/O operations per second

Throughput mode
- Provisioned: fixed throughput

Which EFS mode would you use if you need to perform media processing?

Performance Mode (Max I/O): It provides high throughput and scales to high levels of operations per second. Mainly used for big data, media processing, etc.

What is the default EFS mode?

Performance Mode (General Purpose): Used for web serving, CMS, ... anything that is sensitive to latency.

What EFS storage tiers are there?

Standard: frequently accessed files

Infrequent access: lower prices to store files but it also costs to retrieve them

Pricing Models

What EC2 pricing models are there?

On Demand - pay a fixed rate by the hour/second with no commitment. You can provision and terminate it at any given time.

Reserved - you get capacity reservation, basically purchasing an instance for a fixed period of time. The longer, the cheaper.

Spot - enables you to bid whatever price you want for instances or pay the spot price.

Dedicated Hosts - physical EC2 server dedicated for your use.

True or False? Reserved instance has to be used for a minimum of 1 year

True.

Explain the following types of reserved instances:

Convertible Reserved Instances
Scheduled Reserved Instances

Convertible Reserved Instances: used for long-running workloads when the instance type might change during the reserved period

Scheduled Reserved Instances: when you need to reserve an instance for a long period but you don't need it continuously (so for example you need it only in the morning)

True or False? In EC2 On Demand, you pay per hour when using Linux or Windows and per second (after first minute) when using any other operating system

False. You pay per second (after the first minute) when using Windows or Linux and per hour for any other OS.

You need an instance for short-term and the workload running on instance must not be interrupted. Which pricing model would you use?

On Demand is good for short-term non-interrupted workloads (but it also has the highest cost).

You need an instance for running an application for a period of 2 years continuously, without changing instance type. Which pricing model would you use?

Reserved instances: they are cheaper than on-demand and the instance is yours for the chosen period of time.

Which pricing model has potentially the biggest discount and what is its disadvantage?

Spot instances provide the biggest discount but have the disadvantage that you risk losing them when the spot price rises above your bid.

You need an instance for two years, but only between 10:00-15:00 every day. Which pricing model would you use?

Reserved instances from the "Scheduled Reserved Instances" type which allows you to reserve for specific time window (like 10:00-15:00 every day).

You need an instance for running workloads. You don't care if they fail for a given moment as long as they run eventually. Which pricing model would you use?

Spot instances. The discount potential is the highest compared to all other pricing models. The disadvantage is that you can lose the instance at any point, so you should only run workloads that can tolerate failing suddenly.

You need a physical server only for your use. Which pricing model are you going to use?

EC2 Dedicated Host
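The scenarios above follow a simple decision pattern, which the Python sketch below tries to capture. The function name, its parameters and the order in which the checks are made are assumptions for illustration; real decisions also depend on actual prices and on options not covered by these questions.

```python
def choose_pricing_model(*, needs_dedicated_server=False,
                         tolerates_interruption=False,
                         duration_years=0.0,
                         daily_window_only=False):
    """Map the requirements from the scenarios above to a pricing model."""
    if needs_dedicated_server:
        return "Dedicated Host"
    if tolerates_interruption:
        # Biggest potential discount, but the instance can be lost at any time.
        return "Spot"
    if duration_years >= 1:
        # Reserved instances require a commitment of at least one year.
        return "Scheduled Reserved" if daily_window_only else "Reserved"
    # Short term and must not be interrupted.
    return "On Demand"


print(choose_pricing_model(duration_years=2))
print(choose_pricing_model(duration_years=2, daily_window_only=True))
print(choose_pricing_model(tolerates_interruption=True))
```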

What are some of the differences between dedicated hosts and dedicated instances?

In dedicated hosts you have per host billing, you have more visibility (sockets, cores, ...) and you can control where instance will be placed.
In dedicated instances the billing is per instance but you can't control placement and you don't have visibility of sockets, cores, ...

For what use cases are EC2 dedicated hosts useful?

Compliance needs

When the software license is complex (Bring Your Own License) and doesn't support cloud or multi-tenants

Regulatory requirements

What are Security Groups?

"A security group acts as a virtual firewall that controls the traffic for one or more instances" More on this subject here

True or False? Security groups only contain deny rules

False. Security groups only contain allow rules.

True or False? One security group can be attached to multiple instances

True

True or False? Security groups are not locked down to a region and VPC (meaning you don't have to create a new one when switching regions)

False. They are locked down to regions and VPC.

True or False? By default, when using security groups, all inbound traffic to an EC2 instance is blocked and all outbound traffic is allowed

True

What is the advantage of referencing security groups from a given security group?

Imagine you have an instance referencing two security groups, allowing to get inbound traffic from them.
Now imagine you have two other instances, each using one of the security groups referenced by the first instance. The first instance accepts traffic from these two instances because they use security groups that are referenced in its rules. No need to use IPs.
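The following Python sketch models this behaviour in a very simplified way: inbound traffic is denied unless some rule allows it, and a rule's source can be either a CIDR block or the id of another security group. The classes, the function `allows_inbound` and the ids such as `sg-web` are made up for this example; real security groups also track protocols, port ranges and connection state.

```python
from dataclasses import dataclass, field
from ipaddress import ip_address, ip_network


@dataclass(frozen=True)
class InboundRule:
    port: int
    source: str  # a CIDR block, or a security group id such as "sg-web"


@dataclass
class SecurityGroup:
    group_id: str
    inbound: list = field(default_factory=list)


def allows_inbound(target_groups, port, source_ip, source_group_ids):
    """True if any allow rule on the target's groups matches the sender."""
    for group in target_groups:
        for rule in group.inbound:
            if rule.port != port:
                continue
            if rule.source.startswith("sg-"):
                # Group reference: the sender's IP is irrelevant.
                if rule.source in source_group_ids:
                    return True
            elif ip_address(source_ip) in ip_network(rule.source):
                return True
    return False  # no matching allow rule means the traffic is blocked


db_group = SecurityGroup("sg-db", [InboundRule(3306, "sg-web")])

# A web server that is a member of sg-web can reach the database...
print(allows_inbound([db_group], 3306, "10.0.1.5", {"sg-web"}))
# ...while an instance outside that group cannot, whatever its IP is.
print(allows_inbound([db_group], 3306, "10.0.1.5", {"sg-other"}))
```

Because the rule refers to `sg-web` rather than to addresses, web servers can be added, replaced or re-addressed without touching the database's security group.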

How to migrate an instance to another availability zone?

What EC2 reserved instance types are there?

Standard RI - most significant discount + suited for steady-state usage

Convertible RI - discount + change attribute of RI + suited for steady-state usage

Scheduled RI - launch within time windows you reserve

Learn more about EC2 RI here

For how long can reserved instances be reserved?

1 or 3 years.

What allows you to control inbound and outbound instance traffic?

Security Groups

What does bootstrapping mean and how is it used in AWS EC2?

Bootstrapping is about launching commands when a machine starts for the first time. In AWS EC2 this is done using the EC2 user data script.
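As an illustration, the Python sketch below holds a small user data script and base64-encodes it, which is the form the EC2 API expects for user data (the console and the AWS CLI take care of the encoding for you). The script content assumes an Amazon Linux style AMI with `yum` and `systemctl`; the package choice and the helper name `encode_user_data` are examples only.

```python
import base64

# Hypothetical bootstrap script; assumes an AMI that provides yum and systemctl.
USER_DATA = """#!/bin/bash
# Runs as root on the first boot of the instance.
yum update -y
yum install -y httpd
systemctl enable --now httpd
echo "Hello from $(hostname -f)" > /var/www/html/index.html
"""


def encode_user_data(script):
    """Return the script base64-encoded, ready to be passed to the EC2 API."""
    return base64.b64encode(script.encode("utf-8")).decode("ascii")


encoded = encode_user_data(USER_DATA)
print(encoded[:40] + "...")
```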

You get a timeout when trying to reach your application, which runs on an EC2 instance. Specify one reason why this might happen

Security group isn't configured properly.

What is the AWS Instance Connect?

AWS: "Amazon EC2 Instance Connect provides a simple and secure way to connect to your Linux instances using Secure Shell (SSH)."

You try to run EC2 commands in an EC2 instance you've just created but it fails due to missing credentials. What would you do?

DO NOT configure AWS credentials on the instance (this means anyone else in your account would be able to use and see your credentials).
The best practice is to attach an IAM role with sufficient permissions (like IAMReadOnlyAccess)

True or False? Cancelling a Spot instance request terminates the instance

False. When you cancel a Spot instance request, you are not terminating the instances created by it.
To terminate such instances, first cancel the Spot instance request and then terminate the instances themselves.

What are Spot Fleets?

Set of Spot instances and if you would like, also on-demand instances.

What strategies are there to allocate Spot instances?

lowestPrice: launch instances from the pool that has the lowest price

diversified: distributed across all pools

capacityOptimized: launch instances from the pools with optimal capacity for the number of instances being launched

From networking perspective, what do you get by default when running an EC2 instance?

A private IP and a public IP.

Explain EC2 hibernate

[AWS Docs](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/Hibernate.html): "Hibernation saves the contents from the instance memory (RAM) to your Amazon Elastic Block Store (Amazon EBS) root volume."

True or False? Using EC2 hibernate option results in having faster instance boot

True. This is because the operating system isn't restarted or stopped.

What are some use cases for using EC2 hibernate option?

Save RAM state

Service with long time initialization

Keep long-running processes

What are some limitations of EC2 hibernate option?

Instance RAM size is limited

Root volume must be encrypted EBS

Hibernation time is limited

Doesn't support all instance types

No support for bare metal. Only On-Demand and Reserved instances

Doesn't support all AMIs

Explain what is EC2 Nitro

Next generation EC2 instances using new virtualization technology

Better EBS: 64,000 EBS IOPS

Better networking: HPC, IPv6

Better security

What CPU customization is available with EC2?

Modifying number of CPU cores (useful for high RAM and low CPU applications)

Modifying number of threads per core (useful for HPC workloads)

Explain EC2 Capacity Reservations

Allows you to ensure you have EC2 capacity when you need it

Usually combined with Reserved Instances and Savings Plans to achieve cost savings

Launch Template

What is a launch template?

AWS Docs: "You can create a launch template that contains the configuration information to launch an instance. You can use launch templates to store launch parameters so that you do not have to specify them every time you launch an instance"

What is the difference between Launch Configuration and Launch Template?

Launch configuration is a legacy form of Launch Template that must be recreated every time you would like to update the configuration.

In addition, launch template has the clear benefits of:

Provisioning both On-Demand and Spot instances

Supporting multiple versions

Supporting creation of parameter subsets (used for re-use and inheritance)

ENI

Explain Elastic Network Interfaces (ENI)

AWS Docs: "An elastic network interface is a logical networking component in a VPC that represents a virtual network card."

Name at least three attributes the Elastic Network Interfaces (ENI) can include

One public IPv4 address

MAC address

A primary private IPv4 address (from the address range of your VPC)

True or False? ENIs are not bound to a specific availability zone

False. ENIs are bound to a specific availability zone.

True or False? ENI can be created independently of EC2 instances

True. They can be attached later on and on the fly (for failover purposes).

Placement Groups

What are "Placement Groups"?

AWS Docs: "When you launch a new EC2 instance, the EC2 service attempts to place the instance in such a way that all of your instances are spread out across underlying hardware to minimize correlated failures. You can use placement groups to influence the placement of a group of interdependent instances to meet the needs of your workload."

What Placement Groups strategies are there?

Cluster: places instances close together in an AZ.

Spread: spreads the instance across the hardware

Partition: spreads the instances across different partitions (= different sets of hardware/racks) within an AZ

For each of the following scenarios choose a placement group strategy:

High availability is top priority
Low latency between instances
Instances must be isolated from each other
Big Data applications that are partition aware
Big Data process that needs to end quickly

High availability is top priority - Spread

Low latency between instances - Cluster

Instances must be isolated from each other - Spread

Big Data applications that are partition aware - Partition

Big Data process that needs to end quickly - Cluster

What are the cons and pros of the "Cluster" placement group strategy?

Cons: if the hardware fails, all instances fail

Pros: Low latency & high throughput network

What are the cons and pros of the "Spread" placement group strategy?

Cons: Current limitation is 7 instances per AZ (per placement group)

Pros: Maximized high availability (instances on different hardware, span across AZs)

Lambda

Explain what is AWS Lambda

AWS definition: "AWS Lambda lets you run code without provisioning or managing servers. You pay only for the compute time you consume."

Read more on it here

True or False? In AWS Lambda, you are charged as long as a function exists, regardless of whether it's running or not

False. Charges are being made when the code is executed.

Which of the following sets of languages does Lambda support?

R, Swift, Rust, Kotlin
Python, Ruby, Go
Python, Ruby, PHP

Python, Ruby, Go

True or False? Basic lambda permissions allow you only to upload logs to Amazon CloudWatch Logs

True

Containers

ECS

What is Amazon ECS?

AWS Docs: "Amazon Elastic Container Service (Amazon ECS) is a fully managed container orchestration service. Customers such as Duolingo, Samsung, GE, and Cook Pad use ECS to run their most sensitive and mission critical applications because of its security, reliability, and scalability."

In simpler words, it allows you to launch containers on AWS.
While AWS takes care of starting/stopping containers, you need to provision and maintain the infrastructure where the containers are running (EC2 instances).

What should one do in order to make an EC2 instance part of an ECS cluster?

Install ECS agent on it. Some AMIs have built-in configuration for that.

What ECS launch types are there?

EC2 Instance

AWS Fargate

What is Amazon ECR?

AWS Docs: "Amazon Elastic Container Registry (ECR) is a fully-managed Docker container registry that makes it easy for developers to store, manage, and deploy Docker container images."

What is the role "EC2 Instance Profile" used for in regards to ECS?

The EC2 Instance Profile is used by the ECS agent on an EC2 instance to:

Make API calls to ECS Service

Send logs to CloudWatch from the container

Use secrets defined in SSM Parameter Store or Secrets Manager

Pull container images from ECR (Registry)

How to share data between containers (some from ECS and some from Fargate)?

Using EFS is a good way to share data between containers and it works also between different AZs.

Fargate

What is AWS Fargate?

Amazon Docs: "AWS Fargate is a serverless, pay-as-you-go compute engine that lets you focus on building applications without managing servers. AWS Fargate is compatible with both Amazon Elastic Container Service (ECS) and Amazon Elastic Kubernetes Service (EKS)"

In simpler words, AWS Fargate allows you to launch containers on AWS without worrying about managing infrastructure. It runs containers based on the CPU and RAM you need.

How is AWS Fargate different from AWS ECS?

In AWS ECS, you manage the infrastructure - you need to provision and configure the EC2 instances.
While in AWS Fargate, you don't provision or manage the infrastructure, you simply focus on launching Docker containers. You can think of it as the serverless version of AWS ECS.

True or False? Fargate creates an ENI for every task it runs

True.

S3

Basics

Explain what is AWS S3?

S3 is an object storage service which is fast, scalable and durable. S3 enables customers to upload, download or store any file or object that is up to 5 TB in size.

S3 stands for: Simple Storage Service

As a user you don't have to worry about filesystems or disk space

Buckets 101

What is a bucket?

An S3 bucket is a resource which is similar to folders in a file system and allows storing objects, which consist of data.

True or False? Buckets are defined globally

False. They are defined at the region level.

True or False? A bucket name must be globally unique

True

How to rename a bucket in S3?

An S3 bucket name is immutable. That means it's not possible to change it without deleting the bucket and creating a new one.

This is why the process for renaming a bucket is as follows:

Create a new bucket with the desired name

Move the data from the old bucket to it

Delete the old bucket

With the AWS CLI that would be:

    # Create new bucket
    aws s3 mb s3://[NEW_BUCKET_NAME]
    # Sync the content from the old bucket to the new bucket
    aws s3 sync s3://[OLD_BUCKET_NAME] s3://[NEW_BUCKET_NAME]
    # Remove old bucket
    aws s3 rb --force s3://[OLD_BUCKET_NAME]

True or False? The max object size a user can upload in one go, is 5TB

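False. A single PUT operation can upload an object of up to 5 GB. Larger objects, up to the 5 TB maximum object size, must be uploaded with multipart upload.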

Explain "Multi-part upload"

Amazon docs: "Multipart upload allows you to upload a single object as a set of parts. Each part is a contiguous portion of the object's data...In general, when your object size reaches 100 MB, you should consider using multipart uploads instead of uploading the object in a single operation."
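To give a feeling for what "a set of parts" means in practice, here is a Python sketch that splits an object of a given size into byte ranges. The limits used (parts of 5 MiB to 5 GiB, except the last one, and at most 10,000 parts per upload) are the documented S3 limits as far as I know; the function `plan_parts` and the 100 MiB default part size are assumptions made for this example. The actual upload calls are not shown.

```python
MIB = 1024 ** 2
GIB = 1024 ** 3

# Assumed S3 multipart limits; verify against the current documentation.
MIN_PART_SIZE = 5 * MIB
MAX_PART_SIZE = 5 * GIB
MAX_PARTS = 10_000


def plan_parts(object_size, part_size=100 * MIB):
    """Return (part_number, start, end) tuples; 'end' is exclusive."""
    if object_size <= 0:
        raise ValueError("object_size must be positive")
    if not MIN_PART_SIZE <= part_size <= MAX_PART_SIZE:
        raise ValueError("part_size is outside the allowed range")
    part_count = -(-object_size // part_size)  # ceiling division
    if part_count > MAX_PARTS:
        raise ValueError("too many parts, use a larger part_size")
    parts = []
    for number in range(1, part_count + 1):
        start = (number - 1) * part_size
        end = min(start + part_size, object_size)  # last part may be smaller
        parts.append((number, start, end))
    return parts


for number, start, end in plan_parts(250 * MIB):
    print(f"part {number}: bytes {start}..{end - 1}")
```

Each part can be uploaded independently and retried on its own, which is where the speed and resilience benefits of multipart upload come from.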

Objects

Explain "Object Versioning"

When enabled at the bucket level, versioning keeps every version of an object: uploading a file with an existing key creates a new version instead of overwriting the previous one. This makes it easy to roll back and protects your data from being permanently deleted.
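The behaviour can be modelled with a small in-memory class. This is a toy sketch, not the S3 API: the class `VersionedBucket` is made up, versions are identified by a simple list index instead of the opaque version IDs S3 generates, and a delete is represented by a `None` entry playing the role of a delete marker.

```python
class VersionedBucket:
    """Toy model of a bucket with versioning enabled."""

    def __init__(self):
        self._versions = {}  # key -> list of payloads; None is a delete marker

    def put(self, key, data):
        """Store a new version and return its index."""
        history = self._versions.setdefault(key, [])
        history.append(data)
        return len(history) - 1

    def delete(self, key):
        """Add a delete marker; earlier versions stay available."""
        self._versions.setdefault(key, []).append(None)

    def get(self, key, version=None):
        """Return the latest version, or a specific one when given."""
        history = self._versions.get(key, [])
        if not history:
            raise KeyError(key)
        data = history[-1] if version is None else history[version]
        if data is None:
            raise KeyError(key)  # the requested version is a delete marker
        return data


bucket = VersionedBucket()
bucket.put("report.txt", "first draft")
bucket.put("report.txt", "final")
bucket.delete("report.txt")

# The latest version is a delete marker, but older versions can be restored.
print(bucket.get("report.txt", version=1))
```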

Explain the following:

Object Lifecycles
Object Sharing

Object Lifecycles - Transfer objects between storage classes based on defined rules of time periods

Object Sharing - Share objects via a URL link
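A lifecycle configuration boils down to rules of the form "after N days, move the object to class X". The Python sketch below shows that idea with a made-up rule set; the day thresholds are illustrative only, the storage class identifiers follow the names used by the S3 API, and the function `storage_class_for_age` is hypothetical.

```python
# Hypothetical rules: (minimum object age in days, target storage class)
LIFECYCLE_RULES = [
    (30, "STANDARD_IA"),
    (90, "GLACIER"),
    (365, "DEEP_ARCHIVE"),
]


def storage_class_for_age(age_days, rules=LIFECYCLE_RULES, default="STANDARD"):
    """Return the class an object of the given age would have transitioned to."""
    current = default
    for min_age_days, storage_class in sorted(rules):
        if age_days >= min_age_days:
            current = storage_class
    return current


for age in (0, 30, 100, 400):
    print(f"{age} days old -> {storage_class_for_age(age)}")
```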

Explain Object Durability and Object Availability

Object Durability: The percent over a one-year time period that a file will not be lost

Object Availability: The percent over a one-year time period that a file will be accessible
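Both percentages become easier to grasp when turned into concrete numbers. The Python sketch below does the arithmetic, reading the percentages as simple averages over a year; the function names are made up for this example.

```python
MINUTES_PER_YEAR = 365 * 24 * 60


def expected_objects_lost_per_year(object_count, durability_percent):
    """Average number of objects lost per year at the given durability."""
    return object_count * (1 - durability_percent / 100)


def allowed_downtime_minutes_per_year(availability_percent):
    """Minutes per year a service may be unreachable at the given availability."""
    return MINUTES_PER_YEAR * (1 - availability_percent / 100)


# 10 million objects at 11 nines of durability
print(expected_objects_lost_per_year(10_000_000, 99.999999999))
# 99.99% availability
print(allowed_downtime_minutes_per_year(99.99))
```

With 11 nines of durability and 10 million stored objects, the expected loss is about 0.0001 objects per year, in other words a single object roughly once every 10,000 years. An availability of 99.99% corresponds to a little under 53 minutes of unavailability per year.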

Security

True or False? Every new S3 bucket is public by default

False. A newly created bucket is private unless it was configured to be public.

What's a presigned URL?

Since every newly created bucket is private by default, its files can't be shared with other users: anyone without credentials who tries to view an object through its plain URL gets denied.

A presigned URL is a way to bypass that and allow sharing the files with users by including the credentials (token) as part of the URL. It can be done for limited time.
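The general idea of a signed, time-limited URL can be shown with a few lines of Python. Note that this is not the algorithm S3 uses (S3 presigned URLs are based on AWS Signature Version 4); it is a simplified HMAC sketch of the same concept, and the secret, the URL and the function names are all made up for this example.

```python
import hashlib
import hmac
from urllib.parse import parse_qs, urlencode, urlparse

SECRET_KEY = b"hypothetical-secret"  # stands in for the signer's credentials


def _signature(base_url, expires_at, secret):
    message = f"{base_url}\n{expires_at}".encode("utf-8")
    return hmac.new(secret, message, hashlib.sha256).hexdigest()


def sign_url(base_url, expires_at, secret=SECRET_KEY):
    """Append an expiry timestamp and a signature covering URL and expiry."""
    query = urlencode({
        "expires": expires_at,
        "signature": _signature(base_url, expires_at, secret),
    })
    return f"{base_url}?{query}"


def verify_url(url, now, secret=SECRET_KEY):
    """Accept the URL only if the signature matches and it has not expired."""
    parsed = urlparse(url)
    query = parse_qs(parsed.query)
    try:
        expires_at = int(query["expires"][0])
        signature = query["signature"][0]
    except (KeyError, ValueError):
        return False
    base_url = f"{parsed.scheme}://{parsed.netloc}{parsed.path}"
    expected = _signature(base_url, expires_at, secret)
    return hmac.compare_digest(signature, expected) and now < expires_at


url = sign_url("https://files.example.com/report.pdf", expires_at=1_000)
print(verify_url(url, now=999))    # still valid
print(verify_url(url, now=1_000))  # expired
```

Because the signature covers both the path and the expiry time, a recipient cannot extend the lifetime of the link or point it at a different object without invalidating it.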

What security measures have you taken in context of S3?

Don't make a bucket public.

Enable encryption if it's disabled.

Define an access policy

True or False? In case of SSE-S3 encryption, you manage the key

False. S3 manages the key and uses AES-256 algorithm for the encryption.

True or False? In case of SSE-C encryption, both S3 and you manage the keys

False. You manage the keys. The C in SSE-C stands for customer-provided keys.

True or False? Traffic between a host and S3 (e.g. uploading a file) is encrypted using SSL/TLS

True

Misc

What is a storage class? What storage classes are there?

Each object has a storage class assigned to it, which affects its availability and durability as well as its cost. Storage classes offered today:

Standard:

Used for general, all-purpose storage (mostly storage that needs to be accessed frequently)

The most expensive storage class

99.999999999% (11 nines) durability

99.99% availability

Default storage class

Standard-IA (Infrequent Access):

Long-lived, infrequently accessed data that must still be available the moment it's accessed

99.999999999% (11 nines) durability

99.90% availability

One Zone-IA (Infrequent Access):

Long-lived, infrequently accessed, non-critical data

Less expensive than Standard and Standard-IA storage classes

99.999999999% (11 nines) durability, but within a single AZ only (data is lost if that AZ is destroyed)

99.50% availability

Intelligent-Tiering:

Long-lived data with changing or unknown access patterns. In this class, the data automatically moves to the access tier most suitable for you based on usage patterns

Price depends on the tier in use

99.999999999% (11 nines) durability

99.90% availability

Glacier: Archive data with retrieval time ranging from minutes to hours

Glacier Deep Archive: Archive data that rarely, if ever, needs to be accessed with retrieval times in hours

Both Glacier and Glacier Deep Archive are:

The cheapest storage classes

have 99.999999999% (11 nines) durability

More on storage classes here

A customer would like to move data which is rarely accessed from the Standard storage class to the cheapest class there is. Which storage class should be used?

One Zone-IA
Glacier Deep Archive
Intelligent-Tiering

Glacier Deep Archive

What Glacier retrieval options are available for the user?

Expedited, Standard and Bulk

True or False? Each AWS account can store up to 500 PetaByte of data. Any additional storage will cost double

False. Unlimited capacity.

Explain what is Storage Gateway

"AWS Storage Gateway is a hybrid cloud storage service that gives you on-premises access to virtually unlimited cloud storage". More on Storage Gateway here

Explain the following Storage Gateway deployment types

File Gateway
Volume Gateway
Tape Gateway

Explained in detail here

What is the difference between stored volumes and cached volumes?

Stored Volumes - Data is located at customer's data center and periodically backed up to AWS

Cached Volumes - Data is stored in AWS cloud and cached at customer's data center for quick access

What is "Amazon S3 Transfer Acceleration"?

AWS definition: "Amazon S3 Transfer Acceleration enables fast, easy, and secure transfers of files over long distances between your client and an S3 bucket"

Learn more here

Explain data consistency

S3 provides strong read-after-write consistency for PUT and DELETE requests of objects in the S3 bucket in all AWS Regions. A read that follows a successful write always returns the latest version of the object.

Can you host dynamic websites on S3? What about static websites?

No. S3 supports only static website hosting. On a static website, individual webpages include static content. They might also contain client-side scripts. By contrast, a dynamic website relies on server-side processing, including server-side scripts such as PHP, JSP, or ASP.NET. Amazon S3 does not support server-side scripting.

Disaster Recovery

In regards to disaster recovery, what is RTO and RPO?

RTO - The maximum acceptable length of time that your application can be offline.

RPO - The maximum acceptable length of time during which data might be lost from your application due to an incident.
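
A small sketch of how the two objectives can be checked against a backup plan. It assumes the worst-case data loss equals the interval between backups and that recovery time equals the restore duration; `meets_objectives` and the numbers are hypothetical.

```python
def meets_objectives(
    backup_interval_minutes: int,
    restore_minutes: int,
    rpo_minutes: int,
    rto_minutes: int,
) -> dict[str, bool]:
    """Compare a backup plan with the RPO and RTO targets."""
    return {
        # Worst case: the incident happens right before the next backup.
        "rpo_met": backup_interval_minutes <= rpo_minutes,
        # The application is offline for as long as the restore takes.
        "rto_met": restore_minutes <= rto_minutes,
    }


# Hourly backups and a 30-minute restore, against RPO = 15 min and RTO = 60 min.
result = meets_objectives(60, 30, rpo_minutes=15, rto_minutes=60)
print(result)
```

In this example the plan meets the RTO but not the RPO, so backups would have to run at least every 15 minutes.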

What types of disaster recovery techniques does AWS support?

The Cold Method - Periodic backups that are sent off-site

Pilot Light - Data is mirrored to an environment which is always running

Warm Standby - Running scaled down version of production environment

Multi-site - Duplicated environment that is always running

Which disaster recovery option has the highest downtime and which has the lowest?

Lowest - Multi-site

Highest - The cold method

CloudFront

Explain what is CloudFront

AWS definition: "Amazon CloudFront is a fast content delivery network (CDN) service that securely delivers data, videos, applications, and APIs to customers globally with low latency, high transfer speeds, all within a developer-friendly environment."

More on CloudFront here

Explain the following

Origin
Edge location
Distribution

What delivery methods are available for the user with CDN?

True or False? Objects are cached for the life of TTL

True

What is AWS Snowball?

A transport solution designed for transferring large amounts of data (petabyte-scale) into and out of the AWS cloud.

ELB

What is ELB (Elastic Load Balancing)?

AWS Docs: "Elastic Load Balancing automatically distributes incoming application traffic across multiple targets, such as Amazon EC2 instances, containers, IP addresses, and Lambda functions."

True or False? Elastic Load Balancer is a managed resource (= AWS takes care of it)

True. AWS is responsible for making sure ELB is operational and takes care of lifecycle operations like upgrades, maintenance and high availability.

What types of AWS load balancers are there?

Classic Load Balancer (CLB): Mainly for TCP (layer 4) and HTTP, HTTPS (layer 7)

Application Load Balancer (ALB): Mainly for HTTP, HTTPS and WebSocket

Network Load Balancer (NLB): Mainly for TCP, TLS and UDP

Gateway Load Balancer (GWLB): Mainly for layer 3 operations (IP protocol)

What's a "listener" in regards to ELB?

What's a "target group" in regards to ELB?

Which load balancer would you use for services which use HTTP or HTTPS traffic?

Application Load Balancer (ALB).

What are some use cases for using Gateway Load Balancer?

Intrusion Detection

Firewall

Payload manipulation

Explain "health checks" in the context of AWS ELB

Health checks are used by ELB to check whether EC2 instance(s) are working properly.
If health checks fail, ELB knows to not forward traffic to that specific EC2 instance where the health checks failed.

True or False? AWS ELB health checks are done on a port and a route

True.

For example, port 2017 and endpoint /health.
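
The effect of health checks on routing can be sketched as follows. This is a simplification: a target counts as healthy when its last check returned HTTP 200, whereas a real load balancer also applies healthy/unhealthy thresholds over several checks. `route_request` and the instance names are hypothetical.

```python
def route_request(last_check_status: dict[str, int], request_number: int) -> str:
    """Pick a target round-robin, skipping targets whose health check failed."""
    healthy = sorted(name for name, status in last_check_status.items() if status == 200)
    if not healthy:
        raise RuntimeError("no healthy targets available")
    return healthy[request_number % len(healthy)]


# Result of the latest GET /health on each instance (hypothetical values).
statuses = {"i-a": 200, "i-b": 503, "i-c": 200}
for n in range(4):
    print(n, "->", route_request(statuses, n))
```

The failing instance `i-b` never receives traffic until its health check passes again.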

What types of load balancers are supported in EC2 and what are they used for?

Application LB - layer 7 traffic

Network LB - ultra-high performances or static IP address (layer 4)

Classic LB - low costs, good for test or dev environments (previous generation; Classic Load Balancers on the EC2-Classic network were retired on August 15, 2022)

Gateway LB - a transparent network gateway that distributes traffic to virtual appliances such as firewalls, intrusion detection and prevention systems, and deep packet inspection systems (layer 3)

Which type of AWS load balancer is used in the following drawing?

Application Load Balancer (routing based on different endpoints + HTTP is used).

What are possible target groups for ALB (Application Load Balancer)?

EC2 instances

ECS tasks

Lambda functions

Private IP Addresses

True or False? ALB can route only to a single target group

False. ALB can route to multiple target groups.

If you wanted to analyze network traffic, you would use the `____ load balancer`

Gateway Load Balancer

Who has better latency? Application Load Balancer or Network Load Balancer?

Network Load Balancer. Commonly cited figures are ~100 ms for NLB versus ~400 ms for ALB.

True or False? Network load balancer has one static IP per availability zone

True.

What are the supported target groups for network load balancer?

EC2 instance

IP addresses

Application Load Balancer

What are the supported target groups for gateway load balancer?

EC2 instance

IP addresses (must be private IPs)

Name one use case for using application load balancer as a target group for network load balancer

You might want to have a fixed IP address (NLB) and then forward HTTP traffic based on path, query, ... which is then done by ALB

What are some use cases for using Network Load Balancer?

TCP, UDP traffic

Extreme performance

True or False? Network load balancers operate in layer 4

True. They forward TCP, UDP traffic.

True or False? It's possible to enable sticky session for network load balancer so the same client is always redirected to the same instance

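True. Network Load Balancer target groups support sticky sessions based on the client's source IP address, while Classic Load Balancer and Application Load Balancer implement stickiness with cookies.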

Explain Cross Zone Load Balancing

With cross zone load balancing, traffic is distributed evenly across all (registered) instances in all the availability zones.
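
The difference is easiest to see with numbers. The sketch below assumes incoming traffic is split evenly between the load balancer nodes (one per AZ) and computes the share of total traffic each instance receives; `share_per_instance` is a hypothetical helper.

```python
def share_per_instance(instances_per_az: dict[str, int], cross_zone: bool) -> dict[str, float]:
    """Fraction of total traffic received by a single instance in each AZ."""
    if not instances_per_az or any(count < 1 for count in instances_per_az.values()):
        raise ValueError("every AZ needs at least one instance in this sketch")
    total_instances = sum(instances_per_az.values())
    az_count = len(instances_per_az)
    shares = {}
    for az, count in instances_per_az.items():
        if cross_zone:
            # All instances are treated as one pool.
            shares[az] = 1 / total_instances
        else:
            # Each AZ gets an equal slice, divided among its own instances.
            shares[az] = (1 / az_count) / count
    return shares


fleet = {"az-a": 2, "az-b": 8}
print(share_per_instance(fleet, cross_zone=True))
print(share_per_instance(fleet, cross_zone=False))
```

With 2 instances in one AZ and 8 in another, every instance gets 10% of the traffic with cross zone load balancing; without it, the 2 instances get 25% each and the 8 instances get 6.25% each.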

True or False? For network load balancer, cross zone load balancing is always on and can't be disabled

False. It's disabled by default

True or False? In regards to cross zone load balancing, AWS charges you for inter AZ data in network load balancer but not in application load balancer

True. It charges for inter AZ data in network load balancer, but not in application load balancer

True or False? Both ALB and NLB support multiple listeners with multiple SSL certificates

True

Explain Deregistration Delay (or Connection Draining) in regards to ELB

The period of time or process of "draining" instances from requests/traffic (basically let it complete all active connections but don't start new ones) so it can be de-registered eventually and ELB won't send requests/traffic to it anymore.

ALB

True or False? With ALB (Application Load Balancer) it's possible to do routing based on query string and/or headers

True.
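
Content-based routing of this kind can be pictured as an ordered list of rules, each forwarding to a target group. The rules, target group names and `choose_target_group` function below are hypothetical; a real ALB is configured through listener rules rather than code.

```python
# Rules are evaluated in order; the first match wins.
RULES = [
    (lambda request: request["path"].startswith("/api/"), "api-servers"),
    (lambda request: request["query"].get("platform") == "mobile", "mobile-servers"),
    (lambda request: request["headers"].get("X-Beta") == "true", "beta-servers"),
]
DEFAULT_TARGET_GROUP = "web-servers"


def choose_target_group(request: dict) -> str:
    """Return the target group of the first matching rule, or the default."""
    for matches, target_group in RULES:
        if matches(request):
            return target_group
    return DEFAULT_TARGET_GROUP


request = {"path": "/home", "query": {"platform": "mobile"}, "headers": {}}
print(choose_target_group(request))
```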

True or False? For application load balancer, cross zone load balancing is always on and can't be disabled

True

Auto Scaling Group

Explain Auto Scaling Group

Amazon Docs: "An Auto Scaling group contains a collection of Amazon EC2 instances that are treated as a logical grouping for the purposes of automatic scaling and management. An Auto Scaling group also enables you to use Amazon EC2 Auto Scaling features such as health check replacements and scaling policies"

You have two instances running as part of ASG. You change the desired capacity to 1. What will be the outcome of this change?

One of the instances will be terminated.

How can you customize the trigger for the scaling in/out of an auto scaling group?

One way is to use CloudWatch alarms where an alarm will monitor a metric and based on a certain value (or range) you can choose to scale-in or scale-out the ASG.

What are some metrics/rules used for auto scaling

Network In/Out

Number of requests on ELB per instance

Average CPU, RAM usage

What is dynamic Scaling policy in regards to Auto Scaling Groups?

A policy in which scaling will occur automatically based on different metrics.

There are 3 types:

Target Tracking Scaling: scale when the baseline changes (e.g. CPU is over 60%)

Step Scaling: more granular scaling where you can choose different actions for different metric values (e.g. when CPU is less than 20%, remove one instance. When CPU is over 40%, add 3 instances)

Scheduled Actions: set in advance scaling for specific period of time (e.g. add instances on Monday between 10:00 am to 11:00 am)
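
The step scaling example above (below 20% CPU remove one instance, above 40% add three) can be written as a small function. The clamping to a minimum and maximum size mirrors the bounds an ASG has; the function names and the bounds used here are assumptions.

```python
def step_adjustment(cpu_percent: float) -> int:
    """Number of instances to add (positive) or remove (negative)."""
    if cpu_percent < 20:
        return -1
    if cpu_percent > 40:
        return 3
    return 0


def next_capacity(current: int, cpu_percent: float, minimum: int, maximum: int) -> int:
    """Apply the step and keep the result within the group's size limits."""
    desired = current + step_adjustment(cpu_percent)
    return max(minimum, min(maximum, desired))


for cpu in (10, 30, 75):
    print(f"CPU {cpu}% -> capacity {next_capacity(2, cpu, minimum=1, maximum=4)}")
```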

What is a predictive scaling policy in regards to Auto Scaling Groups?

Scale by analyzing historical load and schedule scaling based on forecast load.

Explain scaling cooldowns in regards to Auto Scaling Groups

During a scaling cooldown, ASG will not terminate or launch additional instances. The cooldown happens after a scaling activity, and the reason for this behaviour is that some metrics have to be collected and stabilize before another scaling operation can take place.
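
A minimal sketch of the cooldown logic: a scaling request is accepted only if enough time has passed since the previous scaling activity. `CooldownGate` is a hypothetical class, time is passed in explicitly as seconds, and the 300-second value is just an example.

```python
class CooldownGate:
    """Reject scaling requests that arrive during the cooldown period."""

    def __init__(self, cooldown_seconds: int = 300):
        self.cooldown_seconds = cooldown_seconds
        self.last_activity = None  # time of the last accepted scaling activity

    def try_scale(self, now: int) -> bool:
        in_cooldown = (
            self.last_activity is not None
            and now - self.last_activity < self.cooldown_seconds
        )
        if in_cooldown:
            return False
        self.last_activity = now
        return True


gate = CooldownGate(cooldown_seconds=300)
decisions = [gate.try_scale(t) for t in (0, 100, 300)]
print(decisions)
```

The request at t=100 is ignored because the group is still stabilizing after the activity at t=0.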

Explain the default ASG termination policy

It finds the AZ with the largest number of EC2 instances

If that AZ has more than one instance, it chooses the one with the oldest launch configuration or launch template and terminates it
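
The two steps above can be sketched like this. It is a simplification of the real default policy (which has further tie-breaking steps); the instance records, the `config_version` field (lower means older) and the alphabetical tie-break between AZs are assumptions made for the example.

```python
from collections import Counter


def pick_instance_to_terminate(instances: list[dict]) -> str:
    """Return the id of the instance this simplified policy would terminate."""
    if not instances:
        raise ValueError("no instances to choose from")
    per_az = Counter(instance["az"] for instance in instances)
    # AZ with the most instances; ties are broken alphabetically in this sketch.
    busiest_az = max(sorted(per_az), key=lambda az: per_az[az])
    candidates = [i for i in instances if i["az"] == busiest_az]
    # Among those, the instance with the oldest launch configuration/template.
    oldest = min(candidates, key=lambda i: i["config_version"])
    return oldest["id"]


fleet = [
    {"id": "i-1", "az": "az-a", "config_version": 2},
    {"id": "i-2", "az": "az-a", "config_version": 1},
    {"id": "i-3", "az": "az-b", "config_version": 1},
]
print(pick_instance_to_terminate(fleet))
```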

True or False? By default, ASG tries to balance the number of instances across AZs

True, this is why when it terminates instances, it chooses the AZ with the most instances.

Explain Lifecycle hooks in regards to Auto Scaling Groups

Lifecycle hooks allow you to perform extra steps before the instance goes in service (during the pending state) or before it terminates (during the terminating state).

If you use ASG and you would like to run extra steps before the instance goes in service, what will you use?

Lifecycle hooks in pending state.

Describe one way to test ASG actually works

On Linux instances, you can install the 'stress' package and run stress to load the system for a certain period of time and see if ASG kicks in by adding additional capacity (= more instances).

Security

What is the shared responsibility model? What is AWS responsible for and what is the user responsible for based on the shared responsibility model?

The shared responsibility model defines what the customer is responsible for and what AWS is responsible for.

More on the shared responsibility model here

True or False? Based on the shared responsibility model, Amazon is responsible for physical CPUs and security groups on instances

False. It is responsible for hardware in its sites but not for security groups, which are created and managed by the users.

Explain "Shared Controls" in regards to the shared responsibility model

AWS definition: "apply to both the infrastructure layer and customer layers, but in completely separate contexts or perspectives. In a shared control, AWS provides the requirements for the infrastructure and the customer must provide their own control implementation within their use of AWS services"

Learn more about it here

What is the AWS compliance program?

How to secure instances in AWS?

Instance IAM roles should have minimal permissions needed. You don't want an instance-level incident to become an account-level incident

Use "AWS Systems Manager Session Manager" instead of direct SSH access

Use the latest OS images with your instances

What is AWS Artifact?

AWS definition: "AWS Artifact is your go-to, central resource for compliance-related information that matters to you. It provides on-demand access to AWS’ security and compliance reports and select online agreements."

Read more about it here

What is AWS Inspector?

AWS definition: "Amazon Inspector is an automated security assessment service that helps improve the security and compliance of applications deployed on AWS. Amazon Inspector automatically assesses applications for exposure, vulnerabilities, and deviations from best practices."

Learn more here

What is AWS GuardDuty?

AWS definition: "Amazon GuardDuty is a threat detection service that continuously monitors for malicious activity and unauthorized behavior to protect your Amazon Web Services accounts, workloads, and data stored in Amazon S3"
It monitors VPC Flow Logs, DNS logs, CloudTrail S3 data events and CloudTrail management events.

What is AWS Shield?

AWS definition: "AWS Shield is a managed Distributed Denial of Service (DDoS) protection service that safeguards applications running on AWS."

What is AWS WAF? Give an example of how it can be used and describe what resources or services you can use it with

What is AWS VPN used for?

What is the difference between Site-to-Site VPN and Client VPN?

What is AWS CloudHSM?

Amazon definition: "AWS CloudHSM is a cloud-based hardware security module (HSM) that enables you to easily generate and use your own encryption keys on the AWS Cloud."

Learn more here

True or False? AWS Inspector can perform both network and host assessments

True

What is AWS Key Management Service (KMS)?

AWS definition: "KMS makes it easy for you to create and manage cryptographic keys and control their use across a wide range of AWS services and in your applications." More on KMS here

What is AWS Acceptable Use Policy?

It describes prohibited uses of the web services offered by AWS. More on AWS Acceptable Use Policy here

True or False? A user is not allowed to perform penetration testing on any of the AWS services

False. On some services, like EC2, CloudFront and RDS, penetration testing is allowed.

True or False? DDoS attack is an example of allowed penetration testing activity

False.

True or False? AWS Access Key is a type of MFA device used for AWS resources protection

False. Security key is an example of an MFA device.

What is Amazon Cognito?

Amazon definition: "Amazon Cognito handles user authentication and authorization for your web and mobile apps."

Learn more here

What is AWS ACM?

Amazon definition: "AWS Certificate Manager is a service that lets you easily provision, manage, and deploy public and private Secure Sockets Layer/Transport Layer Security (SSL/TLS) certificates for use with AWS services and your internal connected resources."

Learn more here

Databases

RDS

What is AWS RDS?

Relational Database Service

Managed DB service (you can't SSH into the underlying machine)

Supports multiple DBs: MySQL, Oracle, Aurora (AWS Proprietary), ...

Why use AWS RDS instead of launching an EC2 instance and installing a database on it?

AWS RDS is a managed service, that means it's automatically provisioned and patched for you.

In addition, it provides you with continuous backup (and the ability to restore from any point of time), scaling capability (both horizontal and vertical), monitoring dashboard and read replicas.

What do you know about RDS backups?

Automated backups

Full daily backup (done during maintenance window)

Transactions logs backup every 5 minutes

Retention can be increased and by default it's 7 days

Explain AWS RDS Storage Auto Scaling

RDS storage can be increased automatically when the database runs low on free storage

The user needs to set "Maximum Storage Threshold" to have some limit on storage scaling

Use cases: applications with unpredictable workloads

Supports multiple RDS database engines

Explain Amazon RDS Read Replicas

AWS Docs: "Amazon RDS Read Replicas provide enhanced performance and durability for RDS database (DB) instances. They make it easy to elastically scale out beyond the capacity constraints of a single DB instance for read-heavy database workloads."

In simpler words, it allows you to scale your reads.

True or False? RDS read replicas are supported within az, cross az and cross region

True

True or False? RDS read replicas are asynchronous

True. Replication to read replicas is asynchronous, which means reads from a replica are eventually consistent.

True or False? Amazon RDS supports MongoDB

False. RDS is a relational database service and MongoDB is a NoSQL database.

What are some use cases for using RDS read replicas?

You have a main application which works against your database but you would like to add additional app, one used for logging, analytics, ... so you prefer it won't use the same database. In this case, you create a read replica instance and the second application works against that instance.

Explain RDS Multi Availability Zone

RDS Multi-AZ is used mainly for disaster recovery purposes

There is an RDS master instance and in another AZ an RDS standby instance

The data is synced synchronously between them

The user or application accesses a single DNS name. When the master instance fails, the DNS name is pointed to the standby instance, so the failover is done automatically

True or False? Moving AWS RDS from single AZ to multi AZ is an operation with downtime (meaning there is a need to stop the DB)

False. It's a zero downtime operation = no need to stop the database.

How does AWS RDS switch from single AZ to multi AZ?

Snapshot is taken by RDS

The snapshot is restored to another, standby, RDS instance

Synchronization is enabled between the two instances

True or False? RDS encryption should be defined at launch time

True

True or False? In regards to RDS, replicas can be encrypted even if the master isn't encrypted

False

How to make RDS snapshots encrypted?

If RDS database is encrypted then, the snapshot itself is also encrypted

If the RDS database isn't encrypted, then the snapshot isn't encrypted either, but you can copy the un-encrypted snapshot to create an encrypted copy

How to encrypt an un-encrypted RDS instance?

Create a snapshot of the un-encrypted instance -> copy the snapshot to create an encrypted copy -> restore the database from the encrypted snapshot -> migrate the application to work against the new instance -> remove the original DB instance

How does IAM authentication work with RDS?

For example:

EC2 instance uses IAM role to make an API call to get auth token

The token, with SSL encryption, is used for accessing the RDS instance

Note: The token has a lifetime of 15 minutes

True or False? In case of RDS (not Aurora), read replicas require you to change the SQL connection string

True. Since read replicas add endpoints, each with its own DNS name, you need to modify your app to reference these new endpoints to balance the read load.

Aurora

What do you know about Amazon Aurora?

A MySQL- and PostgreSQL-compatible relational database.

Proprietary technology from AWS

The default database proposed for the user when using RDS for creating a database.

Storage automatically grows in increments of 10 GiB

Native HA with near-instant failover

Has better performance than MySQL and Postgres

Supports 15 replicas (while MySQL supports 5)

True or False? Aurora stores 4 copies of your data across 2 availability zones

False. It stores 6 copies across 3 availability zones

True or False? Aurora supports self-healing, where corrupted data is replaced by doing peer-to-peer replication

True

True or False? Aurora storage is striped across 20 volumes

False. 100 volumes.

True or False? It's possible to scale Aurora replicas

True. If your read replica instances exhaust their CPU, you can scale by adding more instances

Explain Aurora Serverless. What use cases is it good for?

Aurora Serverless instantiates the database automatically and scales it based on actual usage

It's good mainly for infrequent or unpredictable workloads

You pay per second so it can eventually be more cost effective

What is the use case for Aurora multi-master?

Aurora multi-master is perfect for a use case where you want to have instant failover for write node.

DynamoDB

What is AWS DynamoDB?

Explain "Point-in-Time Recovery" feature in DynamoDB

Amazon definition: "You can create on-demand backups of your Amazon DynamoDB tables, or you can enable continuous backups using point-in-time recovery. For more information about on-demand backups, see On-Demand Backup and Restore for DynamoDB."

Learn more here

Explain "Global Tables" in DynamoDB

Amazon definition: "A global table is a collection of one or more replica tables, all owned by a single AWS account."

Learn more here

What is DynamoDB Accelerator?

Amazon definition: "Amazon DynamoDB Accelerator (DAX) is a fully managed, highly available, in-memory cache for DynamoDB that delivers up to a 10x performance improvement – from milliseconds to microseconds..."

Learn more here

ElastiCache

What is AWS ElastiCache? In what use case should it be used?

Amazon ElastiCache is a fully managed Redis or Memcached in-memory data store.
It's great for read-intensive workloads where the common data/queries are cached and apps/users access the cache instead of the primary database.

Describe the workflow of an application using the cache in AWS

The application needs data from the DB. First, there is a check to see if the data is in the cache

If it is, it's a "cache hit" and the data is retrieved from there

If it's not in there, it's a "cache miss" and the data is pulled from the database

The data is then also written to the cache (assuming it is often accessed) and next time the user queries for the same data, it might be retrieved from the cache (depends on how much time passed and whether this specific data was invalidated or not)
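
The workflow above can be sketched with plain dictionaries standing in for the cache (ElastiCache) and the database. `CachedStore` is a hypothetical class; it shows the read path described above and, for comparison, a write path that updates the cache together with the database.

```python
class CachedStore:
    """Read through a cache; fall back to the database on a miss."""

    def __init__(self, database: dict):
        self.database = database  # stand-in for the primary database
        self.cache = {}           # stand-in for the in-memory cache
        self.hits = 0
        self.misses = 0

    def get(self, key):
        if key in self.cache:
            self.hits += 1                # cache hit: serve from the cache
            return self.cache[key]
        self.misses += 1                  # cache miss: go to the database
        value = self.database[key]
        self.cache[key] = value           # populate the cache for next time
        return value

    def write_through(self, key, value):
        self.database[key] = value
        self.cache[key] = value           # keep the cache in sync on writes


store = CachedStore({"user:1": "Kratos"})
first = store.get("user:1")   # miss, loaded from the database
second = store.get("user:1")  # hit, served from the cache
print(first, second, store.hits, store.misses)
```

A real cache would also expire or invalidate entries, which this sketch leaves out.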

How can you make an application stateless using ElastiCache?

Let's say you have multiple instances running the same application and every time you use the application, it creates a user session.
This user session can be stored in ElastiCache so even if the user contacts a different instance of the application, the application can retrieve the session from ElastiCache.

You need a highly available cache with backup and restore features. Which one would you use?

ElastiCache Redis.

You need a cache with read replicas that can be scaled and that supports multi-AZ. Which one would you use?

ElastiCache Redis.

You need a cache that supports sharding and built with multi-threaded architecture in mind. Which one would you use?

ElastiCache Memcached

True or False? ElastiCache doesn't support IAM authentication

True.

What patterns are there for loading data into the cache?

Write Through: add or update data in the cache when the data is written to the DB

Lazy Loading: all the read data is cached

Session Store: store temporary session data in cache

RedShift

What is AWS Redshift and how is it different than RDS?

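Redshift is a cloud data warehouse: it is built for analytical (OLAP) queries over large datasets, whereas RDS is a managed relational database service aimed at transactional (OLTP) workloads

What do you do if you suspect AWS Redshift performs slowly?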

You can confirm your suspicion by going to AWS Redshift console and see running queries graph. This should tell you if there are any long-running queries.

If confirmed, you can query for running queries and cancel the irrelevant queries

Check for connection leaks (query for running connections and include their IP)

Check for table locks and kill irrelevant locking sessions

What is Amazon DocumentDB?

Amazon definition: "Amazon DocumentDB (with MongoDB compatibility) is a fast, scalable, highly available, and fully managed document database service that supports MongoDB workloads. As a document database, Amazon DocumentDB makes it easy to store, query, and index JSON data."

Learn more here

What is "AWS Database Migration Service" used for?

What type of storage is used by Amazon RDS?

EBS

VPC

What is VPC?

"A logically isolated section of the AWS cloud where you can launch AWS resources in a virtual network that you define" Read more about it here.

True or False? By default, any new account has a default VPC

True

True or False? Default VPC doesn't have internet connectivity and any launched EC2 will only have a private IP assigned

False. The default VPC has internet connectivity and any launched EC2 instance gets a public IPv4 address.

In addition, any launched EC2 instance gets both a public and a private DNS name.

True or False? VPC spans multiple regions

False

True or False? It's possible to have multiple VPCs in one region

True. As of today, the soft limit is 5.

True or False? Subnets that belong to the same VPC can be in different availability zones

True. Just to clarify, a single subnet resides entirely in one AZ.

You have noticed your VPC's subnets (which use x.x.x.x/20 CIDR) have 4091 available IP addresses although this CIDR should have 4096 addresses. What is the reason for that?

AWS reserves 5 IP addresses in each subnet - first 4 and the last one, and so they aren't available for use.

What does AWS use the 5 reserved IP addresses for?

x.x.x.0 - network address

x.x.x.1 - VPC router

x.x.x.2 - DNS mapping

x.x.x.3 - future use

x.x.x.255 - broadcast address
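
The reserved addresses and the resulting usable count can be computed with Python's standard `ipaddress` module. The helper names and the example CIDR blocks are illustrative; the only AWS-specific input is the figure of 5 reserved addresses per subnet (first four and last).

```python
import ipaddress

AWS_RESERVED_PER_SUBNET = 5


def usable_addresses(cidr: str) -> int:
    """Addresses left for your resources after the AWS reservations."""
    return ipaddress.ip_network(cidr).num_addresses - AWS_RESERVED_PER_SUBNET


def reserved_addresses(cidr: str) -> list[str]:
    """The first four addresses and the last address of the subnet."""
    network = ipaddress.ip_network(cidr)
    first_four = [str(network.network_address + offset) for offset in range(4)]
    return first_four + [str(network.broadcast_address)]


print(usable_addresses("10.0.0.0/20"))
print(reserved_addresses("10.0.0.0/24"))
```

For a /20 subnet this gives 4096 - 5 = 4091 usable addresses.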

What is an Internet Gateway?

AWS Docs: "component that allows communication between instances in your VPC and the internet"

In addition it's good to know that IGW is:

Highly available and redundant

Not providing internet access on its own (you need route tables to be edited)

Created separately from VPC

True or False? One or more VPCs can be attached to one Internet Gateway

False. Only one VPC can be attached to one IGW and vice versa

True or False? NACLs allow or deny traffic on the subnet level

True

What is VPC peering?

docs.aws: "A VPC peering connection is a networking connection between two VPCs that enables you to route traffic between them using private IPv4 addresses or IPv6 addresses."

True or False? Multiple Internet Gateways can be attached to one VPC

False. Only one internet gateway can be attached to a single VPC.

You've restarted your EC2 instance and the public IP has changed. How would you deal with it so it won't happen?

Use Elastic IP which provides you a fixed IP address.

When creating a new VPC, there is an option called "Tenancy". What is it used for?

What is an Elastic IP address?

AWS Docs: "An Elastic IP address is a static IPv4 address designed for dynamic cloud computing. An Elastic IP address is allocated to your AWS account, and is yours until you release it. By using an Elastic IP address, you can mask the failure of an instance or software by rapidly remapping the address to another instance in your account."

Why would you use an Elastic IP address?

Let's say you have an instance that you need to shut down or perform some maintenance on. In that case, you would move the Elastic IP address to another instance that is operational until you finish performing the maintenance, and then you can move it back to the original instance (or keep it assigned to the second one).

True or False? When stopping and starting an EC2 instance, its public IP changes

True

What are the best practices around Elastic IP?

The best practice is actually not using them in the first place. It's more common to put instances behind a load balancer (so they need no public IP), or to use a random public IP and register a DNS record for it

True or False? An Elastic IP is free, as long it's not associated with an EC2 instance

False. An Elastic IP is free of charge as long as it is associated with an EC2 instance. This instance should be running and should have only one Elastic IP.

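True or False? Route Tables are used to allow or deny traffic from the internet to AWS instances

False. Route tables determine where network traffic is directed. Allowing or denying traffic is the job of security groups and NACLs.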

Explain Security Groups and Network ACLs

NACL - security layer on the subnet level.

Security Group - security layer on the instance level.

Read more about it here and here

What is AWS Direct Connect?

Allows you to connect your corporate network to AWS network.

What would you use if you need a fixed public IP for your EC2 instance?

Elastic IP

Kratos, your colleague, decided to use a subnet of /27 because he needs 29 IP addresses for EC2 instances. Is Kratos right?

No. Since AWS reserves 5 IP addresses for every subnet, Kratos will have 32-5=27 addresses and this is less than what he needs (29).

It's better if Kratos uses a subnet of size /26 but good luck telling him that.
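
The arithmetic behind this answer can be turned into a small helper that finds the smallest subnet able to hold a given number of instances. `smallest_prefix_for` is a hypothetical function; it assumes IPv4 subnets between /28 and /16 and the 5 addresses AWS reserves per subnet.

```python
AWS_RESERVED_PER_SUBNET = 5


def smallest_prefix_for(hosts_needed: int) -> int:
    """Return the longest prefix (smallest subnet) that fits hosts_needed."""
    # Try the smallest subnets first: /28, /27, ... down to /16.
    for prefix in range(28, 15, -1):
        usable = 2 ** (32 - prefix) - AWS_RESERVED_PER_SUBNET
        if usable >= hosts_needed:
            return prefix
    raise ValueError("too many hosts for a single subnet in this sketch")


print(smallest_prefix_for(29))
```

A /27 offers 32 - 5 = 27 usable addresses, so 29 instances need a /26 (64 - 5 = 59 usable addresses).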

In order for AWS Lambda to have internet access

Identify the Service

What would you use for automating code/software deployments?

AWS CodeDeploy

You would like to invoke a function every time you enter a URL in the browser. Which service would you use for that?

AWS Lambda

What would you use for easily creating similar AWS environments/resources for different customers?

CloudFormation

Using which service, can you add user sign-up, sign-in and access control to mobile and web apps?

Cognito

Which service would you use for building a website or web application?

Lightsail

Which tool would you use for choosing between Reserved instances or On-Demand instances?

Cost Explorer

What would you use to check how many unassociated Elastic IP address you have?

Trusted Advisor

Which service allows you to transfer large amounts (Petabytes) of data in and out of the AWS cloud?

AWS Snowball

Which service would you use if you need a data warehouse?

AWS RedShift

Which service provides a virtual network dedicated to your AWS account?

VPC

What would you use for having automated backups for an application that has a MySQL database layer?

Amazon Aurora

What would you use to migrate on-premise database to AWS?

AWS Database Migration Service (DMS)

What would you use to check why certain EC2 instances were terminated?

AWS CloudTrail

What would you use for SQL database?

AWS RDS

What would you use for NoSQL database?

AWS DynamoDB

What would you use for adding image and video analysis to your application?

AWS Rekognition

Which service would you use for debugging and improving performance issues with your applications?

AWS X-Ray

Which service is used for sending notifications?

SNS

What would you use for running SQL queries interactively on S3?

AWS Athena

What would you use for preparing and combining data for analytics or ML?

AWS Glue

Which service would you use for monitoring malicious activity and unauthorized behavior in regards to AWS accounts and workloads?

Amazon GuardDuty

Which service would you use to centrally manage billing, access control, compliance, and security across multiple AWS accounts?

AWS Organizations

Which service would you use for web application protection?

AWS WAF

You would like to monitor some of your resources in the different services. Which service would you use for that?

CloudWatch

Which service would you use for performing security assessment?

AWS Inspector

Which service would you use for creating DNS records?

Route 53

What would you use if you need a fully managed document database?

Amazon DocumentDB

Which service would you use to add access control (or sign-up, sign-in forms) to your web/mobile apps?

AWS Cognito

Which service would you use if you need a message queue?

Simple Queue Service (SQS)

Which service would you use if you need managed DDoS protection?

AWS Shield

Which service would you use if you need to store frequently used data for low latency access?

ElastiCache

What would you use to transfer files over long distances between a client and an S3 bucket?

Amazon S3 Transfer Acceleration

Which services are involved in getting a custom string (based on the input) when inserting a URL in the browser?

Lambda - to define a function that gets an input and returns a certain string
API Gateway - to define the URL trigger (= when you insert the URL, the function is invoked).
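
As a rough illustration of the Lambda half of this answer, here is a minimal sketch of a Python handler. It assumes the function sits behind an API Gateway proxy integration, and the `name` query parameter and the greeting text are made up for the example.

```python
import json


def lambda_handler(event, context):
    """Return a custom string built from the request's query string."""
    # With a proxy integration, API Gateway passes query parameters in
    # "queryStringParameters" (None when the URL has no query string).
    params = event.get("queryStringParameters") or {}
    name = params.get("name", "world")  # "name" is a hypothetical parameter

    # The proxy integration expects a status code and a string body.
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "text/plain"},
        "body": f"Hello, {name}!",
    }


if __name__ == "__main__":
    # Local smoke test with a hand-written event; no AWS account needed.
    event = {"queryStringParameters": {"name": "Dana"}}
    print(json.dumps(lambda_handler(event, None)))
```

API Gateway would then map a URL such as `.../hello?name=Dana` to this function, so opening the URL in the browser invokes it.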

Which service would you use for data or events streaming?

Kinesis

DNS (Route 53)

What is Route 53?

AWS Route 53: "Amazon Route 53 is a highly available and scalable cloud Domain Name System (DNS) web service..."

Some of Route 53 features:

Not a feature but its SLA is 100% availability

What does it mean that "Route 53 is an Authoritative DNS"?

The customer can update DNS records

What does each Route 53 record contain?

Domain/subdomain name (e.g. blipblop.com)

Value (e.g. 201.7.202.2)

Record type (e.g. A, AAAA, MX)

TTL: amount of time the record is going to be cached

Routing Policy: how to respond to queries
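
To make the fields above concrete, here is a small sketch that builds the change batch a client would send to create or update a record (for example through boto3's `change_resource_record_sets`). The helper name `build_change_batch` and the default TTL are assumptions for illustration. No routing-policy fields are set, which corresponds to the simple routing policy.

```python
def build_change_batch(name, record_type, value, ttl=300):
    """Build a Route 53 change batch for one simple-routing record."""
    return {
        "Changes": [
            {
                "Action": "UPSERT",  # create the record, or update it if it exists
                "ResourceRecordSet": {
                    "Name": name,            # domain/subdomain name
                    "Type": record_type,     # record type, e.g. A, AAAA, MX
                    "TTL": ttl,              # seconds resolvers may cache the answer
                    "ResourceRecords": [{"Value": value}],  # the record value
                },
            }
        ]
    }


if __name__ == "__main__":
    print(build_change_batch("blipblop.com", "A", "201.7.202.2"))
```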

What DNS record types does Route 53 support?

A

AAAA

CNAME

NS

DS

CAA

SOA

MX

TXT

SPF

SRV

NAPTR

PTR

What are hosted zones?

A container that includes records for defining how to route traffic from a domain and its subdomains

What types of hosted zones are there?

Public Hosted Zones - include records to specify how to route traffic on the internet

Private Hosted Zones - contain records that specify how to route traffic within VPC(s)

What is the difference between CNAME record and an Alias record?

CNAME is used for mapping one hostname to any other hostname, while Alias is used to map a hostname to an AWS resource.

In addition, Alias works for both the root domain (somedomain.com) and non-root domains, while CNAME works only with non-root domains (foo.somedomain.com)

True or False? Alias record can be set up for an EC2 DNS name

False

True or False? Alias record can be set up for a VPC interface endpoint

True

True or False? Alias record is only of type A or AAAA

True

What is a routing policy in regards to AWS Route 53?

A routing policy defines how Route 53 responds to DNS queries.

What Route 53 routing policies are there?

Simple

Geolocation

Failover

Latency based

Geoproximity

Multi-Value Answer

Weighted

Suppose you need to route a certain percentage of your traffic to one instance and the rest of the traffic to another instance. Which routing policy would you choose?

Weighted routing policy.

Suppose you need to route traffic to a single resource with Route 53, without any other requirements. Which routing policy would you choose?

The simple routing policy

Explain the geolocation routing policy

Routing based on user location

Location can be specified by continent, country or US state

It's recommended to have a default record in case there is no match on location
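
The matching behaviour described above can be sketched in plain Python: the most specific location wins, and the default record catches everything else. The function name, the `(level, code)` keys and the example hostnames are hypothetical. This is a simulation of the idea, not Route 53 code.

```python
def resolve_geolocation(records, continent=None, country=None, us_state=None):
    """Pick the record for the most specific location that has a match."""
    # Check from most specific to least specific location.
    candidates = [
        ("us_state", us_state),
        ("country", country),
        ("continent", continent),
    ]
    for key in candidates:
        if key[1] is not None and key in records:
            return records[key]
    # No location matched: fall back to the default record, if one exists.
    return records.get(("default", "*"))


if __name__ == "__main__":
    records = {
        ("country", "FR"): "fr.example.com",
        ("continent", "EU"): "eu.example.com",
        ("default", "*"): "www.example.com",
    }
    print(resolve_geolocation(records, continent="EU", country="DE"))
```

Without the `("default", "*")` entry, a user from an unmatched location would get no answer at all, which is why a default record is recommended.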

What are some use cases for using geolocation routing policy?

Restrict content distribution

App localization

Load balancing

Explain the geoproximity routing policy

Route based on the geographic location of resources

Shifting routing is done based on the bias value

Resources can be of AWS and non-AWS type

For non-AWS resources you have to specify latitude and longitude, while for AWS-based resources you specify the AWS region

To use it, you have to use Route 53 traffic flow

What are some use cases for weighted routing policy?

Load balancing between regions

Testing new application versions

True or False? Route 53 simple routing policy supports both single and multiple values

True.

If Route 53 returns multiple values, the client chooses a single value to use.

True or False? In weighted routing DNS records must have the same name but not the same type

False. They must have the same name AND type.

You would like to use a routing policy that will take latency into account and will route to the resource with the lowest latency. Which routing policy would you use?

Latency-based routing policy.

What happens when you set all records to weight 0 when using Weighted routing policy?

All records are used equally.
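
The weighted answers above come down to simple arithmetic: each record receives its weight divided by the sum of all weights, with the all-zero case treated as equal shares. Here is a small simulation sketch. The function names and the record names "blue" and "green" are made up for the example.

```python
import random


def traffic_shares(weights):
    """Return the fraction of DNS responses each record should receive."""
    total = sum(weights.values())
    if total == 0:
        # All weights are 0: every record is returned with equal probability.
        return {name: 1 / len(weights) for name in weights}
    return {name: weight / total for name, weight in weights.items()}


def pick_record(weights, rng=random):
    """Simulate one DNS answer chosen according to the record weights."""
    shares = traffic_shares(weights)
    names = list(shares)
    return rng.choices(names, weights=[shares[n] for n in names], k=1)[0]


if __name__ == "__main__":
    print(traffic_shares({"blue": 90, "green": 10}))
    print(traffic_shares({"blue": 0, "green": 0}))
    print(pick_record({"blue": 90, "green": 10}))
```

Sending a small share to a new version and raising its weight step by step is the usual way to use this for testing new application versions.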

What are Route 53 health checks used for?

Automated DNS failover based on monitoring:

Another health check

endpoint (app, AWS resource, server)

CloudWatch alarms

You would like to use a routing policy based on the resource location and be able to shift more traffic to some resources. Which one would you use?

Geoproximity routing policy

Explain Route 53 Traffic Flow feature

It's a visual editor for managing complex routing decision trees. It allows you to simplify the process of managing records.

Configuration can be saved (as Traffic Flow Policy) and applied to different domains/hosted zones. In addition, it supports versioning

What are calculated health checks?

When you combine the results of multiple health checks into a single health check.

What is one possible use case for using calculated health checks?

Performing maintenance for a website without causing all the health checks to fail.
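
One way to picture a calculated health check is as a threshold over its child checks. The sketch below is an illustrative model only. The function name, the child check names and the threshold value are assumptions.

```python
def calculated_health(child_results, health_threshold):
    """Report healthy when enough child health checks are healthy.

    child_results maps a child check name to True (healthy) or False.
    health_threshold is the minimum number of healthy children required.
    """
    healthy_children = sum(1 for ok in child_results.values() if ok)
    return healthy_children >= health_threshold


if __name__ == "__main__":
    # One of three web servers is down for maintenance.
    children = {"web-1": True, "web-2": False, "web-3": True}
    print(calculated_health(children, health_threshold=2))
```

With a threshold of 2 out of 3, taking a single server down for maintenance does not flip the combined check to unhealthy.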

You would like to use a routing policy based on the user location. Which one would you use?

Geolocation routing policy. It's based on user location.

Don't confuse it with latency-based routing policy. While shorter distance may result in lower latency, this is not the requirement in the question.

True or False? Route 53 Multi Value is a substitute for those who want a cheaper solution than ELB

False. Route 53 Multi Value is not a substitute for ELB. It's focused on client-side load balancing as opposed to ELB.

True or False? A domain registrar and a DNS service are inherently the same thing

False. The DNS service can be Route 53 (where you manage DNS records) while the domain itself can be purchased from other sources that aren't Amazon related (e.g. GoDaddy).

Monitoring and Logging

What is AWS CloudWatch?

AWS definition: "Amazon CloudWatch is a monitoring and observability service..."

More on CloudWatch here

What is AWS CloudTrail?

AWS definition: "AWS CloudTrail is a service that enables governance, compliance, operational auditing, and risk auditing of your AWS account."

Read more on CloudTrail here

What is Simple Notification Service?

AWS definition: "a highly available, durable, secure, fully managed pub/sub messaging service that enables you to decouple microservices, distributed systems, and serverless applications."

Read more about it here

Explain the following in regards to SNS:

Topics
Subscribers
Publishers

Topics - used for grouping multiple endpoints

Subscribers - the endpoints that topics send messages to

Publishers - the provider of the message (event, person, ...)
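
The relationship between the three terms can be sketched with a tiny in-memory model. This is not the SNS API, only an illustration of the pattern. The `Topic` class and the two example subscribers are hypothetical.

```python
class Topic:
    """A named channel that fans each message out to all its subscribers."""

    def __init__(self, name):
        self.name = name
        self._subscribers = []

    def subscribe(self, endpoint):
        # An endpoint is any callable here; in SNS it could be an email
        # address, an SQS queue, a Lambda function, an HTTP endpoint, ...
        self._subscribers.append(endpoint)

    def publish(self, message):
        # The publisher only knows the topic, never the subscribers.
        for endpoint in self._subscribers:
            endpoint(message)
        return len(self._subscribers)


if __name__ == "__main__":
    inbox, audit_log = [], []
    orders = Topic("orders")
    orders.subscribe(inbox.append)
    orders.subscribe(audit_log.append)
    delivered = orders.publish("order 42 created")
    print(delivered, inbox, audit_log)
```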

Billing and Support

What is "AWS Organizations"?

AWS definition: "AWS Organizations helps you centrally govern your environment as you grow and scale your workloads on AWS." More on Organizations here

What are Service Control Policies and to what service do they belong?

They belong to the AWS Organizations service. The definition by Amazon: "SCPs offer central control over the maximum available permissions for all accounts in your organization, allowing you to ensure your accounts stay within your organization’s access control guidelines."

Learn more here

Explain AWS pricing model

It mainly works on a "pay-as-you-go" basis, meaning you pay only for what you are using and when you are using it.

In S3 you pay for (1) how much data you are storing and (2) the requests you make (PUT, POST, ...).

In EC2 it's based on the purchasing option (on-demand, spot, ...), instance type, AMI type and the region used.

More on AWS pricing model here
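
As a worked example of the S3 part of the pay-as-you-go model, the sketch below adds a storage charge and a request charge. The rates used in the example are placeholders invented for the arithmetic, not real AWS prices. Check the pricing page for current numbers.

```python
from decimal import Decimal


def estimate_s3_monthly_cost(storage_gb, request_count,
                             price_per_gb, price_per_1000_requests):
    """Estimate a monthly S3 bill from storage and request volume.

    Prices are passed in by the caller (as strings or Decimals) because
    they differ by region, storage class and request type.
    """
    storage_cost = Decimal(storage_gb) * Decimal(price_per_gb)
    request_cost = Decimal(request_count) / 1000 * Decimal(price_per_1000_requests)
    return storage_cost + request_cost


if __name__ == "__main__":
    # Hypothetical rates: 0.02 per GB-month, 0.005 per 1,000 requests.
    print(estimate_s3_monthly_cost(100, 20000, "0.02", "0.005"))
```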

How do you estimate AWS costs?

TCO calculator

AWS simple calculator

Cost Explorer

AWS Budgets

Cost Allocation Tags

What does basic support in AWS include?

24x7 customer service

Trusted Advisor

AWS Personal Health Dashboard

How are EC2 instances billed?

What is the AWS Pricing Calculator used for?

What is Amazon Connect?

Amazon definition: "Amazon Connect is an easy to use omnichannel cloud contact center that helps companies provide superior customer service at a lower cost."

Learn more here

What are "APN Consulting Partners"?

Amazon definition: "APN Consulting Partners are professional services firms that help customers of all types and sizes design, architect, build, migrate, and manage their workloads and applications on AWS, accelerating their journey to the cloud."

Learn more here

Which of the following are AWS account (support plan) types, sorted in order?

Basic, Developer, Business, Enterprise
Newbie, Intermediate, Pro, Enterprise
Developer, Basic, Business, Enterprise
Beginner, Pro, Intermediate, Enterprise

Basic, Developer, Business, Enterprise

True or False? Region is a factor when it comes to EC2 costs/pricing

True. You pay differently based on the chosen region.

What is "AWS Infrastructure Event Management"?

AWS Definition: "AWS Infrastructure Event Management is a structured program available to Enterprise Support customers (and Business Support customers for an additional fee) that helps you plan for large-scale events such as product or application launches, infrastructure migrations, and marketing events."

Automation

What is AWS CodeDeploy?

Amazon definition: "AWS CodeDeploy is a fully managed deployment service that automates software deployments to a variety of compute services such as Amazon EC2, AWS Fargate, AWS Lambda, and your on-premises servers."

Learn more here

Explain what is CloudFormation

Misc

Which AWS service you have experience with that you think is not very common?

What is AWS CloudSearch?

What is AWS Lightsail?

AWS definition: "Lightsail is an easy-to-use cloud platform that offers you everything needed to build an application or website, plus a cost-effective, monthly plan."

What is AWS Rekognition?

AWS definition: "Amazon Rekognition makes it easy to add image and video analysis to your applications using proven, highly scalable, deep learning technology that requires no machine learning expertise to use."

Learn more here

What are AWS Resource Groups used for?

Amazon definition: "You can use resource groups to organize your AWS resources. Resource groups make it easier to manage and automate tasks on large numbers of resources at one time. "

Learn more here

What is AWS Global Accelerator?

Amazon definition: "AWS Global Accelerator is a service that improves the availability and performance of your applications with local or global users..."

Learn more here

What is AWS Config?

Amazon definition: "AWS Config is a service that enables you to assess, audit, and evaluate the configurations of your AWS resources."

Learn more here

What is AWS X-Ray?

AWS definition: "AWS X-Ray helps developers analyze and debug production, distributed applications, such as those built using a microservices architecture." Learn more here

What is AWS OpsWorks?

Amazon definition: "AWS OpsWorks is a configuration management service that provides managed instances of Chef and Puppet."

Learn more about it here

What is AWS Snowmobile?

"AWS Snowmobile is an Exabyte-scale data transfer service used to move extremely large amounts of data to AWS."

Learn more here

What is AWS Athena?

"Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL."

Learn more about AWS Athena here

What is Amazon Cloud Directory?

Amazon definition: "Amazon Cloud Directory is a highly available multi-tenant directory-based store in AWS. These directories scale automatically to hundreds of millions of objects as needed for applications."

Learn more here

What is AWS Elastic Beanstalk?

AWS definition: "AWS Elastic Beanstalk is an easy-to-use service for deploying and scaling web applications and services...You can simply upload your code and Elastic Beanstalk automatically handles the deployment"

Learn more about it here

What is AWS SWF?

Amazon definition: "Amazon SWF helps developers build, run, and scale background jobs that have parallel or sequential steps. You can think of Amazon SWF as a fully-managed state tracker and task coordinator in the Cloud."

Learn more on Amazon Simple Workflow Service here

What is AWS EMR?

AWS definition: "big data platform for processing vast amounts of data using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto."

Learn more here

What is AWS Quick Starts?

AWS definition: "Quick Starts are built by AWS solutions architects and partners to help you deploy popular technologies on AWS, based on AWS best practices for security and high availability."

Read more here

What is the Trusted Advisor?

Amazon definition: "AWS Trusted Advisor provides recommendations that help you follow AWS best practices. Trusted Advisor evaluates your account by using checks. These checks identify ways to optimize your AWS infrastructure, improve security and performance, reduce costs, and monitor service quotas."

Learn more here

What is AWS Service Catalog?

Amazon definition: "AWS Service Catalog allows organizations to create and manage catalogs of IT services that are approved for use on AWS."

Learn more here

What is AWS CAF?

Amazon definition: "AWS Professional Services created the AWS Cloud Adoption Framework (AWS CAF) to help organizations design and travel an accelerated path to successful cloud adoption. "

Learn more here

What is AWS Cloud9?

AWS: "AWS Cloud9 is a cloud-based integrated development environment (IDE) that lets you write, run, and debug your code with just a browser"

What is AWS CloudShell?

AWS: "AWS CloudShell is a browser-based shell that makes it easy to securely manage, explore, and interact with your AWS resources."

What is AWS Application Discovery Service?

Amazon definition: "AWS Application Discovery Service helps enterprise customers plan migration projects by gathering information about their on-premises data centers."

Learn more here

What is the AWS Well-Architected Framework and which pillars is it based on?

AWS definition: "The Well-Architected Framework has been developed to help cloud architects build secure, high-performing, resilient, and efficient infrastructure for their applications. Based on five pillars — operational excellence, security, reliability, performance efficiency, and cost optimization"

Learn more here

What AWS services are serverless (or have the option to be serverless)?

AWS Lambda

AWS Athena

What is Simple Queue Service (SQS)?

AWS definition: "Amazon Simple Queue Service (SQS) is a fully managed message queuing service that enables you to decouple and scale microservices, distributed systems, and serverless applications".

Learn more about it here

High Availability

What does high availability mean from the AWS perspective?

Application/Service is running in at least 2 availability zones

Application/Service should survive (= operate as usual) a data center disaster

Production Operations and Migrations

Describe in high-level how to upgrade a system on AWS with (near) zero downtime

One way is through launching a new instance. In more detail:

Launch a new instance

Install all the updates and applications

Test the instance

If all tests passed successfully, you can start using the new instance and perform the switch with the old one, in one of various ways:

Go to Route 53 and update the record with the IP of the new instance

If you are using an Elastic IP then move it to the new instance ...

You try to use a detached EBS volume from us-east-1b in us-east-1a, but it fails. What might be the reason?

EBS volumes are locked to a specific availability zone. To use them in another availability zone, you need to take a snapshot and restore it in the destination availability zone.

When you launch EC2 instances, it takes them time to boot due to commands you run with user data. How to improve instances boot time?

Consider creating a customized AMI with the commands from user data already executed in it. This will allow you to launch instances much faster.

You try to mount EFS on your EC2 instance and it doesn't work (hangs...) What might be a possible reason?

Security group isn't attached to your EFS or it lacks a rule to allow NFS traffic.

How to migrate an EBS volume across availability zones?

Pause the application

Take a snapshot of the EBS volume

Restore the snapshot in another availability zone
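
The snapshot and restore steps above can be sketched as follows. The function takes an EC2 client as an argument (for example the object returned by `boto3.client("ec2")`), so the sketch itself imports nothing. The function name is hypothetical, and pausing the application is left to the caller.

```python
def migrate_volume(ec2, volume_id, target_az):
    """Copy an EBS volume to another availability zone via a snapshot."""
    # 1. Take a snapshot of the source volume.
    snapshot = ec2.create_snapshot(
        VolumeId=volume_id,
        Description=f"Migration of {volume_id}",
    )
    snapshot_id = snapshot["SnapshotId"]

    # 2. Wait until the snapshot is complete before restoring from it.
    ec2.get_waiter("snapshot_completed").wait(SnapshotIds=[snapshot_id])

    # 3. Restore the snapshot as a new volume in the destination AZ.
    new_volume = ec2.create_volume(
        SnapshotId=snapshot_id,
        AvailabilityZone=target_az,
    )
    return new_volume["VolumeId"]
```

A call would look like `migrate_volume(boto3.client("ec2"), "vol-...", "us-east-1a")`. The original volume is left untouched and has to be deleted separately once the new one is verified.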

How to encrypt an unencrypted EBS volume attached to an EC2 instance?

Create EBS snapshot of the volume

Copy the snapshot and mark the "Encrypt" option

Create a new EBS volume out of the encrypted snapshot
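
A minimal sketch of the same three steps in code, again taking an EC2 client (such as `boto3.client("ec2")`) as a parameter. The function name and its arguments are assumptions for illustration. The copy uses the account's default KMS key because no key is specified.

```python
def encrypt_volume(ec2, volume_id, region, availability_zone):
    """Create an encrypted copy of an unencrypted EBS volume."""
    waiter = ec2.get_waiter("snapshot_completed")

    # 1. Snapshot the unencrypted volume.
    snapshot = ec2.create_snapshot(VolumeId=volume_id)
    waiter.wait(SnapshotIds=[snapshot["SnapshotId"]])

    # 2. Copy the snapshot with encryption turned on.
    encrypted = ec2.copy_snapshot(
        SourceSnapshotId=snapshot["SnapshotId"],
        SourceRegion=region,
        Encrypted=True,
    )
    waiter.wait(SnapshotIds=[encrypted["SnapshotId"]])

    # 3. A volume created from an encrypted snapshot is itself encrypted.
    volume = ec2.create_volume(
        SnapshotId=encrypted["SnapshotId"],
        AvailabilityZone=availability_zone,
    )
    return volume["VolumeId"]
```

To finish the job, the new volume still has to be attached to the instance in place of the old one.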

You've created a network load balancer but it doesn't work (you can't reach your app on your EC2 instance). What might be a possible reason?

Missing security group or misconfigured one. For example, if you go to your instances in the AWS console you might see that the instances under your NLB are in "unhealthy status" and if you didn't create a dedicated security group for your NLB, that means that the security group used is the one attached to the EC2 instances.

Go to the security group of your instance(s) and enable the traffic that NLB should forward (e.g. TCP on port 80).

Scenarios

You have a load balancer running and behind it 5 web servers. Users complain that every time they move to a new page, they have to authenticate, instead of doing it once. How can you solve it?

Enable sticky sessions. This way, the user keeps working against the same instance, instead of being redirected to a different instance on every request.

You have a load balancer running and behind it 5 web servers. Users complain that sometimes when they try to use the application it doesn't work. You've found out that sometimes some of the instances crash. How would you deal with it?

One possible way is to use health checks with the load balancer to ensure the instances are ready to be used before forwarding traffic to them.

You run your application on 5 EC2 instances on one AZ and on 10 EC2 instances in another AZ. You distribute traffic between all of them using a network load balancer, but it seems that instances in one AZ have higher CPU rates than the instances in the other AZ. What might be the issue and how to solve it?

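It's possible that traffic is distributed evenly between the AZs, but that doesn't mean it's distributed evenly across all instances. With 5 instances in one AZ and 10 in the other, each instance in the smaller AZ handles twice as much traffic.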

To distribute it evenly between all the instances, you have to enable cross-zone load balancing.
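
The arithmetic behind this scenario can be checked with a short sketch. It assumes each AZ receives an equal share of incoming traffic when cross-zone load balancing is off. The function name and AZ labels are made up for the example.

```python
from fractions import Fraction


def per_instance_share(instances_per_az, cross_zone):
    """Return the share of total traffic one instance gets in each AZ."""
    total_instances = sum(instances_per_az.values())
    shares = {}
    for az, count in instances_per_az.items():
        if cross_zone:
            # Traffic is spread over every registered instance.
            shares[az] = Fraction(1, total_instances)
        else:
            # Each AZ gets an equal slice, split among its own instances.
            shares[az] = Fraction(1, len(instances_per_az)) / count
    return shares


if __name__ == "__main__":
    fleet = {"az-a": 5, "az-b": 10}
    print(per_instance_share(fleet, cross_zone=False))
    print(per_instance_share(fleet, cross_zone=True))
```

Without cross-zone load balancing, an instance in the 5-instance AZ handles 1/10 of the traffic while one in the 10-instance AZ handles 1/20. With it enabled, every instance handles 1/15.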

You are running an ALB that routes traffic using two hostnames: a.b.com and d.e.com. Is it possible to configure HTTPS for both of the hostnames?

Yes, using SNI (Server Name Indication) each application can have its own SSL certificate (this has been supported since 2017).

You have set up read replicas to scale reads but users complain that when they update posts in forums, the posts are not being updated. What may cause this issue?

Read Replicas use asynchronous replication so it's possible users access a read replica instance that wasn't synced yet.

You need persistent shared storage between your containers, some of which are running in Fargate and some in ECS. What would you use?

EFS. It allows us to have persistent multi-AZ shared storage for containers.

You would like to run an AWS Fargate task every time a file is uploaded to a certain S3 bucket. How would you achieve that?

Use Amazon EventBridge so every time a file is uploaded to an S3 bucket (event) it will run an ECS task.

Such task should have an ECS Task Role so it can get the object from the S3 bucket (and possibly other permissions if it needs to update the DB for example).
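
The EventBridge side of this setup is driven by an event pattern. The sketch below builds a pattern that matches object-created events for a single bucket. The function and bucket names are hypothetical, and it assumes EventBridge notifications are enabled on the bucket. The rule's target (the ECS task) is configured separately.

```python
import json


def s3_object_created_pattern(bucket_name):
    """Build an EventBridge event pattern for uploads to one S3 bucket."""
    return {
        "source": ["aws.s3"],
        "detail-type": ["Object Created"],
        "detail": {"bucket": {"name": [bucket_name]}},
    }


if __name__ == "__main__":
    # The JSON string is what you would paste into (or pass to) the rule.
    print(json.dumps(s3_object_created_pattern("my-upload-bucket"), indent=2))
```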

Architecture Design

You've been asked to design an architecture for high performance and low-latency application (millions of requests per second). Which load balancer would you use?

Network Load Balancer

What should you use for scaling reads?

You can use an ElastiCache cluster or RDS Read Replicas.

Misc

What's an ARN?

An ARN (Amazon Resource Name) uniquely identifies an AWS resource. It is used when you need to refer to a resource unambiguously across all of AWS.
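
ARNs follow the general layout `arn:partition:service:region:account-id:resource`. The small parser below is a sketch for illustration. The `Arn` type and `parse_arn` function are not part of any AWS SDK.

```python
from typing import NamedTuple


class Arn(NamedTuple):
    partition: str
    service: str
    region: str
    account_id: str
    resource: str


def parse_arn(arn):
    """Split an ARN string into its components."""
    # The resource part may itself contain colons, so split at most 5 times.
    parts = arn.split(":", 5)
    if len(parts) != 6 or parts[0] != "arn":
        raise ValueError(f"not a valid ARN: {arn!r}")
    return Arn(*parts[1:])


if __name__ == "__main__":
    print(parse_arn("arn:aws:lambda:us-east-1:123456789012:function:my-func"))
    # Region and account ID are empty for S3 bucket ARNs.
    print(parse_arn("arn:aws:s3:::my-bucket"))
```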
