Skip to content

iamfet/aws-autoscaling

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Create a load-balanced web server with auto scaling

Overview

This tutorial walks you through the process of creating a web server which is managed by an auto scaling group. The auto scaling group uses a launch template to create the servers. The auto scaling group launches the servers into a target group. An internet-facing load balancer directs traffic to the target group.

Errors or corrections? Contact [email protected].


Architecture Diagram

Create a VPC

A VPC is an isolated, private network you can create to run your workloads. You have complete control over your VPC when you create one.

  1. Click on Create VPC from the VPC Dashboard to create a new VPC
  2. Select VPC and more
  3. Enter a name of your choice under Auto-generate
  4. Choose a 10.0.0.0/16 IPV4 Cidr block
  5. Number of Availability Zones (AZs) = "2"
  6. Number of public subnets = "2"
  7. NAT gateways ($) = "In 1 AZ"
  8. Optional: VPC endpoints = "S3 Gateway"
  9. Leave all other settings as default
  10. Click Create VPC

Your configuration should match what is shown below:

Create a Launch Template

The launch template will serve as the blueprint for creating the exact type of server we need to meet our web server demands. A launch template can be modified to create new versions when you need to change a config.

  1. Click on Create launch template from the EC2 console to create a new launch template
  2. Launch template name - required = "autoscale-webserver"
  3. Check the Provide guidance to help me set up a template that I can use with EC2 Auto Scaling box
  4. Under Application and OS Images (Amazon Machine Image) - required, choose Amazon Linux 2 AMI (HVM) - Kernel 5.10, SSD Volume Type
  5. Instance type = "t2.micro"
  6. Key pair - Create a new one or use existing key pair
  7. Subnet - Don't include in launch template
  8. Create security group = "autoscale-webserver-sg"
  9. Allow SSH and HTTP traffic from 0.0.0.0/0 (Ignore the warning about security group. We will edit it later)
  10. VPC - Select the VPC you created
  11. Under Advanced network configuration, choose "Enable" under Auto-assign public IP
  12. Under Storage, leave all other configuration as default and choose "gp3" for Volume type
  13. Resource tags: Key: Name, Value: autoscale-webserver
  14. Under Advanced details, scroll down to the User data section and enter the following lines of code exactly as shown
#!/bin/bash -ex
sudo su
yum -y update
yum install httpd -y
systemctl start httpd
systemctl enable httpd
systemctl status httpd
echo "<html>Hello World, welcome to my server</html>" > /var/www/html/index.html
systemctl restart httpd
amazon-linux-extras install epel -y
yum install stress -y

Your configuration should look like this:

Create Target Group

A target group will route requests to the web servers we create. Our load balancer will need this target group to know what set of servers to distribute traffic to. Our auto scaling group will also be associated with this target group so it launches our servers into the target group.

  1. Click on Create target group from the EC2 console to create a target group
  2. Choose a target type: "Instances"
  3. Target group name: "autoscale-webserver"
  4. Protocol: "HTTP"
  5. Port: "80"
  6. VPC: Select the VPC you created
  7. Leave every other value on this page as default. Next
  8. Register Targets: Leave as is.
  9. Click Create target group

Create Load Balancer

An application load balancer acts as the entry point for traffic to our webservers. Instead of allowing users to access our application directly, we will use the load balancer to distribute traffic equally among our autoscaling group of web servers. This is better for load management, security and reliability of our application.

  1. Click on Create load balancer from the EC2 console to create a load balancer
  2. Type: "Application Load Balancer"
  3. Scheme: "Internet-facing"
  4. IP address type: "IPV4"
  5. VPC: Select the VPC you created
  6. Mappings: Check the box beside the two AZs listed
  7. Subnet: For each AZ selected, choose the public subnet in the dropdown menu
  8. At this point, go to the Security groups console and create a new security group for the load balancer. The inbound rule should allow HTTP traffic from anywhere.
  9. Select this security group as the load balancer security group
  10. Listeners and routing: Leave protocol and port as HTTP:80. Select the target group you created as target group
  11. Leave every other config as default and click Create load balancer

Create Auto Scaling Group

The auto scaling group configures and controls how your application scales automatically in response to varying traffic situations.

  1. Click on Create Auto Scaling group from the EC2 console to create an auto scaling group
  2. Enter a name
  3. Choose the launch template you created. Click Next
  4. Select your webserver VPC created from the VPC step
  5. Under Availability Zones and subnets, select the two public subnets in your VPC, in different AZs. Click Next

NB: Note that you can use the auto scaling group to override your instance type config from the launch template

  1. Under Load balancing, choose the option "Attach to an existing load balancer"
  2. Select Choose from your load balancer target groups
  3. Select the target group you created
  4. Select VPC Lattice service to attach: "No VPC Lattice service"
  5. Additional health check types - optional: "Turn on Elastic Load Balancing health checks"
  6. Leave every other config as default. Next
  7. Group size: Desired: "2", Minimum: "1", Maximum: "4"
  8. Scaling policies: "Target Tracking Policy"
  9. Metric type: "Average CPU Utilization"
  10. Target Value: "50%"
  11. Add notifications - optional (Skipped)
  12. Add tags - optional (Skipped)
  13. Create Auto Scaling Group

Check your configuration below:

Immediately you create your autoscaling group, you should see two new instances getting created in the EC2 console. This is because we specified a desired count of 2. Also note that they are automatically placed one in each AZ to support high availability.

Test your web server

Click on one of the web servers, copy the public IP or DNS name and paste it in your browser. You should see the following content:

This means your apache web server is running and you are able to reach it from the internet.

Restrict web traffic to servers

With the current design, users are directly accessing our web server. We don't want that. That is why we created a load balancer. To restrict incoming HTTP traffic destined for our servers to only the load balancer, we need to update the web servers' security group to accept HTTP traffic from only our application load balancer. This means no request gets to our servers without first making it through the load balancer.

Edit web server security group

  1. Go to the autoscale-webserver-sg security group and click on "Edit inbound rules".
  2. Delete the existing HTTP rule.
  3. Add a new HTTP rule. In the Source box, scroll down to select the security group of the load balancer. Save rules.
  4. You have successfully restricted traffic going to the servers to the load balancer.
  5. You should no longer be able to access your web server using the server IPs or DNS names. You should now be able to use the load balancer DNS name to access the servers. Test this out.

Live Autoscaling Test

We will now simulate a scenario of high CPU usage on our web server to allow the auto scaling group to respond.

We will SSH to our server and run a command to stress the server and this will raise CPU usage across our auto scaling group above 50%. This will make it respond by adding new servers until our maximum number of servers specified is reached.

Observe your EC2 console after running this command and you will see new instances being added by the auto scaling group to handle the simulated surge in traffic.

If you stopped or terminated the instance on which CPU usage has been simulated, instances will get terminated from your auto scaling group accordingly (scale-in).

Conclusion

You have built a load-balanced and highly available web application that auto scales out based on a target of CPU utilization.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published