Skip to content

Latest commit

 

History

History
91 lines (74 loc) · 7.54 KB

aws-cloud9-deployment.md

File metadata and controls

91 lines (74 loc) · 7.54 KB

AWS Cloud9 Deployment

This document walks through the deployment of Semantic Search on AWS using AWS Cloud9. Following this guide deploys the semantic search application with default configurations.

Create AWS Cloud9 environment

  1. Open AWS Cloud9 in AWS
  2. Use the Create environment button to create a new AWS Cloud9 IDE.
    1. Give your AWS Cloud9 environment a name, for example semantic-search-deployment.
    2. Select t3.small or m5.large and leave the other configurations as the defaults.
    3. Create the AWS Cloud9 environment.

Open the AWS Cloud9 environment

  1. Go to your AWS Cloud9 Dashboard in AWS
  2. In the list of AWS Cloud9 environments open your AWS Cloud9 environment that you created before. The open button opens your AWS Cloud9 IDE in a new browser tab. Wait for the Cloud9 IDE to connect. This may take a minute.

Semantic Search Infrastructure Code

  1. Execute git clone https://github.com/aws-samples/semantic-search-aws-docs.git in the terminal of your AWS Cloud9 IDE.

Modify AWS Cloud9 EC2 Instance

Once your AWS Cloud9 environment creation completes you will need to increase the underlying EC2 instance storage to be able to pull the container images that the semantic search application deploys.

  1. In the Terminal of your AWS Cloud9 IDE make the resize bash script executable chmod +x ~/environment/semantic-search-aws-docs/cloud9/resize.sh
  2. Increase the Amazon EBS volume size of your AWS Cloud9 environment to 50 GB ~/environment/semantic-search-aws-docs/cloud9/resize.sh 50

AWS CLI Credentials

Terraform needs to access AWS resources from within your AWS Cloud9 environment to deploy the semantic search application. The recommended way to access AWS resources from within the AWS Cloud9 environment is to use AWS managed temporary credentials. For the semantic search deployment the AWS managed temporary credentials do not have sufficient permissions to create an Amazon EC2 instance profile.

  1. In your AWS Cloud9 IDE at the top left click on AWS Cloud9 and then on Preferences.
  2. On the Preferences page navigate to AWS Settings, Credentials, and disable AWS managed temporary credentials.

You need to authenticate directly with the AWS CLI in your AWS Cloud9 IDE. To do so you need to follow one of the AWS CLI Authentication and access credentials methods.

One easy way to setup authentication is by setting environment variables

e.g. similar to the snippet below, but with your own access key and access key.

export AWS_ACCESS_KEY_ID=AKIAIOSFODNN7EXAMPLE
export AWS_SECRET_ACCESS_KEY=wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
export AWS_DEFAULT_REGION=us-east-1 

Once you authenticated with the AWS CLI you can go on to the next step.

GPU or CPU deployment

The default configuration uses GPU instances for the semantic search application. If you want to deploy this solution with GPU acceleration you will need to increase the Running On-Demand G and VT instances EC2 Service quota to at least 8.

To deploy the semantic search application without GPU instance open semantic-search-aws-docs/infrastructure/main.tf in your AWS Cloud9 IDE. Search for Using a CPU instance in the Terraform file. Uncomment the CPU image_id and instance_type and add comments before the GPU image_id and instance_type. The code should now look like the following:

### Using a CPU instance ###
image_id      =  data.aws_ssm_parameter.ami.value #AMI name like amzn2-ami-ecs-hvm-2.0.20220520-x86_64-ebs
instance_type =  "c6i.2xlarge"

### Using a GPU instance ###
#image_id      = data.aws_ssm_parameter.ami_gpu.value #AMI name like amzn2-ami-ecs-gpu-hvm-2.0.20220520-x86_64-ebs
#instance_type = "g4dn.xlarge"

Deploy Semantic Search Infrastructure

  1. In your AWS Cloud9 environment terminal navigate to cd ~/environment/semantic-search-aws-docs/infrastructure.
  2. Set the following environment variables. Change their value if you are using a different region other than us-east-1 or if you want to give the Terraform state Amazon S3 bucket and state sync Amazon DynamoDB table different names.
export REGION=us-east-1
export S3_BUCKET="terraform-semantic-search-state-$(aws sts get-caller-identity --query Account --output text)"
export SYNC_TABLE="terraform-semantic-search-state-sync"
  1. Create the Terraform state bucket in Amazon S3
    aws s3 mb s3://$S3_BUCKET --region=$REGION
  2. Create the Terraform state sync table in Amazon DynamoDB
    aws dynamodb create-table --table-name $SYNC_TABLE --attribute-definitions AttributeName=LockID,AttributeType=S --key-schema AttributeName=LockID,KeyType=HASH --billing-mode PAY_PER_REQUEST --region=$REGION
  3. Initialize Terraform for the infrastructure deployment
    terraform init -backend-config="bucket=$S3_BUCKET" -backend-config="region=$REGION" -backend-config="dynamodb_table=$SYNC_TABLE"
  4. Deploy the Semantic Search infrastructure with Terraform
    terraform apply -var="region=$REGION" -var="index_name=awsdocs"
    • Change the terraform variable index_name if you want to change the name of your index in the Amazon OpenSearch cluster. The search API uses this variable to search for documents.
    • Enter yes when Terraform prompts you "to perform these actions".
    • The deployment will take 10–20 minutes. Wait for completion before moving on with the document ingestion deployment.
  5. You can receive the frontend URL using the following command. It may take some time though until the tasks are in running state
    terraform output loadbalancer_url

Deploy and Run Semantic Search Ingestion

In your AWS Cloud9 environments terminal navigate to
cd ~/environment/semantic-search-aws-docs/ingestion.

Clean up Ingestion

After ingesting your documents you can remove the ingestion resources. Follow the Clean up Ingestion Resources instructions to clean up the ingestion resources.

Clean up Infrastructure

Destroy the resources that were deployed for the infrastructure of the semantic search application if you are not using the application anymore.

  1. In your AWS Cloud9 IDE navigate to the ingestion directory cd ~/environment/semantic-search-aws-docs/infrastructure
  2. Clean up the semantic search application infrastructure with the terraform destroy -var="region=$REGION" command.
    1. Run eval REGION=$(terraform output region) if your REGION variable is not set anymore.
  3. Enter yes when Terraform prompts you "Do you really want to destroy all resources?".