This document walks through the deployment of Semantic Search on AWS using AWS Cloud9. Following this guide deploys the semantic search application with default configurations.
- Open AWS Cloud9 in AWS
- Use the Create environment button to create a new AWS Cloud9 IDE.
- Give your AWS Cloud9 environment a name, for example
semantic-search-deployment
. - Select t3.small or m5.large and leave the other configurations as the defaults.
- Create the AWS Cloud9 environment.
- Give your AWS Cloud9 environment a name, for example
- Go to your AWS Cloud9 Dashboard in AWS
- In the list of AWS Cloud9 environments open your AWS Cloud9 environment that you created before. The open button opens your AWS Cloud9 IDE in a new browser tab. Wait for the Cloud9 IDE to connect. This may take a minute.
- Execute
git clone https://github.com/aws-samples/semantic-search-aws-docs.git
in the terminal of your AWS Cloud9 IDE.
Once your AWS Cloud9 environment creation completes you will need to increase the underlying EC2 instance storage to be able to pull the container images that the semantic search application deploys.
- In the Terminal of your AWS Cloud9 IDE make the resize bash script executable
chmod +x ~/environment/semantic-search-aws-docs/cloud9/resize.sh
- Increase the Amazon EBS volume size of your AWS Cloud9 environment to 50 GB
~/environment/semantic-search-aws-docs/cloud9/resize.sh 50
Terraform needs to access AWS resources from within your AWS Cloud9 environment to deploy the semantic search application. The recommended way to access AWS resources from within the AWS Cloud9 environment is to use AWS managed temporary credentials. For the semantic search deployment the AWS managed temporary credentials do not have sufficient permissions to create an Amazon EC2 instance profile.
- In your AWS Cloud9 IDE at the top left click on AWS Cloud9 and then on Preferences.
- On the Preferences page navigate to AWS Settings, Credentials, and disable AWS managed temporary credentials.
You need to authenticate directly with the AWS CLI in your AWS Cloud9 IDE. To do so you need to follow one of the AWS CLI Authentication and access credentials methods.
One easy way to setup authentication is by setting environment variables
e.g. similar to the snippet below, but with your own access key and access key.
export AWS_ACCESS_KEY_ID=AKIAIOSFODNN7EXAMPLE
export AWS_SECRET_ACCESS_KEY=wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
export AWS_DEFAULT_REGION=us-east-1
Once you authenticated with the AWS CLI you can go on to the next step.
The default configuration uses GPU instances for the semantic search application. If you want to deploy this solution with GPU acceleration you will need to increase the Running On-Demand G and VT instances EC2 Service quota to at least 8.
To deploy the semantic search application without GPU instance open semantic-search-aws-docs/infrastructure/main.tf
in your AWS Cloud9 IDE. Search for Using a CPU instance in the Terraform file. Uncomment the CPU image_id
and instance_type
and add comments before the GPU image_id
and instance_type
. The code should now look like the following:
### Using a CPU instance ###
image_id = data.aws_ssm_parameter.ami.value #AMI name like amzn2-ami-ecs-hvm-2.0.20220520-x86_64-ebs
instance_type = "c6i.2xlarge"
### Using a GPU instance ###
#image_id = data.aws_ssm_parameter.ami_gpu.value #AMI name like amzn2-ami-ecs-gpu-hvm-2.0.20220520-x86_64-ebs
#instance_type = "g4dn.xlarge"
- In your AWS Cloud9 environment terminal navigate to
cd ~/environment/semantic-search-aws-docs/infrastructure
. - Set the following environment variables. Change their value if you are using a different region other than
us-east-1
or if you want to give the Terraform state Amazon S3 bucket and state sync Amazon DynamoDB table different names.
export REGION=us-east-1
export S3_BUCKET="terraform-semantic-search-state-$(aws sts get-caller-identity --query Account --output text)"
export SYNC_TABLE="terraform-semantic-search-state-sync"
- Create the Terraform state bucket in Amazon S3
aws s3 mb s3://$S3_BUCKET --region=$REGION
- Create the Terraform state sync table in Amazon DynamoDB
aws dynamodb create-table --table-name $SYNC_TABLE --attribute-definitions AttributeName=LockID,AttributeType=S --key-schema AttributeName=LockID,KeyType=HASH --billing-mode PAY_PER_REQUEST --region=$REGION
- Initialize Terraform for the infrastructure deployment
terraform init -backend-config="bucket=$S3_BUCKET" -backend-config="region=$REGION" -backend-config="dynamodb_table=$SYNC_TABLE"
- Deploy the Semantic Search infrastructure with Terraform
terraform apply -var="region=$REGION" -var="index_name=awsdocs"
- Change the terraform variable
index_name
if you want to change the name of your index in the Amazon OpenSearch cluster. The search API uses this variable to search for documents. - Enter
yes
when Terraform prompts you "to perform these actions". - The deployment will take 10–20 minutes. Wait for completion before moving on with the document ingestion deployment.
- Change the terraform variable
- You can receive the frontend URL using the following command. It may take some time though until the tasks are in running state
terraform output loadbalancer_url
In your AWS Cloud9 environments terminal navigate to
cd ~/environment/semantic-search-aws-docs/ingestion
.
- If you want to ingest the AWS documentation follow the
Ingest AWS Documentation instructions. - If you want to ingest your documents from a URL (for example from Amazon S3) follow the
Ingest Documents from URL instructions. - If instead you like to make local documents searchable follow the
Local Documents Ingestion instructions.
After ingesting your documents you can remove the ingestion resources. Follow the Clean up Ingestion Resources instructions to clean up the ingestion resources.
Destroy the resources that were deployed for the infrastructure of the semantic search application if you are not using the application anymore.
- In your AWS Cloud9 IDE navigate to the ingestion directory
cd ~/environment/semantic-search-aws-docs/infrastructure
- Clean up the semantic search application infrastructure with the
terraform destroy -var="region=$REGION"
command.- Run
eval REGION=$(terraform output region)
if yourREGION
variable is not set anymore.
- Run
- Enter
yes
when Terraform prompts you "Do you really want to destroy all resources?".