This repository has been archived by the owner on Jun 4, 2024. It is now read-only.
[autoscaler] While running `ray up`, client cannot connect to head node when client node/head node are in the same private subnet.
#1
Labels: bug
What is the problem?
In my setting, the client machine is in the same VPC as the requested instances in the cluster.
When the client machine runs `ray up`, the requirement seems to be that the head node must be in a VPC subnet that has "Auto-assign public IPv4 address" enabled. When the subnet doesn't enable this (i.e., it is a private subnet), the client machine cannot connect to the head node; below is the error message.
Ray version and other system information (Python version, TensorFlow version, OS):
Ray: 1.1.0, Python: 3.7.7
Client machine
cat /etc/os-release
Reproduction (REQUIRED)
Note that the client is in the SAME VPC as the 4 subnets being requested for head_node and worker_node.
Below is my cluster_config.yaml; the only important bit, though, is the 4 `SubnetIds`.
I confirmed that when I change the subnet setting of `head_node`'s specified SubnetIds to "Auto-assign public IPv4 address" (via the AWS VPC Console), the client is able to find the head node and the cluster launch is successful.
Unfortunately, I am working in a private AWS account, so to reproduce, I suggest using SubnetIds that match your own AWS account.
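For anyone reproducing this, here is a minimal sketch of how one might check which of the configured subnets have "Auto-assign public IPv4 address" turned off. It is not part of the original report: it assumes `boto3` is installed and AWS credentials are configured, and the helper names `subnets_without_public_ip` and `check_subnets` are hypothetical. It relies on the `MapPublicIpOnLaunch` field that EC2 `DescribeSubnets` returns for each subnet.

```python
from typing import List


def subnets_without_public_ip(response: dict) -> List[str]:
    """Return IDs of subnets whose auto-assign public IPv4 setting is off.

    `response` is expected to have the shape returned by EC2 DescribeSubnets.
    """
    return [
        subnet["SubnetId"]
        for subnet in response.get("Subnets", [])
        if not subnet.get("MapPublicIpOnLaunch", False)
    ]


def check_subnets(subnet_ids: List[str], region: str) -> List[str]:
    """Query EC2 for the given subnets and report the private-only ones."""
    # boto3 is imported lazily so the pure helper above works without it.
    import boto3

    ec2 = boto3.client("ec2", region_name=region)
    response = ec2.describe_subnets(SubnetIds=subnet_ids)
    return subnets_without_public_ip(response)
```

Calling `check_subnets([...], region=...)` with the SubnetIds and region from your own cluster config should list the subnets where the head node would come up without a public IP, which is the condition under which I see the connection failure.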