This repository has been archived by the owner on Jun 4, 2024. It is now read-only.
[autoscaler] Improve experience when EC2 does not have capacity for worker nodes #10
Labels: enhancement (New feature or request)
Hello -
After I spun up the cluster with `ray up my_cluster.yaml`, my workload wasn't being handled well by the Ray cluster. I tried `ray monitor my_cluster.yaml` and found that the logs were flooded with the messages below:

So it looks like the requested instance type isn't available from EC2, and Ray isn't able to spin up the desired worker nodes. This means I need to run `ray down`, modify `my_cluster.yaml`, then retry with a different instance type.

I was wondering if we can improve this experience. Perhaps check the EC2 instance capacity for at least the minimum number of workers before telling the user that the cluster is launched? Or perhaps let the user specify a list of instance types that they're OK with?
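To make the second idea concrete, here is a rough sketch of what an ordered fallback over instance types might look like. It is not Ray's autoscaler code: `launch_with_fallback`, `InsufficientCapacity`, and `fake_launch` are hypothetical names made up for illustration, and the simulated launcher stands in for a real EC2 call (where, as far as I know, the corresponding error code is `InsufficientInstanceCapacity`).

```python
from typing import Callable, Optional, Sequence


class InsufficientCapacity(Exception):
    """Hypothetical stand-in for an EC2 capacity error."""


def launch_with_fallback(
    instance_types: Sequence[str],
    min_workers: int,
    launch: Callable[[str, int], None],
) -> Optional[str]:
    """Try each acceptable instance type in order.

    Returns the first type for which `launch` succeeds,
    or None if every type is out of capacity.
    """
    for instance_type in instance_types:
        try:
            launch(instance_type, min_workers)
        except InsufficientCapacity:
            # No capacity for this type; move on to the next one.
            continue
        return instance_type
    return None


def fake_launch(instance_type: str, count: int) -> None:
    # Simulated provider (assumption): only m5.large has capacity.
    if instance_type != "m5.large":
        raise InsufficientCapacity(instance_type)


chosen = launch_with_fallback(["p3.2xlarge", "m5.large"], 2, fake_launch)
print(chosen)
```

With something like this, the launcher could report which instance type it ended up using, or fail early with a clear message when none of the listed types has capacity for the minimum number of workers.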