Releases: aws/aws-parallelcluster-node
AWS ParallelCluster v3.9.1
We're excited to announce the release of AWS ParallelCluster Node 3.9.1
This is associated with AWS ParallelCluster v3.9.1
CHANGES
- There were no changes for this version.
AWS ParallelCluster v3.9.0
We're excited to announce the release of AWS ParallelCluster Node 3.9.0
This is associated with AWS ParallelCluster v3.9.0
ENHANCEMENTS
- Add a clustermgtd config option
ec2_instance_missing_max_count
to allow a configurable amount of retries for eventual EC2
describe instances consistency with run instances
AWS ParallelCluster v3.8.0
We're excited to announce the release of AWS ParallelCluster Node 3.8.0
This is associated with AWS ParallelCluster v3.8.0
ENHANCEMENTS
- Add support for EC2 Capacity Blocks for ML.
CHANGES
- Perform job-level scaling by default for all jobs, using information in the
SLURM_RESUME_FILE
. Job-level scaling
can be disabled using newjob_level_scaling
resume configuration parameter. - Remove support of
all_or_nothing_batch
configuration parameter in the Slurm resume program, in favor of the newScheduling/ScalingStrategy
cluster configuration.
AWS ParallelCluster v3.7.2
We're excited to announce the release of AWS ParallelCluster Node 3.7.2
This is associated with AWS ParallelCluster v3.7.2
CHANGES
- There were no changes for this version.
AWS ParallelCluster v3.7.1
We're excited to announce the release of AWS ParallelCluster Node 3.7.1
This is associated with AWS ParallelCluster v3.7.1
CHANGES
- There were no changes for this version.
AWS ParallelCluster v3.7.0
We're excited to announce the release of AWS ParallelCluster Node 3.7.0
This is associated with AWS ParallelCluster v3.7.0
CHANGES
- Perform default job-level scaling for exclusive jobs, by reading job information from SLURM_RESUME_FILE.
- Make aws-parallelcluster-node daemons handle only ParallelCluster-managed Slurm partitions.
BUG FIXES
- Fix an issue that was causing misalignment of compute nodes DNS name on instances with multiple network interfaces,
when usingSlurmSettings/Dns/UseEc2Hostnames
equals toTrue
.
AWS ParallelCluster v3.6.1
We're excited to announce the release of AWS ParallelCluster Node 3.6.1
This is associated with AWS ParallelCluster v3.6.1
CHANGES
- Avoid duplication of nodes seen by ClusterManager if compute nodes are added to multiple Slurm partitions.
BUG FIXES
- Fix fast insufficient capacity fail-over logic when using Multiple Instance Types and no instances are returned.
AWS ParallelCluster v3.6.0
We're excited to announce the release of AWS ParallelCluster Node 3.6.0
This is associated with AWS ParallelCluster v3.6.0
CHANGES
- Consider dynamic nodes failing Slurm registration, identified by
INVALID_REG
flag, as bootstrap failure towards the Slurm protected mode.
Static nodes failing the Slurm registration are already treated as a bootstrap failure after thenode_replacement_timeout
.
BUG FIXES
- Fix an issue that was causing misalignment of compute nodes IP on instances with multiple network interfaces.
AWS ParallelCluster v3.5.1
We're excited to announce the release of AWS ParallelCluster Node 3.5.1
This is associated with AWS ParallelCluster v3.5.1
BUG FIXES
- Fix for compute_console_output log file being truncated at every clustermgtd iteration.
AWS ParallelCluster v3.5.0
We're excited to announce the release of AWS ParallelCluster Node 3.5.0
This is associated with AWS ParallelCluster v3.5.0
ENHANCEMENTS
- Add logging of compute node console output to CloudWatch from head node on compute node bootstrap failure.
- Add validators to prevent malicious string injection while calling the subprocess module.
BUG FIXES
- Fix an issue in clustermgtd that caused compute nodes rebooted via Slurm to be replaced if the EC2 instance status checks fail.