Skip to content

Releases: aws/aws-parallelcluster-node

AWS ParallelCluster v3.9.1

11 Apr 10:42
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.9.1

This is associated with AWS ParallelCluster v3.9.1

CHANGES

  • There were no changes for this version.

AWS ParallelCluster v3.9.0

12 Mar 01:29
21bf5e3
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.9.0

This is associated with AWS ParallelCluster v3.9.0

ENHANCEMENTS

  • Add a clustermgtd config option ec2_instance_missing_max_count to allow a configurable amount of retries for eventual EC2
    describe instances consistency with run instances

AWS ParallelCluster v3.8.0

19 Dec 17:40
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.8.0

This is associated with AWS ParallelCluster v3.8.0

ENHANCEMENTS

  • Add support for EC2 Capacity Blocks for ML.

CHANGES

  • Perform job-level scaling by default for all jobs, using information in the SLURM_RESUME_FILE. Job-level scaling
    can be disabled using new job_level_scaling resume configuration parameter.
  • Remove support of all_or_nothing_batch configuration parameter in the Slurm resume program, in favor of the new Scheduling/ScalingStrategy cluster configuration.

AWS ParallelCluster v3.7.2

13 Oct 19:37
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.7.2

This is associated with AWS ParallelCluster v3.7.2

CHANGES

  • There were no changes for this version.

AWS ParallelCluster v3.7.1

22 Sep 20:16
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.7.1

This is associated with AWS ParallelCluster v3.7.1

CHANGES

  • There were no changes for this version.

AWS ParallelCluster v3.7.0

30 Aug 12:11
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.7.0

This is associated with AWS ParallelCluster v3.7.0

CHANGES

  • Perform default job-level scaling for exclusive jobs, by reading job information from SLURM_RESUME_FILE.
  • Make aws-parallelcluster-node daemons handle only ParallelCluster-managed Slurm partitions.

BUG FIXES

  • Fix an issue that was causing misalignment of compute nodes DNS name on instances with multiple network interfaces,
    when using SlurmSettings/Dns/UseEc2Hostnames equals to True.

AWS ParallelCluster v3.6.1

05 Jul 14:22
e52e25b
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.6.1

This is associated with AWS ParallelCluster v3.6.1

CHANGES

  • Avoid duplication of nodes seen by ClusterManager if compute nodes are added to multiple Slurm partitions.

BUG FIXES

  • Fix fast insufficient capacity fail-over logic when using Multiple Instance Types and no instances are returned.

AWS ParallelCluster v3.6.0

22 May 15:51
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.6.0

This is associated with AWS ParallelCluster v3.6.0

CHANGES

  • Consider dynamic nodes failing Slurm registration, identified by INVALID_REG flag, as bootstrap failure towards the Slurm protected mode.
    Static nodes failing the Slurm registration are already treated as a bootstrap failure after the node_replacement_timeout.

BUG FIXES

  • Fix an issue that was causing misalignment of compute nodes IP on instances with multiple network interfaces.

AWS ParallelCluster v3.5.1

28 Mar 20:13
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.5.1

This is associated with AWS ParallelCluster v3.5.1

BUG FIXES

  • Fix for compute_console_output log file being truncated at every clustermgtd iteration.

AWS ParallelCluster v3.5.0

20 Feb 11:50
Compare
Choose a tag to compare

We're excited to announce the release of AWS ParallelCluster Node 3.5.0

This is associated with AWS ParallelCluster v3.5.0

ENHANCEMENTS

  • Add logging of compute node console output to CloudWatch from head node on compute node bootstrap failure.
  • Add validators to prevent malicious string injection while calling the subprocess module.

BUG FIXES

  • Fix an issue in clustermgtd that caused compute nodes rebooted via Slurm to be replaced if the EC2 instance status checks fail.