
Releases: GoogleCloudPlatform/cluster-toolkit

v1.39.0: Slurm reservations during maintenance windows, Improved GKE Support, removed CentOS 7 references

12 Sep 19:38
7699f5d

What's Changed

Key New Features 🎉

Module Improvements 🔨

Improvements 🛠

Bug fixes 🐞

  • Add slurmgcp-managed infix to resource policy name by @mr0re1 in #2892
  • Move pytest and other package installation to make by @annuay-google in #2890
  • Prevent use of google provider 6.0 where breaking changes are in use by @tpdownes in #2978
  • Fix local_ssd_config issue that forces node-pool recreation by @sharabiani in #2968
  • kubernetes provider added to gke-cluster module by @sharabiani in #2985
  • Fix for cleanup script. The last input is optional by @cdunbar13 in #2993
  • Catch "None" fields in slurm job datetime data for BigQuery by @fdmalone in #2992

Other changes

New Contributors

Full Changelog: v1.38.0...v1.39.0

v1.38.0: Slurm GCP v6 for a3-highgpu-8g and added ability to disable automatic updates

15 Aug 23:20
1e38ce0

What's Changed

Key New Features 🎉

New Modules 🧱

Module Improvements 🔨

Improvements 🛠

Deprecations 💤

Version Updates ⏫

Bug fixes 🐞

New Contributors

Full Changelog: v1.37.2...v1.38.0

v1.37.2 Fix SlurmGCP cleanup of resource policies

09 Aug 21:23
229803f

What's Changed

Bug fixes 🐞

  • Delete at most one resource policy at a time by @mr0re1 in #2895

Full Changelog: v1.37.1...v1.37.2

v1.37.1: Documentation update

02 Aug 18:13
9e68ecc
Compare
Choose a tag to compare

Fix minor typographical errors in documentation

Full Changelog: v1.37.0...v1.37.1

v1.37.0

31 Jul 21:14
54da9b7

The HPC Toolkit has been rebranded to Cluster Toolkit. More details will follow shortly. The GitHub repository has been renamed to match. This should not break existing workflows: references to the old name should redirect seamlessly to the updated repo. The binary has been renamed to gcluster (formerly ghpc), but ghpc has been symlinked and will continue to work. If you notice any unexpected behavior as part of this transition, please report it.

What's Changed

Key New Features 🎉

Other changes

Full Changelog: v1.36.1...v1.37.0

v1.36.1: Fix Slurm GCP Cloud Parameter Defaults

26 Jul 22:45
493308e

What's Changed

Bug fixes 🐞

Full Changelog: v1.36.0...v1.36.1

v1.36.0 - Parallelstore support

19 Jul 16:59
da56862

What's Changed

Key New Features 🎉

  • Add support for parallelstore in pre-existing-network-storage by @harshthakkar01 in #2701
  • Develop and adopt boot-time fix for EOL CentOS 7 repositories by @tpdownes in #2738

New Modules 🧱

Module Improvements 🔨

  • Add 'source' argument for path to prolog or epilog scripts by @andybubu in #2670
  • Allow users to turn on access to cluster via GCP public IP address space by @ankitkinra in #2687
  • Add known gpu types and their accelerators to gke module by @ankitkinra in #2680
  • Add disk_type for HTCondor's EP template by @aneo-ssam in #2705

Improvements 🛠

  • Update A3 mega blueprint to use Slurm-GCP 6.5.12 by @tpdownes in #2763

Bug fixes 🐞

  • Revert "Remove installation of enroot and pyxis from a3-highgpu-8g blueprint" by @samskillman in #2722
  • Only enable gpu taints if guest_accelerator list is not empty by @ankitkinra in #2727
  • Move GCESysPrep to provisioner in Windows scripts by @tpdownes in #2728
  • Modify a3-highgpu-8g image-building blueprint network by @tpdownes in #2744
  • Update image to new centos image for both login and builder nodes by @ankitkinra in #2780

Other changes

  • Add validator for Terraform version and SlurmGCP6 by @mr0re1 in #2772

New Contributors

Full Changelog: v1.35.1...v1.36.0

v1.35.1: Fix SlurmGCP prolog/epilog scripts bug

26 Jun 23:51
dbe05ee

v1.35.0: Shared reservations, TF provider configuration, and targeted group deployment

20 Jun 23:35
eaeacfb

What's Changed

Key New Features 🎉

  • Ability to configure the Terraform provider in blueprint by @cdunbar13 in #2635
  • Add --skip and --only to deploy and destroy commands by @mr0re1 in #2658
  • Add support for shared reservations by @mr0re1 in #2640

New Modules 🧱

Module Improvements 🔨

Improvements 🛠

  • Add topologically-aware NCCL tests solution to A3 Mega by @tpdownes in #2657

Deprecations 💤

  • SlurmGCP V6 remove support for custom instance templates by @mr0re1 in #2664
  • SlurmGCP V6 remove support for custom instance templates by @mr0re1 in #2667

Bug fixes 🐞

New Contributors

Full Changelog: v1.34.3...v1.35.0

v1.34.3 Documentation update

10 Jun 19:21
627b43a

What's Changed

Other changes

  • Add link to newly published A3 Mega documentation by @tpdownes in #2677

Full Changelog: v1.34.2...v1.34.3