-
Notifications
You must be signed in to change notification settings - Fork 22
Open
Description
It would be great to add something that can sanity check the bot configuration. For example, when creating a cluster with Magic Castle it would be quite easy to attach the wrong type of instance to a partition name (and it would not be easy to notice, the Zen2 nodes are called Standard_HB120-16rs_v2 and the Zen4 nodes are Standard_HB176-24rs_v4 so no obvious hints in the names).
One way to verify the architecture is
[ocaisa@login1 ~]$ srun -n 1 -p x86-64-amd-zen2-node /cvmfs/software.eessi.io/versions/2023.06/init/eessi_archdetect.sh cpupath
x86_64/amd/zen2
[ocaisa@login1 ~]$ srun -n 1 -p x86-64-amd-zen4-node /cvmfs/software.eessi.io/versions/2023.06/init/eessi_archdetect.sh cpupath
x86_64/amd/zen4
[ocaisa@login1 ~]$ srun -n 1 -p aarch64-neoverse-n1-node /cvmfs/software.eessi.io/versions/2023.06/init/eessi_archdetect.sh cpupath
aarch64/neoverse_n1This would also check that the bot can successfully submit jobs.
You'd also want to verify that the bot can talk to the target repository with the correct permissions.
Reactions are currently unavailable