...
| old partition name (CentOS 7) | new partition name (Rocky Linux 9) | current job limits |
|---|---|---|
| ● | ● | 512 nodes, 12 h wall time |
| ● | ● | 32 nodes, 1 h wall time |
| ● | ● | |
| ● | ● | |
| ● | | |
| ● | | |
| ● | ● | |

( ● available, ● closed/not available yet )
...
For users of SLURM's `srun` job launcher:
Open MPI 5.x has dropped support for the PMI-2 API; it relies solely on PMIx to bootstrap MPI processes. For this reason the environment setting was changed from `SLURM_MPI_TYPE=pmi2` to `SLURM_MPI_TYPE=pmix`, so binaries linked against Open MPI can be started as usual "out of the box" using `srun mybinary`. Binaries linked against Intel-MPI also work this way, provided a recent version (≥ 2021.11) of Intel-MPI was used. If an older version of Intel-MPI was used and relinking/recompiling is not possible, one can follow the workaround for PMI-2 with `srun` as described in the Q&A section below. Switching from `srun` to `mpirun` instead should also be considered.
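For illustration, here is a minimal sketch of a batch script launching an Open MPI binary with `srun` under the new PMIx setting; the node count, partition name, module name, and binary name are placeholders, not values taken from this system.

```bash
#!/bin/bash
#SBATCH --nodes=2                # hypothetical node count
#SBATCH --ntasks-per-node=96     # one MPI rank per physical core
#SBATCH --time=01:00:00
#SBATCH --partition=<partition>  # placeholder partition name

# SLURM_MPI_TYPE=pmix is already set in the environment (see above);
# exporting it here is redundant and shown only for clarity.
export SLURM_MPI_TYPE=pmix

module load openmpi              # placeholder module name

srun ./mybinary                  # srun bootstraps the MPI ranks via PMIx
```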
Using more processes per node than available physical cores (PPN > 96; hyperthreads) with the OPX provider, i.e. when defining `FI_PROVIDER=opx`:
The OPX provider currently does not support the use of hyperthreads (PPN > 96) on the clx partitions. Doing so may result in segmentation faults in libfabric during process startup. If a high number of PPN is really required, the libfabric provider has to be changed back to PSM2 by re-defining `FI_PROVIDER=psm2` (which is the default setting). Note that using hyperthreads may not be advisable; we encourage users to test performance before using more threads than available physical cores.
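As a sketch, a job step that really needs PPN > 96 could switch the provider back to PSM2 before launching; the value of 192 tasks per node (2 hyperthreads × 96 physical cores) is an assumption for this example.

```bash
# Fall back to the PSM2 provider (the default) when oversubscribing
# physical cores with hyperthreads; OPX may segfault during startup.
export FI_PROVIDER=psm2

# 192 tasks per node = 2 hyperthreads x 96 physical cores (assumed here)
srun --ntasks-per-node=192 ./mybinary
```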
Note that Open MPI's `mpirun`/`mpiexec` defaults to using all hyperthreads if a Slurm job/allocation is used that does not explicitly set `--ntasks-per-node` (or similar options).
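As a sketch, explicitly requesting one task per physical core in the allocation avoids this default; the core count of 96 follows the PPN > 96 discussion above, and the node count, wall time, and binary name are placeholders.

```bash
#!/bin/bash
#SBATCH --nodes=2
#SBATCH --ntasks-per-node=96   # one task per physical core, no hyperthreads
#SBATCH --time=00:30:00

# With --ntasks-per-node set in the allocation, mpirun starts 96 ranks
# per node instead of spreading ranks across all hyperthreads.
mpirun ./mybinary
```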
...