The compute nodes of the CPU cluster of system Lise are organised via the following slurm partitions.
Partition name | Node number | CPU | Main memory (GB) | Max. nodes per job | Max jobs (running/ queued) | Walltime (hh:mm:ss) | Remark |
---|---|---|---|---|---|---|---|
standard96 | 1204 | Cascade 9242 | 362 | 512 | 64 / 500 | 12:00:00 | default partition |
standard96:test | 32 dedicated +128 on demand | 362 | 16 | 1 / 500 | 01:00:00 | test nodes with higher priority but lower walltime | |
large96 | 28 | 747 | 8 | 64 / 500 | 12:00:00 | fat memory nodes | |
large96:test | 2 dedicated +2 on demand | 747 | 2 | 1 / 500 | 10:00:00 | fat memory test nodes with higher priority but lower walltime | |
large96:shared | 2 dedicated | 747 | 1 | 64 / 500 | 48:00:00 | fat memory nodes for data pre- and postprocessing | |
huge96 | 2 | 1522 | 1 | 64 / 500 | 24:00:00 | very fat memory nodes for data pre- and postprocessing |
See Slurm usage how to pass the 12h walltime limit with job dependencies.
Which partition to choose?
If you do not request a partition, your job will be placed in the default partition, which is standard96.
The default partitions are suitable for most calculations. The :test partitions are, as the name suggests, intended for shorter and smaller test runs. These have a higher priority and a few dedicated nodes, but are limited in time and number of nodes. Shared nodes are suitable for pre- and postprocessing. A job running on a shared node is only accounted for its core fraction (cores of job / all cores per node). All non-shared nodes are exclusive to one job.
The available home/local-ssd/work/perm storages are discussed in Storage Systems.
An overview of all partitions and node statuses is provided by: sinfo -r
To see detailed information about a nodes type: scontrol show node <nodename>
Charge rates
Charge rates for the slurm partitions you find in Accounting.
Fat-Tree Network of Lise
See OPA Fat Tree network of Lise
List of CPUs and GPUs
Short name | Link to manufacturer specifications | Where to find | Units per node | Cores per unit | Clock speed |
---|---|---|---|---|---|
Cascade 9242 | Intel Cascade Lake Platinum 9242 (CLX-AP) | CPU partition "Lise" | 2 | 48 | 2.3 |
Cascade 4210 | Intel Cascade Lake Silver 4210 (CLX) | blogin[1-6] | 2 | 10 | 2.2 |
Tesla A100 | NVIDIA Tesla A100 40GB and 80GB | 4 | 432/6912* |
*Tensor Cores / CUDA FP64 Cores