Example: Request a single Multi-Instance GPU (MIG) slice on the corresponding Slurm partition
$ srun --gpus=1 --partition=gpu-a100:shared:mig example_cmd

Hardware configuration

NHR@ZIB offers access to compute nodes equipped with Nvidia A100 GPUs. The GPU A100 partition consists of two login nodes and 42 compute nodes with the following properties for a single node:

  • 2x Intel Xeon "Ice Lake" Platinum 8360Y (36 cores per socket, 2.4 GHz, 250 W)
  • 1 TB RAM (DDR4-3200)
  • 4x Nvidia A100 (80 GB HBM2, SXM), two attached to each CPU socket
  • 7.68 TB NVMe local SSD
  • 200 Gbit/s InfiniBand adapter (Mellanox MT28908)
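
The per-node figures above imply a few aggregate numbers that are handy when sizing a job. The following is a minimal sketch that derives them with plain shell arithmetic. The variable names are illustrative, and the totals assume that all 42 compute nodes are identical and that only physical cores are counted.

```shell
#!/bin/sh
# Figures taken from the hardware list above.
sockets_per_node=2
cores_per_socket=36
gpus_per_node=4
gpu_mem_gb=80
compute_nodes=42   # assumption: all compute nodes share this configuration

# Derived per-node values (physical cores only).
cores_per_node=$((sockets_per_node * cores_per_socket))
gpu_mem_per_node_gb=$((gpus_per_node * gpu_mem_gb))
cores_per_gpu=$((cores_per_node / gpus_per_node))

# Derived partition-wide values.
total_gpus=$((compute_nodes * gpus_per_node))
total_cores=$((compute_nodes * cores_per_node))

echo "CPU cores per node:      $cores_per_node"
echo "GPU memory per node:     $gpu_mem_per_node_gb GB"
echo "CPU cores per GPU:       $cores_per_gpu"
echo "GPUs in the partition:   $total_gpus"
echo "CPU cores in partition:  $total_cores"
```

The cores-per-GPU value is only a rough guide for choosing how many CPU cores to request alongside each GPU. It is not a limit enforced by the scheduler.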

The hardware of the login nodes is similar to that of the A100 GPU compute nodes. Notable exceptions are the reduced memory (512 GB instead of 1 TB RAM) and the absence of GPUs (and of CUDA drivers) on bgnlogin[1-2].