Partitions on system Lise
Compute system Lise at NHR@ZIB contains different Compute partitions for CPUs and GPUs. Your choice of partition determines the specific configuration of
- Login nodes,
- Slurm partition (compute nodes of Compute partitions), and
- Software.
Login nodes
To log in to system Lise, please
- choose a login node associated with your Compute partition and
- use authentication via SSH Login.
Software and environment modules
The webpage Software gives you information about available software on the NHR systems.
NHR provides a number of compilers and software packages for parallel computing and (serial) pre- and postprocessing:
- Compilers: Intel, GNU
- Libraries: NetCDF, LAPACK, ScaLAPACK, BLAS, FFTW, ...
- Debuggers: Allinea DDT, Roguewave TotalView...
- Tools: octave, python, R ...
- Visualisation: mostly tools to investigate gridded data sets from earth-system modelling
- Application software: mostly for engineering and chemistry (molecular dynamics)
Environment Modules are used to manage access to software and libraries. The module command offers the following functionality:
- Show lists of available software
- Enable access to software in different versions
blogin1:~ $ module avail
...
blogin1:~ $ module load intel
Module for Intel Parallel Studio XE Composer Edition (version 2019 Update 5) loaded.
blogin1:~ $ module list
Currently Loaded Modulefiles:
 1) sw.skl   2) slurm   3) HLRNenv   4) intel/19.0.5(default)
To avoid conflicts between different compilers and compiler versions, builds of the most important libraries are provided for all compilers and major release numbers.
File systems
Each complex has the following file systems available. More information about quota, usage, and best practices is available on Fixing Quota Issues. Hints for data transfer are given here.
- Home file system with 340 TiByte capacity containing $HOME directories /home/${USER}/
- Lustre parallel file system with 8.1 PiByte capacity containing
  - $WORK directories /scratch/usr/${USER}/
  - $TMPDIR directories /scratch/tmp/${USER}/
  - project data directories /scratch/projects/<projectID>/ (not yet available)
- Tape archive with 120 TiByte capacity (accessible on the login nodes, only)
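The storage locations listed above can be inspected from a shell. The following is a minimal sketch, assuming that $WORK and $TMPDIR are set by the system environment on Lise; the fallback paths only mirror the directory layout described above, and the variable names home_dir, work_dir and tmp_dir are illustrative.

```shell
#!/bin/sh
# Sketch: print the storage locations described above.
# Assumption: WORK and TMPDIR are provided by the system environment.
# The fallbacks mirror the documented layout so the script also runs
# on a machine where these variables are not set.
user="${USER:-$(id -un)}"
home_dir="${HOME:-/home/${user}}"
work_dir="${WORK:-/scratch/usr/${user}}"
tmp_dir="${TMPDIR:-/scratch/tmp/${user}}"

printf 'HOME   = %s\n' "$home_dir"
printf 'WORK   = %s\n' "$work_dir"
printf 'TMPDIR = %s\n' "$tmp_dir"
```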
Using the Slurm batch system
To run your applications on the systems, you need to go through our batch system/scheduler: Slurm. The scheduler uses meta information about the job (requested node and core count, wall time, etc.) and then runs your program on the compute nodes once the resources are available and your job is next in line. For a more in-depth introduction, visit our Slurm documentation.
We distinguish two kinds of jobs:
- Interactive job execution
- Job script execution
Resource specification
To request resources, several flags can be used when submitting the job.
Parameter | Flag | Default Value |
---|---|---|
# tasks | -n # | 1 |
# nodes | -N # | 1 |
# tasks per node | --tasks-per-node # | |
partition | -p <name> | standard96 |
Timelimit | -t hh:mm:ss | 12:00:00 |
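As an illustration of how the flags in the table combine, the sketch below assembles an sbatch command line and prints it instead of submitting. The values for node count, tasks per node and time limit are assumptions chosen for the example (96 tasks per node is only inferred from the partition name standard96), and jobscript.sh stands for your own job script.

```shell
#!/bin/sh
# Sketch: assemble an sbatch command line from the flags in the table.
# All values except the partition name are illustrative assumptions.
partition="standard96"   # -p, the documented default partition
nodes=2                  # -N, number of nodes
tasks_per_node=96        # --tasks-per-node (assumed value)
time_limit="01:00:00"    # -t, wall time limit as hh:mm:ss

cmd="sbatch -p ${partition} -N ${nodes} --tasks-per-node ${tasks_per_node} -t ${time_limit} jobscript.sh"

# Print the command only; run it by hand once the values fit your job.
echo "$cmd"
```

Any flag that is omitted falls back to the default value given in the table.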
Job scripts
Please go to our webpage CPU CLX partition for more details about job scripts. As an introduction, standard batch system jobs are executed in the following steps:
- Provide (write) a batch job script, see the examples below.
- Submit the job script with the command sbatch (sbatch jobscript.sh)
- Monitor and control the job execution, e.g. with the commands squeue and scancel (cancel the job).
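The three steps can be sketched as follows. This is a minimal example under stated assumptions, not a recommended production script: the resource values are illustrative, the payload (srun hostname) is a hypothetical placeholder for a real application, and the script only submits if sbatch is found on the current machine.

```shell
#!/bin/sh
# Step 1: write a small batch job script (resource values are illustrative).
cat > jobscript.sh <<'EOF'
#!/bin/bash
#SBATCH -p standard96
#SBATCH -N 1
#SBATCH --tasks-per-node 4
#SBATCH -t 00:10:00

# Hypothetical payload: each task reports the host it runs on.
srun hostname
EOF

# Steps 2 and 3: submit and monitor, but only where Slurm is available.
if command -v sbatch >/dev/null 2>&1; then
    sbatch jobscript.sh      # submit the job script
    squeue -u "$USER"        # list your pending and running jobs
    # scancel <jobid>        # cancel a job, using the ID printed by sbatch
else
    echo "sbatch not found; wrote jobscript.sh only"
fi
```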
Job Accounting
Accounting gives you more information about job accounting.
Every batch job is accounted. The account (project) that is debited for a batch job can be specified using the sbatch parameter --account <account>. If a batch job does not state an account (project), a default is taken from the account database. It defaults to the personal project of the user, which has the same name as the user. Users may modify their default project by visiting the Portal NHR@ZIB.
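The difference between an explicit and a default account can be sketched with a small helper. The function name submit_cmd and the project name myproject are hypothetical; the helper only prints the sbatch command line it would use, so nothing is submitted.

```shell
#!/bin/sh
# Sketch: build the sbatch command with or without an explicit account.
# submit_cmd and "myproject" are hypothetical names for illustration.
submit_cmd() {
    if [ -n "${1:-}" ]; then
        # Debit the given project.
        echo "sbatch --account $1 jobscript.sh"
    else
        # No account given: Slurm falls back to the default project.
        echo "sbatch jobscript.sh"
    fi
}

submit_cmd myproject
submit_cmd
```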