...
Codeblock | ||
---|---|---|
| ||
#!/bin/bash #SBATCH --nodes=2 #SBATCH --ntasks-per-node=8 #SBATCH --time=01:00:00 #SBATCH --partition=standard96 srun ./myexample.bin |
For this example 16 MPI tasks are executed. Once you add the linux command strace
to the code start you create two file files per linux process (MPI task (linux process)) , in this case 32 trace files.
Codeblock | ||
---|---|---|
| ||
#!/bin/bash #SBATCH --nodes=2 #SBATCH --ntasks-per-node=8 #SBATCH --time=01:00:00 #SBATCH --partition=standard96 srun strace -ff -t -o trace -e open,openat ./myexample.bin |
Analysing one trace file shows all file open
activity of one process.
Codeblock | ||
---|---|---|
| ||
> ls -l trace.*
-rw-r----- 1 bzfbml bzfbml 21741 Mar 10 13:10 trace.445215
...
> wc -l trace.445215
258 trace.445215
> cat trace.445215
13:10:37 open("/lib64/libc.so.6", O_RDONLY|O_CLOEXEC) = 3
13:10:37 open("/lib64/libfabric.so.1", O_RDONLY|O_CLOEXEC) = 3
...
13:10:38 open("/scratch/usr/bzfbml/mpiio-filesystem/mpiio_zxyblock.dat", O_RDWR) = 8
13:10:43 +++ exited with 0 +++ |
You need to expect a
Known issues
For some of the codes we are aware of certain issues:
...