...
Start | End | Topic | Presenter |
---|---|---|---|
09:00 | 09:10 | Welcome and Introduction to Day 2 | Edmund Preiss (Intel), Dr. Thomas Steinke (ZIB) |
09:10 | 10:10 | A oneAPI Case Study: easyWave - A Tsunami Simulation Application - with demos. ZIB's experiences with Intel oneAPI: - Porting a tsunami application from CUDA to DPC++ - Running DPC++ code on GPU - From CUDA to DPC++ back to Nvidia GPUs... and FPGAs - A oneAPI case study with the tsunami simulation easyWave. Live demos: (1) overview of the CUDA code, (2) CUDA code execution, (3) guided transformation with the Compatibility Tool, (4) presentation/analysis of the generated DPC++ code, (5) execution on different architectures (Intel CPU + GPU, Nvidia GPU) | Steffen Christgau (ZIB) (Marius Knaust, ZIB) |
10:10 | 10:50 | Intel OpenMP – with Demos - Parallelizing heterogeneous applications with Intel OpenMP and OpenMP offloading | Michael Steyer (Intel) |
10:50 | 11:00 | Bio Break | |
11:00 | 11:45 | Intel HW (public) roadmap/XPU and architecture specifics - Server CPUs - Client CPUs (e.g., Tiger Lake; iCore with integrated graphics) - Intel hardware accelerators: GPUs (DG1, SG1, ATS) and Intel FPGAs | Jean-Laurent Philippe (Intel) |
11:45 | 12:15 | A oneAPI Case Study: Ginkgo – a sparse linear algebra library for oneAPI hardware. Experience and potential of using oneAPI for the Ginkgo sparse linear algebra library: - Porting of numerical linear algebra kernels from CUDA to DPC++ - Pitfalls and solutions in the oneAPI Compatibility Tool - Performance of Ginkgo’s DPC++ backend on Intel GPU - Live demo running Ginkgo in the Intel DevCloud | Prof. Hartwig Anzt (KIT) (Y. Tsai, T. Cojean, KIT) |
12:15 | 13:15 | Lunch Break | |
13:15 | 14:30 | Application profiling for heterogeneous hardware - Demos - Profile DPC++ and GPU workloads with VTune Profiler and VTune Offload - Share experiences/key findings from GROMACS-related porting and optimization efforts | Heinrich Bockhorst (Intel) |
14:30 | 14:35 | Bio Break | |
14:35 | 15:35 | Application profiling for heterogeneous hardware - Demos - Profile DPC++ and GPU workloads with Advisor, including Roofline analysis - Estimate potential performance gains with Offload Advisor (CPU -> HW accelerator) | Klaus-Dieter Oertel (Intel) |
15:35 | 16:00 | Questions and Answers - Wrap-up | Intel / ZIB |
...