Learn how to write a portable parallel program that can run on multicore CPUs and accelerators like GPUs and how to apply incremental parallelization strategies using the OpenACC programming model to accelerate an example application that simulates heat distribution across a 2-dimensional metal plate and applying this knowledge to accelerate a mini-application.
The bootcamp will be hosted online in the Central European Summer Time zone (CEST). All communication will be done through Zoom, Slack and email.
Basic experience with C/C++ or Fortran. No GPU programming knowledge is required.
This event has limited capacity, so please make sure that prerequisites are met before applying. You will be receiving an acceptance email with details on how to participate by April 22, 2021.
Day 1: April 29 2021 (9:00-12:30 CEST)
- Welcome (Moderator): 9:00
- Connecting to Hackathon cluster (9:15-9:30)
- Introduction to GPU Programming with OpenACC (09:30 - 12:30)
- Introduction to GPU programming (15 min)
- What is a GPU and Why Should You care?
- What is GPU Programming?
- Available Libraries, Programming Models, Platforms
- Introduction to OpenACC (45 min)
- What is OpenACC and Why Should You Care?
- Profile-driven Development
- First Steps with OpenACC
- Lab 1
- OpenACC Data Management (45 min)
- CPU and GPU Memories
- CUDA Unified (Managed) Memory
- OpenACC Data Management
- Lab 2
- OpenACC Loop Optimization (45 min)
- Gangs, Workers, and Vectors Demystified
- Loop Optimizations
- Lab 3
Day 2: April 30, 2021 (9:00 AM to 12:30 CEST)
Welcome (Moderator): 9:00
Mini-application challenge (9:00-12:00)
- Overview of the mini-application (15 min)
- Review steps to acceleration (5 min)
- Application challenge
Mini-application Solution Walk-through and next steps (12:00-12:30)
Homework [Offline, support through slack] :
- Introduction to NVIDIA ® Nsight™ Tools
- Overview of Nsight Tools
- How to profile a serial application with NVIDIA Tools Extension (NVTX)
- Overview of optimization cycle with Nsight Systems
- Profiling mini-application
- Profile a sequential weather modeling application (integrated with NVTX APIs) with NVIDIA Nsight Systems to capture and trace CPU events and time ranges
- Understand how to use NVIDIA Nsight Systems profiler’s report to detect hotspots and apply OpenACC compute constructs to the serial application to parallelise it on the GPU
- Learn how to use Nsight Systems to identify issues such as underutilized GPU device and unnecessary data movements in the application and to apply optimization strategies steps by steps to expose more parallelism and utilize computer’s CPU and GPU
Teams attending the event will be given access to a GPU cluster for the duration of the hackathon.
Important Event Dates
Cyfronet & NVIDIA OpenACC Bootcamp Application Deadline
Cyfronet & NVIDIA OpenACC Bootcamp Day 1
Cyfronet & NVIDIA OpenACC Bootcamp Day 2