OpenMP Training Series, May – Oct 2024 – Oak Ridge Leadership Computing Facility

Introduction

The OpenMP training series presented by Michael Klemm of AMD the OpenMP ARB, and Christian Terboven of RWTH Aachen University, is part of the Performance Portability training series. Offered by NERSC, OCLF, and ALCF, the series features training sessions on various performance portable programming solutions to help ease developer transitions between current and emerging high performance computing (HPC) systems, such as the NERSC Perlmutter and Polaris (AMD CPU and Nvidia GPU), OLCF Frontier (AMD CPU and GPU), and ALCF Aurora (Intel CPU and GPU).

Overview

The OpenMP API is the de facto standard for writing parallel applications for shared memory computers. It is a portable programming model supported by multiple scientific compilers on CPU and GPU architectures. This monthly OpenMP training series offered from May to October 2024 will cover topics including OpenMP basics, parallel worksharing, tasking, memory management and affinity, vectorization, GPU offloading, and MPI/OpenMP hybrid programming. Detailed topics for each session are provided below. The format of each training session will be presentations followed by homework assignments. Homework solutions will be reviewed at the beginning of the next session. This training series is open to NERSC, OLCF, and ALCF users. Perlmutter training accounts will be provided if needed.

Session Dates and Times

All times are Pacific Daylight Time (PDT/UTC-7).

Session 1: Monday, May 6, 9:00 – 11:00 a.m.
Session 2: Monday, June 10, 9:00 – 10:30 a.m.
Session 3: Monday, July 8, 9:00 – 10:30 a.m.
Session 4: Thursday, September 5, 9:00 – 10:30 a.m.
Session 5: Monday, October 7, 9:00 – 10:30 a.m.
Session 6: Monday, October 28, 9:00 – 10:30 a.m.

Session Topics (tentative)

Session 1 (May 6): OpenMP Introduction

Welcome
OpenMP Overview
Parallel Region
Worksharing
Scoping
Tasking (short introduction)
Executing OpenMP Programs
Homework Assignments
Compile and Run on CPUs with Various OpenMP Compilers
Q&A

Session 2 (Jun 10): Tasking

Review of Session 1, Q&A
Review of Homework Assignments
Tasking Motivation
Task Model in OpenMP
Scoping
Taskloop
Dependencies
Cut-off strategy
Homework Assignments
Q&A

Session 3 (Jul 8): Optimization for NUMA and SIMD

Review of Session 2, Q&A
Review of Homework Assignments
OpenMP and NUMA Architectures
SIMD (vectorization)
Misc Optimizations
MPI and Multi-threading
Homework Assignments
Q&A

Session 4 (Sept 5): Introduction to Offloading with OpenMP

Review of Session 3, Q&A
Review of Homework Assignments
Introduction to OpenMP Offload Features
Device Model
Creating Parallelism on the Target Device
Teams and Distribute Constructs
Loop Constructs
Homework Assignments
Compile and Run on CPUs with Various OpenMP Compilers
Q&A

Session 5 (Oct 7): Advanced OpenMP Offloading Topics

Review of Session 4, Q&A
Review of Homework Assignments
Unstructured Data Movement
Reducing Data Transfers
HALO Exchange
Asynchronous Offloading
Real-World Application Case Study: NWChem
Integration of GPU-Kernels (i.e., HIP)
Homework Assignments
Q&A

Session 6 (Oct 28): Selected / Remaining Topics

Review of Session 5, Q&A
Review of Homework Assignments
Remaining OpenMP Topics from Previous Sessions
Task Affinity
Hybrid Programming: Detached Tasks
Hybrid Programming: MPI + OpenMP
Q&A
Homework Help

Registration

This event will be presented online only using Zoom. Registration is required for remote participation.

You can register anytime during the OpenMP training series, even if you missed some early sessions. Slides, videos, and exercises will be posted before the next upcoming session to help you catch up.