Fundamental CUDA Optimization (Part 2)

Fundamental CUDA Optimization (Part 2)
Thursday, April 16, 2020

NOTE: The format of this event has been changed to online only. NVIDIA will present remotely for the first ~1 hour and the remote connection will be left open for the hands-on session, where representatives from OLCF, NERSC, and NVIDIA will be available to support participants.

ORNL Remote Participation Only 1:00 PM – 3:00 PM (ET)
NERSC Remote Participation Only 10:00 AM – 12:00 PM (PT)

On Thursday, April 16, 2020, NVIDIA will present part 4 of a 9-part CUDA Training Series titled “Fundamental CUDA Optimization (Part 2)”.

This part of the series is aimed at basic optimization principles. We will introduce users to optimization strategies related to kernel launch configurations, GPU latency hiding, global memory throughput, and shared memory applicability. After the presentation, there will be a hands-on session where participants can complete example exercises meant to reinforce the presented concepts.

Remote Participation
Remote participants can watch the presentations via web broadcast and will have access to the training exercises, but temporary access to the compute systems will be limited as follows:

  • Current NERSC users will have Cori-GPU access temporarily added to their accounts.
  • Temporary Summit access will not be available for remote participants.

NOTE: Registration is required. Please submit the form below.

If you have any questions, please contact Tom Papatheodore (

Zoom Connection Details:
Meeting ID: 510 486 5180
Call In: +1 669 900 6833
The example exercises for this module can be found in the exercises/hw4 folder of the following GitHub repo:


Apr 16 2020


Thomas Papatheodore
1 (865) 576-1244
QR Code

Comments are closed.