Tuesday, September 14, 2021
|ORNL||Remote Participation Only||1:00 PM – 3:00 PM (ET)|
|NERSC||Remote Participation Only||10:00 AM – 12:00 PM (PT)|
NVIDIA will present “CUDA Debugging” on Tuesday, September 14, 2021. This event is a continuation of the CUDA Training Series and will be presented by Robert Crovella from NVIDIA.
When your CUDA codes are not working at all, or not giving you the correct answer, there are a set of techniques to be aware of to tackle any debugging issue. First, we’ll review runtime error-checking best practices. We’ll cover “sticky” vs. “non-sticky” errors and under what situations it’s possible and how to recover from CUDA errors. Next, we’ll take a look at a powerful tool called compute-sanitizer, which is the recommended first debugging tool to dust off. We’ll cover basic usage of the tool as well as how to use the various sub-tools. Finally, we’ll cover use of the cuda-gdb debugger. How to build debug codes, starting the debugger, and how to set breakpoints, single-step, watch variables, inspect memory, and switch thread focus. Homework will be provided to reinforce the concepts.
All remote participants can watch the presentations via web broadcast and will have access to the training exercises, but temporary access to the compute systems will be limited as follows:
- Current NERSC users will have Cori-GPU access temporarily added to their accounts.
- Temporary Summit access will not be available for remote participants.
If you have any questions, please contact Subil Abraham (firstname.lastname@example.org)
Zoom information will be posted here on the day of the event.