Weekly Update: October 4, 2023
In This Message
– Andes and DTN (Oct 18 – Oct 20)
– OLCF Alpine Decommission (Begin moving data now)
– Call for Proposals for Summit in 2024
– Frontier and Crusher System Update
Meetings & Workshops
– Introduction to OpenMP Offload Part 2: Optimization and Data Management (Oct 6)
– RAJA Training (Oct 10)
– GPU Performance Profiling, Omniperf (Oct 16)
- Andes and DTN will be unavailable from Oct 18, 7:00 AM to Oct 20 midnight.
OLCF Alpine Decommission
(Begin moving data now)
The Alpine filesystem has reached the end of its life and data cannot remain on it after the end of December. Alpine will become read-only on December 19, 2023 to prepare for the disposition of Alpine on January 1, 2024. To assist you with moving your data off of Alpine, the DTNs mount the new Orion filesystem and all projects with access to Alpine have now been granted access to the Orion filesystem. We highly encourage all teams to start migrating and/or deleting data from the Alpine filesystem now. If you wait too late in the year to begin the transition, you will run the risk of running out of time to move your data before the system is decommissioned. It is important to note that any data remaining on the Alpine filesystem after December 31, 2023 will truly be unavailable and not recoverable in any way as the system will be dismantled and the drives will be shredded.
More details on the Alpine decommission timeline can be found at https://docs.olcf.ornl.gov/systems/2023_olcf_system_changes.html
Call for Proposals for Summit in 2024
The Department of Energy is extending Summit operations through October 2024, enabling researchers to pursue projects on one of the world’s leading AI-enabled open science supercomputing platforms. OLCF will allocate Summit through new programs for the calendar year 2024. SummitPLUS is one of the new allocation programs that will be used to allocate a significant portion of the system for 2024. The program is open to researchers from academia, government laboratories, federal agencies, and industry. We welcome proposals for computationally ready projects from investigators who are new to Summit, as well as from previous INCITE, ALCC, DD, ECP awardees and projects. We encourage proposals on emerging paradigms for computational campaigns including data-intensive science and AI/ML.
More information on the SummitPLUS allocation program and Alpine decommission can be found at https://docs.olcf.ornl.gov/systems/2023_olcf_system_changes.html
Frontier and Crusher System Update
On Tuesday, October 3, 2023, Frontier and Crusher were upgraded to a new version of the AMD GPU driver and the Slurm resource manager. For more information, please see the “System Updates” section in the corresponding system user guide.
Meeting & Workshops
Introduction to OpenMP Offload Part 2: Optimization and Data Management
October 6, 2023
12-2:30 pm EDT
Virtual via Zoom
The OpenMP API is a scalable model that gives parallel programmers a simple and flexible interface for developing portable parallel applications in C/C++ and Fortran. For part 2 of our OpenMP Offload series, OLCF/NERSC staff will cover optimization strategies and show how efficient data movement and a better understanding of the hierarchy of parallelism available can lead to improved performance. NERSC/OLCF staff will also cover best practices for OpenMP Offload.
For details and to register see:
October 10, 12:00 – 3:00 (ET), Virtual via Zoom
The OLCF, NERSC, and RAJA team has collaborated to offer a training session on RAJA as a part of our Performance Portability Training Series. This is a 1-part session that will allow participants to learn from and interact directly with RAJA team members. The session will give a general overview of RAJA and cover the basics of using RAJA abstractions to offload work to the GPUs. Throughout the session, a variety of quiz-like puzzles will be used to engage the audience and reinforce concepts. For details and registration, please see: https://www.olcf.ornl.gov/calendar/performance-portability-training-series-raja/
GPU Performance Profiling, Omniperf
October 16, 1:00 – 3:00 (ET), Virtual
As part of the HIP Training Series, AMD will present “GPU Profiling (Performance Timelines: Rocprof and Omnitrace)”. AMD’s rocprof profiler collects the basic hardware counter data to enable profiling. Omniperf adds to this hardware counter data many derived metrics and presents it in a form that application developers can use to tune their kernels and applications for top performance. Hands-on exercises will generate performance profiles, implement optimizations, and then generate a performance report that shows the differences between the optimized and unoptimized performance profiles. For details and registration, please see: https://www.olcf.ornl.gov/calendar/gpu-profiling-omniperf/