In This Message

  • Upcoming Downtimes
    • home.ccs.ornl.gov (Mar 26)
  • Meetings & Workshops ​ ​
    • March 2024 OLCF User Conference Call (Mar 27)
    • AI Training Series: Enhancing PyTorch Performance on Frontier with the RCCL/OFI-plugin (Apr 17)
  • OLCF Highlights
    • OLCF’s Summit Supercomputer Lives for Another Year with SummitPLUS
  • Community Events
    • NERSC FUN Training March 2024

Upcoming Downtimes

– home.ccs.ornl.gov will be unavailable from 4:00 PM until 8:00 PM on Tuesday, March 26.

Meetings & Workshops ​

March 2024 OLCF User Conference Call 
March 27 – NVIDIA NeMo

The March OLCF User Conference Call will be held from noon until 1:00 PM Eastern Time on Wednesday, March 27th. During this call, NVIDIA will give an overview of their NVIDIA NeMo framework. For more information, see the event page at https://www.olcf.ornl.gov/calendar/userconcall-mar2024/

AI Training Series: Enhancing PyTorch Performance on Frontier with the RCCL/OFI-plugin
April 17, 1:00 pm – 2:00 pm Eastern Time

Machine learning frameworks running on top of AMD GPUs use a library called RCCL which provides standard collective communication routines for an arbitrary number of GPUs installed across single or multiple nodes. The RCCL/OFI plugin maps RCCLs connection-oriented transport APIs to libfabric’s connection-less reliable interface. This allows RCCL applications to take benefit of libfabric’s transport layer services like reliable message support and operating system bypass. Using this plugin with PyTorch can lead to better performance.

In this seminar, an overview of using PyTorch on Frontier with the aws-ofi-rccl plugin will be provided, along with specific profiling examples run on Frontier. This seminar is intended for OLCF users that have an allocation on Frontier, but all are welcome to join and view the presentation. For more information or to register, see https://www.olcf.ornl.gov/calendar/pytorch-on-frontier/

OLCF Highlights

OLCF’s Summit Supercomputer Lives for Another Year with SummitPLUS
OLCF Director of Science Bronson Messer discusses the SummitPLUS program with HPCwire.

Read: https://www.hpcwire.com/2024/03/06/oclfs-summit-supercomputer-lives-for-another-year-with-summitplus/

Community Events

NERSC FUN Training March 2024: Introduction to Parallel Programming in Fortran
March 26-27, 2024 12:00 pm – 4:30 pm EDT

NERSC is offering virtual hands-on introduction to parallel programming in Fortran on March 26-27, 2024. This event is organized by the Fortran Users of NERSC (FUN) Special Interest Group. This training will provide different methods for taking advantage of multi-node and multi-core hardware for performing calculations in parallel using the Fortran programming language, and some common libraries and extensions. ALCF and OLCF users are welcome to this training. NERSC training accounts will be provided if needed.

Please fill out the registration form by March 22nd, 2024 if you plan to attend.