In This Message

  • Upcoming Downtimes
    • No scheduled outages through April 5
  • Meetings & Workshops ​ ​
    • March 2024 OLCF User Conference Call (Mar 27)
    • AI Training Series: Enhancing PyTorch Performance on Frontier with the RCCL/OFI-plugin (Apr 17)
  • OLCF Highlights
    • Scientists Use Summit to Explore Exotic Stellar Phenomena

Upcoming Downtimes

– No scheduled outages through April 5.

Meetings & Workshops ​

March 2024 OLCF User Conference Call 
March 27 – NVIDIA NeMo

The March OLCF User Conference Call will be held from noon until 1:00 PM Eastern Time on Wednesday, March 27th. During this call, NVIDIA will give an overview of their NVIDIA NeMo framework. For more information, see the event page at https://www.olcf.ornl.gov/calendar/userconcall-mar2024/

AI Training Series: Enhancing PyTorch Performance on Frontier with the RCCL/OFI-plugin
April 17, 1:00 pm – 2:00 pm Eastern Time

Machine learning frameworks running on top of AMD GPUs use a library called RCCL which provides standard collective communication routines for an arbitrary number of GPUs installed across single or multiple nodes. The RCCL/OFI plugin maps RCCLs connection-oriented transport APIs to libfabric’s connection-less reliable interface. This allows RCCL applications to take benefit of libfabric’s transport layer services like reliable message support and operating system bypass. Using this plugin with PyTorch can lead to better performance.

In this seminar, an overview of using PyTorch on Frontier with the aws-ofi-rccl plugin will be provided, along with specific profiling examples run on Frontier. This seminar is intended for OLCF users that have an allocation on Frontier, but all are welcome to join and view the presentation. For more information or to register, see https://www.olcf.ornl.gov/calendar/pytorch-on-frontier/

OLCF Highlights

Scientists Use Summit to Explore Exotic Stellar Phenomena
A team at State University of New York, Stony Brook simulated thermonuclear flames spreading across neutron stars in 2 and 3D

Read: https://www.olcf.ornl.gov/2024/03/15/scientists-use-summit-supercomputer-to-explore-exotic-stellar-phenomena/