Weekly Update: August 14, 2019
IN THIS MESSAGE
- Center Announcements
- Atlas is Read-Only; All data removed Aug 19
- Alpine Purge Policy Has Been Implemented
- Rhea Jobs Should Use Alpine
- Transition to Slurm on Rhea and DTNs
- Meetings & Workshops
- OLCF GPU Hackathon Series
- Introduction to AMD GPU Programming With HIP Webinar (Sep 6)
- Upcoming Scheduled Outages
- SummitDev (through Aug 15)
- HPSS (Aug 20-21)
- home.ccs.ornl.gov (Aug 28)
ATLAS IS READ-ONLY; ALL DATA REMOVED ON AUG 19
The Atlas filesystem was made read-only on Monday, August 5 and will be decommissioned (with all data permanently removed) on Monday, August 19. Please begin moving data out of that filesystem and onto the newer Alpine filesystem or to some alternate location immediately. If you have questions about data movement, please contact firstname.lastname@example.org. For more information, please see the 2019 OLCF System Decommissions and Notable Changespage.
ALPINE PURGE POLICY HAS BEEN IMPLEMENTED
The 90-day purge policy for the Alpine GPFS filesystem was enabled on August 6, 2019. Files in Atlas that have not been accessed or modified in the last 90 days are eligible for the purge.
RHEA JOBS SHOULD USE ALPINE
The Alpine filesystem is now mounted on Rhea, and your jobs should use it instead of Atlas. Atlas has been made read-only and will be decommissioned on August 19 (see the announcement above). Jobs targeting Atlas will now fail. Make sure you’re using Alpine, and if you have any questions please contact email@example.com.
TRANSITION TO SLURM ON RHEA AND DTNS
Rhea and the Data Transfer Nodes will transition to using Slurm for job scheduling on September 3, 2019. To help ensure a smooth transition, we have made 192 Rhea nodes and 2 DTNs available for your testing purposes. Slurm is available from the login nodes of both systems, and we encourage you to try it out before the transition on September 3. For more information on using Slurm at OLCF, see the Running Jobs section of the Rhea User Guide.
MEETINGS & WORKSHOPS
OLCF GPU HACKATHON SERIES
OLCF is once again supporting several GPU hackathons. GPU hackathons are 5-day coding events in which teams of developers prepare their own applications(s) to run on GPUs or focus on optimizing their application(s) that currently run on GPUs. Teams should consist of three or more developers who are intimately familiar with (some part of) their application, and they will work alongside two mentors with GPU programming expertise. If you want/need to get your code running (or optimized) on a GPU-accelerated system, these hackathons offer a unique opportunity to set aside 5 days, surround yourself with experts in the field, and push toward your goals. If you are interested in more information, or would like to submit a proposal to one of the upcoming events, please visit the OLCF GPU Hackathons event page.
The OLCF Hackathon (October 21-25 in Knoxville, TN) is the only hackathon still accepting proposals; the deadline to apply is August 16.
INTRODUCTION TO AMD GPU PROGRAMMING WITH HIP WEBINAR (SEP 6)
AMD Research will be presenting a webinar titled, “Introduction to AMD GPU programming with HIP” on September 6th from 1:00 PM to 3:00 PM ET. HIP is a C++ runtime API that allows developers to write portable code to run on AMD and NVIDIA GPUs. It is an interface that uses the underlying Radeon Open Compute (ROCm) or CUDA platform that is installed on a system. The API is similar to CUDA so porting existing codes from CUDA to HIP should be fairly straightforward in most cases. For more information, or to register, please visit the event page.
UPCOMING SCHEDULED OUTAGES
SummitDev is currently unavailable and will return to service at 8:00 AM on Thursday, August 15.
HPSS will be unavailable from 8:00 AM on Tuesday, August 20 until 8:00 AM on Wednesday, August 21.
The home.ccs.ornl.gov general login system may be intermittently unavailable from 4:00 PM until 9:00 PM on Wednesday, August 28.