The High-Performance Computing and Data Operations Group keeps the OLCF leadership supercomputing systems running. Members of the group monitor all systems 24 hours a day, seven days a week, 365 days a year, and are responsible for administration, configuration management, and cybersecurity. The staff works with infrastructure systems, with Titan, with scratch and archival storage, and with other OLCF supercomputers. HPC Ops tests the systems when they are installed and upgraded, and uses diagnostic tools to monitor them continually. Staff members anticipate problems before they arise and identify components that are near failure. The group also ensures that all systems conform to ORNL cybersecurity policy.

Meet the High-Performance Computing and Data Operations Team

Sharon Allen

Group Secretary

Philip Curtis

HPC Linux/Storage System Administrator

Colin Dietz

Linux Systems Engineer

Clay England

System Administrator

Matt Ezell

HPC Systems Administrator

Gregg Gawinski

HPC UNIX/Storage System Administrator

Jesse Hanley

HPC Unix/Storage System Administrator

Jason Kincl

HPC Linux Systems Engineer

Dustin Leverman

HPC Systems Administrator

John Long

HPC Systems Administrator

Don Maxwell

HPC Systems Administrator

Brenna Miller

HPC Systems Engineer

Chris Muzyn

HPC Systems Administrator

Jeff Niles

HPC Storage Systems Engineer

Paul Peltz

Senior HPC Systems Engineer-HPC Ops

Rich Ray

HPC Systems Administrator

Jeremy Rogers

HPC Systems Administrator

Sergey Shpanskiy

HPC Systems Administrator

Lawrence Sorrillo

HPC Systems Linux Cluster Administrator

Kevin Thach

Group Leader, High-Performance Computing Operations

Joseph Voss

HPC Linux Systems Engineer

Tony Walsh

HPC Linux Systems Engineer - Clusters

Brian Zachary

Scalable Protected Systems Architect

Group R&D Activities

RATS is a customer relationship management tool used by OLCF staff. Developed internally by the OLCF, RATS provides…

This project will provide an important model for future exascale computing, increasing the coherence between the technology base…

Resource selection can have profound impacts on the performance and reliability of applications running on the supercomputer. On…

The Spider Lustre-based Parallel File System Development and Deployment: The OLCF has deployed multiple large-scale parallel file systems…

The Accelerated Data Analytics and Computing Institute has been established to explore potential future collaboration among UT-Battelle, LLC…