Systems

Staff Section Head - Kevin Thach

The Systems Section supports the NCCS’s computing, networking, and storage systems, including general support of critical computational and facilities-related infrastructure and systems. They administer and support high-speed parallel file systems and archive capabilities, and develop tools and administer data management platforms to ensure security, operational, and laboratory policy compliance.

Section Groups

Group Group Description
HPC Clusters

The HPC Clusters Group administers and supports the division’s large-scale cluster computing infrastructure, which includes system installation, deployment, acceptance, performance testing, upgrades, problem diagnosis, and troubleshooting.

HPC Cybersecurity & Information Engineering

The HPC Cybersecurity & Information Engineering Group develops tools and administers data management platforms to extract and analyze telemetry, event logs, and system state information to ensure security and laboratory policy compliance.

HPC Infrastructure & Networking

The HPC Infrastructure & Networking Group administers and supports networking capabilities that support the overall mission of leadership-class and scalable computing programs.

HPC Infrastructure Operations

The HPC Infrastructure Operations Group provides continuous monitoring, issue triaging and escalation, and general support of critical computational and facilities-related infrastructure.

HPC Scalable Systems

The HPC Scalable Systems Group administers and supports system installation, deployment, acceptance, performance testing, upgrades, problem diagnosis, and troubleshooting of HPC computational resources.

HPC Storage & Archive

The HPC Storage & Archive Group Administers and supports high-speed parallel file systems and archive capabilities, which support the overall mission of leadership-class and scalable computing programs.