Systems
Staff Section Head - Kevin Thach
The Systems Section supports the NCCS’s computing, networking, and storage systems, including general support of critical computational and facilities-related infrastructure and systems. They administer and support high-speed parallel file systems and archive capabilities, and develop tools and administer data management platforms to ensure security, operational, and laboratory policy compliance.
Section Groups
Group | Group Description |
---|---|
HPC Clusters |
The HPC Clusters Group administers and supports the division’s large-scale cluster computing infrastructure, which includes system installation, deployment, acceptance, performance testing, upgrades, problem diagnosis, and troubleshooting. |
HPC Cybersecurity & Information Engineering |
The HPC Cybersecurity & Information Engineering Group develops tools and administers data management platforms to extract and analyze telemetry, event logs, and system state information to ensure security and laboratory policy compliance. |
HPC Infrastructure & Networking |
The HPC Infrastructure & Networking Group administers and supports networking capabilities that support the overall mission of leadership-class and scalable computing programs. |
HPC Infrastructure Operations |
The HPC Infrastructure Operations Group provides continuous monitoring, issue triaging and escalation, and general support of critical computational and facilities-related infrastructure. |
HPC Scalable Systems |
The HPC Scalable Systems Group administers and supports system installation, deployment, acceptance, performance testing, upgrades, problem diagnosis, and troubleshooting of HPC computational resources. |
HPC Storage & Archive |
The HPC Storage & Archive Group Administers and supports high-speed parallel file systems and archive capabilities, which support the overall mission of leadership-class and scalable computing programs. |