Project Description

The Spider Lustre-based Parallel File System Development and Deployment: The OLCF has deployed multiple large-scale parallel file systems (PFS) to support its operations. During this process, OLCF acquired significant expertise in large-scale storage system design, file system software development, technology evaluation, benchmarking, procurement, deployment, and operational practices. Based on the lessons learned from each new PFS deployment, OLCF improved its operating procedures and strategies.

A model-driven provisioning tool that helps storage system designers and administrators reconcile key figures of merit (cost, capacity, performance, disk size, rebuild times, redundancy), answer what-if scenarios, and determine the relative importance of spare parts in minimizing data unavailability, both during initial system provisioning and during continuous operations. The tool was validated with field-observed failure data from two years of Spider operations.
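
As a rough illustration of the kind of what-if question such a tool addresses, the sketch below compares two hypothetical disk options for a fixed usable-capacity target. It is not the OLCF tool or its model: the names DiskOption and provision, the 8+2 RAID group layout, the prices, and the rebuild rates are all assumptions made for this example, and the rebuild time is a naive estimate (disk size divided by a constant rebuild rate).

```python
import math
from dataclasses import dataclass


@dataclass
class DiskOption:
    """Hypothetical disk model; all values are illustrative assumptions."""
    name: str
    size_tb: float        # raw capacity per disk, in TB (1 TB = 1e6 MB)
    cost_usd: float       # price per disk
    rebuild_mb_s: float   # assumed sustained rebuild rate, in MB/s


def provision(option, target_usable_tb, data_disks=8, parity_disks=2):
    """Estimate disk count, cost, and rebuild time for a capacity target.

    Assumes fixed-size RAID groups of data_disks + parity_disks and a
    rebuild that rewrites one whole disk at a constant rate.
    """
    group_size = data_disks + parity_disks
    usable_per_group_tb = option.size_tb * data_disks
    # Round up: a partially filled group still needs all of its disks.
    groups = math.ceil(target_usable_tb / usable_per_group_tb)
    disks = groups * group_size
    rebuild_hours = option.size_tb * 1e6 / option.rebuild_mb_s / 3600
    return {
        "name": option.name,
        "disks": disks,
        "usable_tb": groups * usable_per_group_tb,
        "cost_usd": disks * option.cost_usd,
        "rebuild_hours": rebuild_hours,
    }


if __name__ == "__main__":
    # What-if: same capacity target, two hypothetical disk sizes.
    options = [
        DiskOption("small", size_tb=2.0, cost_usd=100.0, rebuild_mb_s=50.0),
        DiskOption("large", size_tb=6.0, cost_usd=250.0, rebuild_mb_s=50.0),
    ]
    for opt in options:
        result = provision(opt, target_usable_tb=1000.0)
        print(
            f"{result['name']}: {result['disks']} disks, "
            f"${result['cost_usd']:,.0f}, "
            f"rebuild ~{result['rebuild_hours']:.1f} h"
        )
```

Under these assumptions, the larger disk needs fewer drives for the same capacity but takes longer to rebuild after a failure, which is the sort of trade-off among cost, disk size, and rebuild time that the figures of merit above capture. A real model would also need failure rates and spare-part availability, which this sketch leaves out.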