GUIDE is a framework used to collect, federate, and analyze log data from OLCF, and to derive insights into facility operations based on the log data. GUIDE collects system logs and extracts monitoring data at every level of the various OLCF subsystems, and applies a suite of pre-processing tools to make the raw data consumable. The cleansed logs are then ingested and federated into a central, scalable data warehouse, Splunk, that offers storage, indexing, querying, and visualization capabilities. The GUIDE framework further supports a set of tools to analyze these multiple disparate log streams in concert to derive operational insights. The system has been in operation for over two years n the production OLCF environment.
The main user interface to GUIDE is a set of Splunk dashboards available at: http://guide.ccs.ornl.gov