Skip to main content

OLCF and Fermilab staff tweak I/O system to give users real-time simulation analysis capabilities

ADIOS and BDE researchers have incorporated a new methodology into the integrated tool that allows users to compress and stream data in real time. The new approach leverages software-defining network capabilities, allowing users to control network resources rather than manually requesting to connect.

ADIOS and BDE researchers have incorporated a new methodology into the integrated tool that allows users to compress and stream data in real time. The new approach leverages software-defining network capabilities, allowing users to control network resources rather than manually requesting to connect.

Projects large enough to run on high-performance computing (HPC) resources pack data—and a lot of it. Transferring this data between computational and experimental facilities is a challenging but necessary part of projects that rely on experiments to validate computational models.

Staff at two US Department of Energy (DOE) Office of Science User Facilities—the Oak Ridge Leadership Computing Facility (OLCF) and Fermi National Accelerator Laboratory (Fermilab)—facilitated this process by executing the integration of the Adaptive Input/Output System (ADIOS) and the BigData Express (BDE) high-speed data transfer service.

Now ADIOS and BDE developers are changing the way researchers can transport and analyze data by incorporating a new methodology into the tool that allows for compressing and streaming of data coming out of simulations in real time. The methodology is being tested by OLCF user C. S. Chang, a plasma physics researcher at Princeton Plasma Physics Laboratory (PPPL) who studies the properties of the plasmas that exist in giant fusion devices called tokamaks. Chang seeks an understanding of the power needed to run ITER and the heat-load to the material wall that will surround its plasma, both of which are key to fusion’s viability. ITER is an international collaboration working to design, construct, and assemble a burning plasma experiment that can demonstrate the scientific and technological feasibility of fusion power for the commercial power grid. ITER, which counts DOE’s Oak Ridge National Laboratory (ORNL) among its partners, is currently under construction in southern France.

“If users can separate out the most important pieces of data and move those to another processor that can recognize the intended prioritization and reduce the data, it can provide them with feedback that they may need to stop a simulation if necessary,” said Scott Klasky, leader of the ADIOS framework and group leader for ORNL’s Scientific Data Group.

Wenji Wu, principal investigator of the BDE project and principal network research investigator of Fermilab’s Core Computing Division, added, “The new approach leverages the software-defining network [SDN] capabilities for resource scheduling and the high-performance data streaming capabilities of BDE.”

SDN allows users to dynamically control network resources rather than manually request to connect. “This combination enables real-time data streaming with guaranteed quality of service, whether it be over short or long distances,” Wu said. “In addition, this approach yields small memory footprints.”

Although the project is still in the development phase, preliminary tests allowed Chang and his team to successfully transfer fusion data between the OLCF—located at ORNL—and PPPL.

“With this new methodology, users can stream data on the network without ever touching the file system and request network resources on the fly,” said ADIOS and BDE researcher Qing Liu, who has a joint appointment with the New Jersey Institute of Technology and ORNL.

Without streaming capabilities, scientists can perform only after-the-fact analyses for many experiments, such as KSTAR, the Korean Superconducting Tokamak Advanced Research. But with simulations and experiments increasing in size, near–real-time monitoring and control are becoming necessary. The new ADIOS–BDE integration could also play a major role in large experimental projects, such as the fusion project Chang is leading and the Square Kilometer Array, an effort involving dozens of institutions to build the world’s largest radio telescope. The new streaming capabilities could more easily enable the capture of short-lived events such as pulsars—neutron stars that emit electromagnetic radiation—that the telescope aims to record.

“KSTAR wants to transfer their data as the experiment is happening, to process their data during the experiment,” Klasky said. “These additions to ADIOS will enable both sides to quickly perform data analysis and visualization in real time.”

Seo-Young Noh, director of the Global Science Experimental Data Hub Center at the Korea Institute of Science and Technology Information, leads a group that has contributed significantly to the BDE project. “Our work has made cross-Pacific, real-time data streaming possible,” Noh said.

Klasky, Liu, and their collaborators gave a best paper plenary talk related to these new capabilities—titled “Understanding and Modeling Lossy Compression Schemes on HPC Scientific Data”—at the 32nd IEEE International Parallel and Distributed Processing Symposium in Vancouver, Canada, this week. The team noted that the new ADIOS methodology will allow scientists to efficiently select the type of compression that will best fit their scientific and research needs, affording them the ability to analyze their data faster than ever before.

Liang Zhang, the developer of BDE data streaming capabilities, is working with Liu to enhance and test the tool. They expect the tool’s new capabilities to be fully tested and deployed by late 2019. This work also involves ADIOS researcher Jason Wang and BDE researchers Nageswara Rao, Phil DeMar, Qiming Lu, Sajith Sasidharan, S. A. R. Shah, Jin Kim, and Huizhang Luo.

ORNL is managed by UT-Battelle for the Department of Energy’s Office of Science, the single largest supporter of basic research in the physical sciences in the United States. DOE’s Office of Science is working to address some of the most pressing challenges of our time. For more information, please visit https://science.energy.gov/.

Rachel McDowell

Rachel McDowell is a science writer for the Oak Ridge Leadership Computing Facility.