The Advanced Data and Workflows Group offers scientific, technical, operational, and thought leadership for building data-driven computing environments for OLCF user needs. The group designs and develops creative data-science workflows (compute, analytic, and visualization) to enable interactive data-driven discoveries that require scale and performance on leadership computing resources hosted by the OLCF and big data resources hosted by the Compute and Data Environment for Science (CADES). Our team of data scientists, software engineers, and scientific visualization experts guides users through the data-to-knowledge discovery process by understanding user needs, designing novel algorithms and workflows, and implementing and supporting convenient tools.

Meet the Advanced Data and Workflows Team

Valentine Anantharaj

Computational Scientist (Climate)

Jamison Daniel

Data Scientist

Pete Eby

Systems Architect

Katherine (Kat) Engstrom

Visualization Support Specialist

Michael Galloway

Systems Engineer

Benjamin Hernandez

Computer Scientist

Susan Hicks

HPC Network Engineer

Chris Layton

Linux Systems Engineer

Ketan Maheshwari

Linux Systems Engineer

Michael (Mike) Matheson

Visualization Specialist

Sheila Moore

Administrative Assistant/Project Management Assistant

Steve Moulton

Unix/Linux System Engineer

George Ostrouchov

Senior Data Scientist

Norbert Podhorszki

Team Lead: Scientific Data Management

Ryan Prout

Systems and Network Engineer

David Pugmire

Team Lead: Scientific Data Analytics

Drew Schmidt

Software Engineer

Arjun Shankar

Group Leader, Advanced Data and Workflows

Suhas Somnath

Research Staff

Dale Stansberry

Systems Programmer

James Trater

Linux System Engineer

Junqi Yin

Research Staff

Brian Zachary

CADES Team Lead

Group R&D Activities

This project will provide an important model for future exascale computing, increasing the coherence between the technology base…

Sight is an exploratory visualization tool for large scale datasets supporting manycore and multicore advanced shading, remote and…

Constellation is a digital object identifier (DOI) based science network for supercomputing data. Constellation makes it possible for…

CADES is an ORNL facility to support R&D staff’s scalable computing and data analytics needs—making a research computing…

[From the ALICE website: http://aliceinfo.cern.ch/Public/Welcome.html] ALICE is the acronym for A Large Ion Collider Experiment, one of the…

The "Programming with Big Data in R" project (pbdR) is a set of highly scalable R packages for…

The Grid Architecture project objectives are to provide a set of architectural depictions, tools, and skills to the…

The Advanced Data and Workflow group brings a holistic view to scalable services that span the data lifecycle…

The Scientific Data Management System (SDMS) is a cross-facility data storage, indexing, collaboration, and provisioning system for data…