October 2019 OLCF User Conference Call-Distributed Deep Learning on Summit
The OLCF hosts monthly User Conference Calls. These calls are your opportunity to speak with center personnel to get the latest updates, express any concerns you may have, etc. No registration is required for this event.
Monthly Topic: Distributed Deep Learning on Summit
Speakers: Dr. Brad Nemanich and Dr. Bryant Nelson
Abstract: IBM, a leader in AI technologies, has partnered with ORNL to provide easy to use Machine and Deep learning packages specifically tailored for our Summit environment. TensorFlow, PyTorch and Caffe can all now be accessible on Summit with the capability to distribute the work to over 6000 GPUs. On this call, IBM will discuss the details on how to access the provided deep learning frameworks and discuss issues to consider when distributing a deep learning training job. This will be followed by a short end-to-end demonstration of running a distributed deep learning job on Summit.