2017 OLCF User Call: R and pbdR
The OLCF hosts monthly User Conference Call. These calls are your opportunity to speak with center personnel to get the latest updates, express any concerns you may have, etc. No registration is required for this event.
Monthly Topic: Scaling Analytics with R and pbdR
Speaker: George Ostrouchov, Scientific Data Group
Abstract:
The pbdR project enables native and low overhead use of HPC scalable libraries in R language data analysis workflows. Distributed parallel dense linear algebra, which is essential in complex data analysis, is available without changes to serial R code. Data distribution and redistribution functions enable in-memory distributed data wrangling for workflow optimization. Building of custom analytics is further facilitated by high-level use of MPI collectives, where most rank wrangling is automated without user involvement, and by the availability of profiling facilities that range from PAPI library hardware counter information to instrumented MPI communications. Recent additions include a client-server interface for interactive SPMD style parallel code development. We will overview pbdR on this call.
Slides: OLCF_R+pbdR
Demo Files may be downloaded from: /lustre/atlas1/gen011/world-shared/pbdR
Video: Coming Soon
OLCF News slides: OLCF_news_6_28