perftools Overview

The following information is available by running module help perftools
Perftools 5.3.2
Release Date: April 19, 2012

On Cray XK systems:
Perftools 5.3.2 and Papi must be used as a set with the CLE 4.0.UP03
operating system and requires Cudatoolkit 4.1.28. Perftools 5.3.2 is NOT
supported on CLE 4.0 UP02 on XK systems. Swapping Perftools
or Papi with an older version is not supported and will cause problems.
Using Cuda 4.0.17a and Cuda tools 4.0.17a is not supported and will cause
problems. Cuda 4.0.17a and Cuda tools 4.0.17a are replaced by
Cudatoolkit 4.1.28 which must be installed as default and used with
Perftools 5.3.2.

Support for Cray XK systems running CLE 4.0 UP03 and with CudaToolkit 4.1.28
installed. Perftools 5.3.2 is not compatible with Cuda 4.0.17a and Cuda Tools
4.0.17a or earlier versions. This version contains no other bug fixes or

Support for CUDA 4.0 is deprecated.  To collect accelerator performance
statistics the cudatoolkit module must be loaded.

New accelerator hardware counter groups starting with Nvidia device driver
version 285.05.xx are available.  See the accpc man page for more information on
the new counter groups.  To determine the device driver version run the
following on a GPU compute node.  This command will not work on a non-GPU
compute node.

  aprun -n 1 cat /proc/driver/nvidia/version

  Bugs Fixed:
  783476 - Nvidia device driver 285.05.36 accelerator hardware counter name
           change breaks perftools ACCPC groups

  Known Problems:
  783852 -  GPU codes using multiple OpenMP threads hang intermittently when
                ACCPC's are collected
	    For OpenMP codes that use the GPU with OpenAcc, CUDA, or PGI, the
            number of threads must be set to one when collecting accelerator
            hardware counters. If the number of threads is not set to one the
            application may hang or raise an error similar to the following:
                "Error 14 for CUPTI API function 'cuptiEventGroupEnable'.
                cuptiQuery failed"

  778771 - Data transfers with host time, but zero data transferred
  774412 - app2/5.2.2:  HWPC overview tab comes up blank
  777133 - (GCC_Bug 50497) pat_build produces no user function
               traces for gnu-4.6 compiled code
           As of GCC 4.6.x the .debug_pubnames section is no longer produced.
           CrayPat  relies on this section to gather global entry points that
           are eligible for tracing. To work around this issue until an
           alternative is provide by the performance tools, users can use GCC
           4.5.X or recompile those source files whose entry points are desired
           to be traced with the -finstrument-functions option, then instrument
            with pat_build -w and any other -g options.

Product and OS Dependencies:
Perftools/5.3.2 is supported on Cray XE systems running the
CLE 3.1UP00 CNL or later operating systems and on Cray XK systems
running CLE 4.0UP03 and later and the following product(s).
Update UP03 with Cuda Driver version 4.1 is required.

  xt-asyncpe 5.09 or later
  One or more compilers:
      CCE 8.0.0 or later
      GCC 4.5.X
      Intel 12.0 or later
      PGI 10.9 or later
  Windows 7 (for the desktop version of Cray Apprentice2)
  CUDATOOLKIT 4.1.28 (for Cray XK systems)

  Notes: - Not supported with Cuda 4.0.17a and Cuda Tools 4.0.17a or earlier
         - Seastar is not supported.
         - GCC 4.6.X is not supported.

See the following documents at
Cray Performance Analysis Tools 5.3 Release Overview and
    Installation Guide S-2474-53
Using Cray Performance Analysis Tools S-2376-52

Installation of PAPI:
rpm -ivh cray-papi-

Cray XK systems with GPUs also need::
rpm -ihv cray-papi-acc-

To make PAPI the default version of PAPI, execute:

Installation of perftools:
rpm -ivh perftools-clients-5.3.2-1.x86_64.rpm
rpm -ivh perftools-5.3.2-1.x86_64.rpm

To make perftools/5.3.2 the default version of perftools, execute:

Installation of app2 remote client on Linux desktops/laptops:
tar -xvzf perftools-remote-clients-5.3.2.tar.gz
cd perftools-remote-clients-5.3.2
  Type 'yes' to agree to software license prior to rpm install.
  If you're using modules software, load the perftools module to
  access the software.
module load perftools

Installation of app2 remote client (+ server) on Windows 7 systems:

The Cray Apprentice2 installer for Windows is included in the
perftools/5.3.1 package.  Download the Cray Apprentice2 installer
onto a desktop or laptop running Windows 7.  Double click on installer
to begin installation.  The installer will walk you through the process
for your system.

Bug fixes and support for perftools 5.3.2

Visit the PAPI Reference pages for more information at:
And visit the PAPI website for the latest updates:

To re-display perftools/5.3.2 release information,
type:    less /opt/cray/perftools/5.3.2/release_info



  • 6.3.0
  • 6.3.1
  • 6.4.0
  • 6.4.5
  • 6.5.1


  • 6.3.0
  • 6.3.1
  • 6.3.2
  • 6.4.0
  • 6.4.2
  • 6.4.4
  • 6.4.5
  • 6.5.0