harrism/nsys_easy

nsys_easy

This script wraps the nsys command to simplify profiling with NVIDIA Nsight Systems. It lets you specify the trace, sample, and context-switch options, as well as the output and report names. The script runs the nsys profile command and then the nsys stats command on the result.

Usage:

Put nsys_easy somewhere in your $PATH.

nsys_easy [-t trace] [-s sample] [-c ctxsw] [-o output] [-r report] command

Example:

nsys_easy -t cuda,osrt -s none -c none -o nsys_easy -r cuda_gpu_kernel_sum ./my_program

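The above command profiles the my_program executable, tracing CUDA and OS runtime (osrt) calls with sampling and context-switch tracing set to none. It uses nsys_easy as the output name and runs the report named by -r.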

By default the script traces only CUDA API calls, but you can specify additional options to trace other events. The script runs the cuda_gpu_sum report by default, which combines kernel and CUDA memory copy statistics. The goal is for the script to behave like the nvprof command, but with the added flexibility of NVIDIA Nsight Systems.
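
As a rough illustration of the two-step flow described above, here is a minimal dry-run sketch. It is not the actual nsys_easy script: the function name nsys_easy_dry_run is hypothetical, and the defaults for sample, ctxsw, and output, along with the .nsys-rep file extension, are assumptions. Only the cuda trace default and the cuda_gpu_sum report default come from the description above. It prints the nsys profile and nsys stats commands rather than running them, so it works without Nsight Systems installed.

```shell
#!/usr/bin/env bash
# Hypothetical dry-run sketch of an nsys wrapper; NOT the real nsys_easy script.
# It only prints the two commands a wrapper like this might run.

nsys_easy_dry_run() {
    # Defaults: trace and report follow the README text;
    # sample, ctxsw, and output are assumptions made for this sketch.
    local trace="cuda" sample="none" ctxsw="none"
    local output="nsys_easy" report="cuda_gpu_sum"
    local opt OPTIND=1

    # Parse the same short options shown in the usage line.
    while getopts "t:s:c:o:r:" opt; do
        case "$opt" in
            t) trace="$OPTARG" ;;
            s) sample="$OPTARG" ;;
            c) ctxsw="$OPTARG" ;;
            o) output="$OPTARG" ;;
            r) report="$OPTARG" ;;
            *) echo "usage: nsys_easy_dry_run [-t trace] [-s sample] [-c ctxsw] [-o output] [-r report] command" >&2
               return 1 ;;
        esac
    done
    shift $((OPTIND - 1))

    # Everything left over is the command to profile.
    if [ "$#" -eq 0 ]; then
        echo "error: no command given" >&2
        return 1
    fi

    # Step 1: collect a profile of the command.
    echo "nsys profile --trace=$trace --sample=$sample --cpuctxsw=$ctxsw --output=$output $*"
    # Step 2: summarize the profile (assumes the .nsys-rep extension).
    echo "nsys stats --report=$report $output.nsys-rep"
}
```

Calling `nsys_easy_dry_run -t cuda,osrt -o myrun ./my_program` prints one nsys profile line followed by one nsys stats line. Printing instead of executing keeps the sketch safe to try, and the printed commands can be pasted into a shell once nsys is available.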

Prerequisites

For full nsys command-line options, see the Nsight Systems User Guide.

Easier, quicker command-line CUDA profiling
