Wrapper for CUDA profiler start/stop API functions. Zero dependencies.
Example:
import cudaprofile
cudaprofile.start()
# ... do expensive cuda stuff ...
cudaprofile.stop()
Then run the script under nvprof or nvvp.
To profile only a region of interest, run nvprof with --profile-from-start off and call start() only where profiling should begin.
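If the profiled region can raise an exception, it may help to pair start() and stop() in a context manager so that stop() always runs. The following is a minimal sketch, not part of this module: the name profiled_region is hypothetical, and it takes the start and stop callables as arguments so it does not assume anything about the wrapper beyond the two functions shown above.

```python
from contextlib import contextmanager


@contextmanager
def profiled_region(start, stop):
    """Call start() on entry and stop() on exit, even if the body raises.

    `profiled_region` is a hypothetical helper; `start` and `stop` are any
    zero-argument callables, e.g. cudaprofile.start and cudaprofile.stop.
    """
    start()
    try:
        yield
    finally:
        # Runs on normal exit and on exceptions alike.
        stop()


# Intended usage (requires the cudaprofile module and a CUDA device):
#
#   import cudaprofile
#   with profiled_region(cudaprofile.start, cudaprofile.stop):
#       ...  # do expensive cuda stuff
```

A script written this way would typically be launched as nvprof --profile-from-start off python my_script.py, where my_script.py is a placeholder name.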