Using Nsight Systems to profile GPU workload

You can use --capture-range-end=stop instead.