Hey!
Have you had a chance to take a look at this great post on the topic: Using Nsight Systems to profile GPU workload?