PyTorch SymmetricMemory: Harnessing NVLink Programmability with Ease

I think they’re referring to atomicCAS based barrier update.