Hi all,
I’ve been working on creating a disaggregated backend for PyTorch based on the open registration example, trying to reuse the existing CUDA kernel implementations so that they are instead dispatched via the disaggregated OS. The main issue I’ve run into concerns the registration of kernels such as elu or logaddexp, which are built as structured stubs through a macro. I’m hoping to work out how I can register those operations with the custom backend while still using REGISTER_DISPATCH with the stub and the appropriate dispatch key.
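For context, this is roughly how that stub is set up inside ATen for logaddexp (paraphrased from ATen/native/BinaryOps.h and BinaryOps.cpp, so the exact signatures may differ across versions):

    // ATen/native/BinaryOps.h (approximate)
    using structured_binary_fn = void (*)(at::TensorIteratorBase&);
    DECLARE_DISPATCH(structured_binary_fn, logaddexp_stub);

    // ATen/native/BinaryOps.cpp (approximate)
    DEFINE_DISPATCH(logaddexp_stub);

    // The structured implementation forwards to whichever kernel is registered
    // on the stub for the current device type.
    TORCH_IMPL_FUNC(logaddexp_out)(const Tensor& self, const Tensor& other, const Tensor& result) {
      logaddexp_stub(device_type(), *this);
    }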
Some quick code snippets. The stub registration:

    REGISTER_DISPATCH(logaddexp_stub, &logaddexp_kernel_pu1);
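where logaddexp_kernel_pu1 is my own kernel, written to match the function-pointer type the stub expects. A minimal sketch of its shape (body elided; the real version hands the work described by the TensorIterator off to the disaggregated device):

    // Sketch only: must match structured_binary_fn, i.e. take a TensorIteratorBase&.
    void logaddexp_kernel_pu1(at::TensorIteratorBase& iter) {
      // ... forward the iteration described by `iter` to the disaggregated device ...
    }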
Faulty code:

    at::Tensor& logaddexp_out(const at::Tensor& self, const at::Tensor& other,
                              at::Tensor& out) {
      return at::native::logaddexp_out(self, other, out);
    }
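For completeness, the wrapper itself is wired up to the custom backend in the style of the open registration example, roughly like this (a sketch; the overload name is aten’s, the wrapper and key choice are mine):

    #include <torch/library.h>

    // Sketch: route aten::logaddexp.out to my wrapper for the PrivateUse1 dispatch key.
    TORCH_LIBRARY_IMPL(aten, PrivateUse1, m) {
      m.impl("logaddexp.out", &logaddexp_out);
    }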