Custom C++ External Kernel for TorchInductor

fhossein-quic · June 5, 2024, 3:15am

Is there a way to use a custom C++ lowering of a torch op (which is not part of aten) as an external kernel in TorchInductor?

In TorchInductor, we can choose to lower a torch op either with external kernels or triton kernels. However, the external kernels are bind to an aten implementation. I want to know if there is a way to have an external C++ implementation of an op and use them as an external kernel?

jansel · June 5, 2024, 5:59pm

cc @zou3519

You need to wrap C++ function in a custom op. See:
PyTorch Custom Operators Landing Page — PyTorch main documentation

jgong5 · June 10, 2024, 10:29am

After you wrap the C++ function with the custom op, you might also need to register function on “meta” key (either via c++ or easier via python) for shape propagation. Here is an example how we do that in ipex. FYI:
C++ function for rmsnorm: intel-extension-for-pytorch/csrc/cpu/aten/RMSNorm.cpp at 4027749462a5bb5ece1bcf89fdae463e883e3934 · intel/intel-extension-for-pytorch · GitHub
Meta: intel-extension-for-pytorch/intel_extension_for_pytorch/_meta_registrations.py at 4027749462a5bb5ece1bcf89fdae463e883e3934 · intel/intel-extension-for-pytorch · GitHub

Topic		Replies	Views
Inductor Triton Custom Op compiler	6	1429	March 25, 2025
[RFC] Adding Triton Backend for Aten operators hardware-backends	0	508	November 4, 2024
Memory operations on a custom backend hardware-backends	4	1161	July 5, 2022
The future of C++ model deployment	7	2722	December 28, 2023
[RFC] New Python operator registration API	10	1261	January 31, 2024

Custom C++ External Kernel for TorchInductor

Related topics