PyTorch to Triton for Non-GPU Devices

fhossein-quic · April 24, 2024, 3:31pm

I want to generate Triton kernels from a PyTorch model when no CUDA device is available, i.e. get Triton code out of torch.compile(m, backend="inductor") when device="cpu". I would then like to obtain LLVM IR from these Triton kernels using a custom non-GPU Triton backend.
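For concreteness, here is a minimal sketch of the setup I mean. The helper names (compile_for_cpu, demo) are made up for illustration. As far as I can tell, Inductor emits C++/OpenMP code for CPU by default rather than Triton, so the sketch only shows how to compile on CPU and log whatever source Inductor generates. The cpu_backend switch is an assumption: it seems to exist in the Inductor config of recent PyTorch builds and would need a CPU-capable Triton installed, so it is opt-in and guarded here.

```python
import os

# Ask Inductor to log the source code it generates. Set before torch is
# imported so the logging setup sees it.
os.environ.setdefault("TORCH_LOGS", "output_code")


def compile_for_cpu(model, use_triton_cpu=False):
    """Compile `model` with the Inductor backend on CPU (hypothetical helper)."""
    import torch  # imported lazily, after TORCH_LOGS is set
    import torch._inductor.config as inductor_config

    if use_triton_cpu:
        # Assumption: this option exists only in recent PyTorch builds and
        # requires a Triton build with a CPU backend; otherwise it is skipped.
        if hasattr(inductor_config, "cpu_backend"):
            inductor_config.cpu_backend = "triton"

    return torch.compile(model.to("cpu"), backend="inductor")


def demo():
    """Compile a tiny model and run it once to trigger code generation."""
    import torch

    m = torch.nn.Sequential(torch.nn.Linear(8, 8), torch.nn.ReLU())
    compiled = compile_for_cpu(m)
    # The first call triggers compilation; the generated source is logged.
    return compiled(torch.randn(4, 8))
```

Calling demo() on a machine without CUDA should log the code Inductor generates for the CPU path, which is the point where I would like to see Triton kernels instead.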
