Inductor CUDA Backend

jeromeku · November 30, 2023, 2:54am

Is there any documentation on the different CUDA kernel backends that inductor selects from (cutlass, triton, etc.)?

More specifically, for each backend would like to understand how modules / layers / ops are mapped to concrete kernel implementations. E.g., for triton, the autotuning / heuristics selection process, jit compilation, and stitching of the generated kernel back into the graph. For cutlass, the heuristics used for templated kernel gen.

Thanks!

sandyasm · April 4, 2024, 10:35am

Did you find any info on this?

Topic		Replies	Views
[tac] Follow up: Inductor HW backend implementation hardware-backends	7	889	November 16, 2024
Trying to understand flow for compilation compiler	1	333	March 7, 2024
Custom cuda extension support in Inductor compiler	8	677	March 7, 2024
No CPU backend in triton FX	4	620	January 20, 2025
How to Access Triton Kernels from TorchInductor when running on CPU? compiler	1	621	August 12, 2024

Inductor CUDA Backend

Related topics