Partial graph allocation for accelerators

jhkim · March 9, 2023, 12:50pm

Hello, I have a question about integrating custom backend compilers into PyTorch.

Suppose I have an accelerator, and my compiler for this accelerator only supports a limited number of operations, such as convolutions and ReLUs.

Is it possible to allocate partial FX graphs that only include convolutions and ReLUs to my backend compiler, while allocating the remaining partial FX graphs to other compilers like TorchInductor for CPU execution?

SherlockNoMad · March 10, 2023, 8:13am

Here’s an example of partial graph delegation.

jhkim · March 12, 2023, 4:02am

This is what I wanted to know. Thank you so much!

SherlockNoMad · March 13, 2023, 4:43pm

You can also find example usage in the following backends:

nvFuser
- Backend bridge: pytorch/torch/fx/passes/backends/nvfuser.py
onnxruntime
- Backend bridge: onnxruntime/orttraining/orttraining/python/training/torchdynamo/ort_backend.py
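
For readers who want a feel for the pattern before opening those files, here is a minimal sketch of capability-based partitioning using `CapabilityBasedPartitioner` and `OperatorSupport` from `torch.fx.passes`. It is not taken from either backend bridge above. The toy `Net` module, the `ConvReluSupport` class, the `SUPPORTED_TARGETS` set, and the `partition_for_accelerator` helper are all hypothetical names chosen for illustration, and the sketch assumes the graph contains torch-level `call_function` nodes (as produced here by `symbolic_trace`).

```python
import torch
import torch.nn.functional as F
from torch.fx import symbolic_trace
from torch.fx.passes.infra.partitioner import CapabilityBasedPartitioner
from torch.fx.passes.operator_support import OperatorSupport

# Assumption: the accelerator compiler only understands these ops.
SUPPORTED_TARGETS = {torch.conv2d, F.conv2d, torch.relu, F.relu}


class ConvReluSupport(OperatorSupport):
    """Tells the partitioner which nodes the (hypothetical) accelerator can run."""

    def is_node_supported(self, submodules, node):
        return node.op == "call_function" and node.target in SUPPORTED_TARGETS


class Net(torch.nn.Module):
    """Toy model: conv + relu (supported) followed by sigmoid (unsupported)."""

    def __init__(self):
        super().__init__()
        self.weight = torch.nn.Parameter(torch.randn(4, 3, 3, 3))

    def forward(self, x):
        y = torch.relu(F.conv2d(x, self.weight, padding=1))
        return torch.sigmoid(y)


def partition_for_accelerator(gm):
    """Group supported nodes into fused_* submodules; other nodes stay in place."""
    partitioner = CapabilityBasedPartitioner(
        gm, ConvReluSupport(), allows_single_node_partition=False
    )
    # Note: this rewrites gm in place and returns it.
    return partitioner.partition_and_fuse()


model = Net()
fused = partition_for_accelerator(symbolic_trace(model))

# Each call_module node named fused_* is a subgraph that could be handed to
# the accelerator compiler; the remaining nodes can go to another compiler.
fused_targets = [n.target for n in fused.graph.nodes if n.op == "call_module"]
print(fused_targets)
print(fused.graph)
```

Inside a `torch.compile` backend, the same helper could be applied to the `GraphModule` that the backend receives, after which each `fused_*` submodule would be replaced by the output of your own compiler. That last step is backend-specific and is left out of this sketch.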
