What's the difference between torch.export / torchserve / executorch / aotinductor?

suo · February 28, 2024, 5:55pm

We will not deprecate TorchScript without a suitable (and technically superior) replacement.

I think the key missing piece, which we are developing but have not yet released, is a generic interpreted runtime that uses libtorch to execute the graph in a target-independent way, optionally calling out to compiled artifacts for acceleration.

So the proposed TorchScript replacement flow would be:

On the frontend:
torch.export → compile subgraphs/whole graph with inductor → packaged model (graph, plus any compiled artifacts)

On the server:
Runtime loads the packaged model and executes it, appropriately selecting interpretation/compiled artifacts depending on the host environment.

Does that picture fit with what you would expect?

Topic		Replies	Views
PyTorch 2.x Inference Recommendations deployment	11	1207	November 3, 2024
TorchInductor: a PyTorch-native Compiler with Define-by-Run IR and Symbolic Shapes compiler	46	65420	July 29, 2024
What’s preventing PyTorch from being competitive with Llamafile? compiler	8	397	December 10, 2024
The future of C++ model deployment	7	2713	December 28, 2023
What is the correct, future-proof, way of deploying a pytorch python model in C++ for inference? deployment	12	344	February 25, 2025

What's the difference between torch.export / torchserve / executorch / aotinductor?

Related topics