A simplified introduction to PyTorch's autograd implementation

Cool!
So one thing I’ve been wondering about is whether we should make this type of experiment easier by allowing programmatic access to derivatives.
The “easy” part might be exposing what is in derivatives.yaml / what is generated from it.
The hard part is what to do with the functions in ATen that have “tape-based differentiation”.
This would also enable us to have multiple implementations of autograd (autograd classic, autodiff, “hacker’s autograd”, i.e. in Python) use the same per-operator derivatives.
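
For the “easy” part, here is a minimal sketch of what programmatic access could look like today: just parsing the formula strings out of derivatives.yaml in a source checkout with PyYAML. The path, the entry layout, and the sin example are assumptions based on the current source tree, not a stable interface.

```python
# Rough sketch (not a PyTorch API): read the raw per-operator derivative
# formulas from tools/autograd/derivatives.yaml, the file the autograd
# codegen consumes. Path and entry layout are assumptions, not a contract.
import yaml

DERIVATIVES_YAML = "tools/autograd/derivatives.yaml"  # inside a PyTorch checkout

def load_derivative_formulas(path=DERIVATIVES_YAML):
    """Map each operator schema string to its derivative formula strings."""
    with open(path) as f:
        entries = yaml.safe_load(f)
    formulas = {}
    for entry in entries:
        name = entry.pop("name")
        # The remaining string-valued keys are argument names ("self",
        # "other", ...) mapped to C++ expressions such as "grad * self.cos()".
        formulas[name] = {k: v for k, v in entry.items() if isinstance(v, str)}
    return formulas

formulas = load_derivative_formulas()
print(formulas.get("sin(Tensor self) -> Tensor"))
# e.g. {'self': 'grad * self.cos()'}
```

That only surfaces the raw expression strings, of course; turning them into something callable from Python is where the codegen (and the tape-based ATen functions) come in.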

Best regards

Thomas