I mean: how do I determine which intermediate variables to save for the most efficient backward pass? I'm still trying to understand the post "Min-cut optimal(*) recomputation (i.e. activation checkpointing) with AOTAutograd" (#9, by Chillee).