I mean: how do you determine which intermediate variables to save so that the backward pass is as efficient as possible? I'm still trying to understand the post "Min-cut optimal(*) recomputation (i.e. activation checkpointing) with AOTAutograd" (#9, by Chillee).