Summary of Issues Found in Adapting Models to use TorchScript

kevin.stephano · March 16, 2021, 5:18pm

We recently found a new issue that is also filed as Pytorch Issue 54040 where TorchScript’s Autodiff is not respecting the requires_grad option of a tensor when calculating gradients such that unused gradients are unnecessarily calculated. The summary was updated. This was seen, in particular, on the mask applied to multihead attention in NLP networks.

Topic		Replies	Views
Adapting Models to use TorchScript and Getting them to Produce Fusions compiler	15	6361	February 26, 2021
A simplified introduction to PyTorch's autograd implementation autodiff	6	1501	January 26, 2021
Highlighting a few recent autograd features (H2 2023) autodiff	0	680	January 5, 2024
First Contribution	1	293	April 26, 2024
TorchDynamo Update 6: Training support with AOTAutograd compiler	0	5655	March 29, 2022

Summary of Issues Found in Adapting Models to use TorchScript

Related topics