| Topic | Replies | Views | Date |
|---|---|---|---|
| TorchDynamo Update 11: Making FSDP and Dynamo Work Together | 0 | 1447 | February 8, 2023 |
| TorchDynamo Update 3: GPU Inference Edition | 12 | 3713 | February 2, 2023 |
| TorchDynamo Update 9: Making DDP Work with TorchDynamo | 7 | 4656 | January 16, 2023 |
| Tracing with Primitives: Update 2 | 4 | 4129 | January 13, 2023 |
| PyTorch/XLA 2022 Q4 Dev update | 0 | 1269 | January 5, 2023 |
| TorchDynamo Update 10: Integrating with PyTorch/XLA for Inference and Training | 8 | 1873 | December 22, 2022 |
| PyTorch 2.0 Manifesto and Architecture docs | 4 | 780 | December 11, 2022 |
| The nuances of PyTorch Graph Capture | 9 | 5625 | December 9, 2022 |
| TorchDynamo Update 7: Inference with FX2TRT | 2 | 1841 | December 8, 2022 |
| TorchInductor Update 4: CPU backend started to show promising performance boost | 1 | 995 | November 25, 2022 |
| Dynamo/FX: patching a function to add more outputs not working | 2 | 247 | November 21, 2022 |
| Where we are headed and why it looks a lot like Julia (but not exactly like Julia) | 8 | 30852 | November 14, 2022 |
| Skipping Dispatcher with LazyTensor | 10 | 836 | October 19, 2022 |
| TorchInductor Update 3: E2E model training with TorchDynamo + Inductor gets 1.67x/2.1x speedup | 3 | 1417 | October 11, 2022 |
| Reducing Framework Overhead with Static Runtime | 4 | 1260 | October 4, 2022 |
| Tracing with Primitives: Update 0 | 17 | 4026 | September 26, 2022 |
| NNC walkthrough: how PyTorch ops get fused | 10 | 3979 | November 3, 2021 |
| TorchDynamo Update 8: TorchDynamo passed correctness check on 7k+ github models | 7 | 3027 | July 1, 2022 |
| Python Operator Authoring w/ NNC | 5 | 1479 | June 7, 2022 |
| Where do the 2000+ PyTorch operators come from?: More than you wanted to know | 12 | 6011 | May 12, 2022 |
| Tracing with Primitives: Update 1, nvFuser and its Primitives | 0 | 2796 | April 25, 2022 |
| TorchDynamo Update 5: Improved Capture & Bigger Graphs | 4 | 2015 | April 14, 2022 |
| TorchDynamo Update 6: Training support with AOTAutograd | 0 | 2659 | March 29, 2022 |
| Universal binaries for libtorch Mac? | 10 | 2305 | March 14, 2022 |
| TorchDynamo Update 4: LazyTensor & nvFuser Experiments | 2 | 2446 | February 25, 2022 |
| Prim::autocast_promote operation? | 4 | 752 | December 31, 2021 |
| TorchDynamo Update: 1.48x geomean speedup on TorchBench CPU Inference | 0 | 3044 | November 12, 2021 |
| Next Steps for PyTorch Compilers | 9 | 6162 | October 21, 2021 |
| Loop_tool's lazy frontend - Experimenting with symbolic laziness | 0 | 425 | October 11, 2021 |
| torch-MLIR presentation from Google / nod.ai | 1 | 636 | October 8, 2021 |