PyTorch/XLA 2.3 dev update
|
|
0
|
470
|
May 29, 2024
|
User-defined Kernels vs. `torch.library` custom op
|
|
1
|
212
|
May 22, 2024
|
Compiled Optimizer w/ LR Scheduler Now Supported
|
|
0
|
196
|
May 22, 2024
|
Different points at which fusion occurs?
|
|
8
|
163
|
May 21, 2024
|
PyTorch Sparse(GNN) Compiler RFC
|
|
28
|
1747
|
May 21, 2024
|
How to replace a callable with an opaque python callable in an Inductor pass?
|
|
6
|
202
|
May 15, 2024
|
How To Bring Compile Time Down to Zero: Our Plans and Direction (May 14th Edition)
|
|
0
|
859
|
May 15, 2024
|
When does the inductor code run?
|
|
5
|
250
|
May 15, 2024
|
What is the mental model for torch.compile to deal with stateful objects in general?
|
|
3
|
172
|
May 13, 2024
|
How can I dump the prims IR, triton code, and ptx code when using torch.compile()
|
|
2
|
544
|
May 6, 2024
|
Torch.compile support for Python 3.12 completed
|
|
0
|
1615
|
May 3, 2024
|
Performance Comparison between Torch.Compile and APEX optimizers
|
|
1
|
1132
|
May 1, 2024
|
AOT module decompositions
|
|
0
|
166
|
April 25, 2024
|
Understanding CUDAGraph Trees
|
|
4
|
694
|
April 11, 2024
|
State of PT2 OSS Issues: Q1 2024
|
|
0
|
212
|
April 10, 2024
|
Inductor CUDA Backend
|
|
1
|
497
|
April 4, 2024
|
TorchDynamo Update 9: Making DDP Work with TorchDynamo
|
|
8
|
12063
|
November 27, 2023
|
Difference torch dynamo and torch script
|
|
0
|
345
|
March 27, 2024
|
A TorchDynamo trace time ablation study
|
|
0
|
469
|
March 22, 2024
|
FMAs (and softmax (and floating point)) considered harmful
|
|
0
|
383
|
March 20, 2024
|
Supporting mutations in torch.export.export
|
|
2
|
809
|
March 20, 2024
|
Meaning of strict=False in torch.export.export
|
|
0
|
173
|
March 20, 2024
|
Export sub-graphs at the Aten IR level
|
|
2
|
273
|
March 14, 2024
|
PyTorch 2.0 User Empathy Day Recap
|
|
2
|
417
|
March 13, 2024
|
Torch.compile() + FSDP - Dec 8th
|
|
2
|
2369
|
March 8, 2024
|
TorchInductor Update 7: key optimizations with CPU backend in PyTorch 2.2 release
|
|
4
|
640
|
March 8, 2024
|
Custom cuda extension support in Inductor
|
|
8
|
439
|
March 7, 2024
|
Trying to understand flow for compilation
|
|
1
|
267
|
March 7, 2024
|
Is there a plan for FX in a C++ IR?
|
|
2
|
476
|
February 28, 2024
|
PyTorch/XLA 2.2 Release Dev Update
|
|
0
|
786
|
February 23, 2024
|