| Topic | Replies | Views | Activity |
|---|---|---|---|
| About the compiler category | 0 | 1478 | January 22, 2021 |
| Understanding CUDAGraph Trees | 6 | 1024 | April 15, 2025 |
| Empty tensor with SymInt size | 1 | 35 | April 15, 2025 |
| Questions about handling non-scalar kwargs in pattern matcher and kwargs capture in replacements | 0 | 31 | April 11, 2025 |
| Profiling torch.compile | 0 | 47 | April 10, 2025 |
| Inductor Triton Custom Op | 6 | 1281 | March 25, 2025 |
| How to prevent ops to be decomposed in custom compile backend | 0 | 55 | March 24, 2025 |
| AOT module decompositions | 1 | 322 | March 13, 2025 |
| A new strategy for automatic custom operators functionalization | 1 | 195 | February 8, 2025 |
| Generated AOTInductor C shim header files have unexpectedly changed | 1 | 52 | February 7, 2025 |
| Functionalization in PyTorch: Everything You Wanted To Know | 7 | 6210 | February 4, 2025 |
| No CPU backend in triton | 4 | 487 | January 20, 2025 |
| Inductor Passes | 0 | 198 | January 13, 2025 |
| Torch.compile support for Python 3.13 completed | 0 | 1026 | January 10, 2025 |
| What’s preventing PyTorch from being competitive with Llamafile? | 8 | 359 | December 10, 2024 |
| "Fused compiled autograd bwd + optimizer graph" - status update? | 4 | 300 | November 21, 2024 |
| FMAs (and softmax (and floating point)) considered harmful | 2 | 640 | November 21, 2024 |
| CUDAGraphs in Pytorch 2.0 | 6 | 4842 | November 20, 2024 |
| Where do the 2000+ PyTorch operators come from?: More than you wanted to know | 13 | 13706 | November 15, 2024 |
| Compiled Optimizer w/ LR Scheduler Now Supported | 3 | 477 | November 13, 2024 |
| Understanding dynamic shapes and guards and when it does/does not cause graph breaks | 1 | 229 | November 7, 2024 |
| What's the difference between `next_variable()` and `reconstruct()` in `IteratorVariable` | 2 | 41 | October 26, 2024 |
| Support for _set with other mutations in graph | 2 | 110 | October 18, 2024 |
| Is it possible to disable inlining of custom module for torch.compile? | 1 | 208 | October 11, 2024 |
| Impact of multithreading and local caching on torch.compile | 3 | 530 | September 27, 2024 |
| TorchInductor Update 9: Harden Vectorization Support and Enhance Loop Optimizations in TorchInductor CPP Backend | 0 | 424 | September 4, 2024 |
| TorchInductor Update 8: Max-autotune Support on CPU with GEMM Template | 0 | 386 | September 4, 2024 |
| PyTorch Runtime Error with Compiled Autograd | 1 | 231 | August 31, 2024 |
| Pytorch to Triton for Non-GPU Devices | 7 | 1201 | August 30, 2024 |
| Difference between the graph break reasons: `Dynamic control flow is not supported at the moment.` and `generic_jump TensorVariable()` | 0 | 271 | August 30, 2024 |