About the hardware-backends category
|
|
0
|
379
|
January 22, 2021
|
Is there a place for storing custom data within PyTorch Tensor
|
|
7
|
129
|
March 10, 2023
|
Set of ops for a backend to register
|
|
10
|
271
|
March 10, 2023
|
A small MPS debugging story
|
|
3
|
375
|
March 9, 2023
|
Why is mps_copy_ not registered to copy_stub
|
|
0
|
31
|
March 8, 2023
|
Can backend compilers modify guards for torch Dynamo frame
|
|
4
|
103
|
February 28, 2023
|
Keeping PyTorch's Ops Maintainable: The Jiterator
|
|
7
|
674
|
February 27, 2023
|
Using Nsight Systems to profile GPU workload
|
|
7
|
7584
|
February 24, 2023
|
Private use opencl device
|
|
7
|
283
|
November 11, 2022
|
OpenCL Backend - Important Updates
|
|
2
|
392
|
November 8, 2022
|
How to keep up with updates of operator signatures
|
|
6
|
196
|
November 4, 2022
|
What is the reason for using old ABI on Linux?
|
|
3
|
177
|
October 30, 2022
|
Weight sharing on cuda
|
|
10
|
437
|
August 3, 2022
|
Lazy Tensor Core
|
|
20
|
4044
|
July 12, 2022
|
Memory operations on a custom backend
|
|
4
|
253
|
July 5, 2022
|
PyTorch and TensorFloat32
|
|
5
|
3501
|
June 28, 2022
|
More In-Depth Details of Floating Point Precision
|
|
0
|
683
|
June 21, 2022
|
Implementing OpenCL backend for pytorch
|
|
10
|
7005
|
January 8, 2022
|
Automatic out-of-tree backend loading
|
|
1
|
296
|
December 13, 2021
|
Debugging back-propogation results
|
|
4
|
391
|
November 18, 2021
|
OpenCL Backend: Broadcast/Reduce Ops
|
|
3
|
508
|
November 5, 2021
|
Aten::abs_out called with an undefined 'out' tensor
|
|
1
|
502
|
November 4, 2021
|
Aten::mul.out receives tensors from mixed devices
|
|
2
|
383
|
October 16, 2021
|
Asynchronous Execution and Memory Management
|
|
3
|
879
|
October 11, 2021
|
Slides from Structured Kernel presentation
|
|
4
|
839
|
April 8, 2021
|
Backend Fallbacks
|
|
0
|
1335
|
April 7, 2021
|