|
About the hardware-backends category
|
|
0
|
998
|
January 22, 2021
|
|
CuSolver DnXgeev faster CUDA Eigenvalue calculations
|
|
9
|
177
|
November 4, 2025
|
|
Will the Metal4 update bring significant optimizations for future pytorch mps performance and compatibility?
|
|
1
|
160
|
October 12, 2025
|
|
OpenGL interoperability
|
|
2
|
572
|
September 30, 2025
|
|
Complimentary access to AMD Developer Cloud
|
|
1
|
159
|
July 20, 2025
|
|
OpenCL Backend - Important Updates
|
|
18
|
7704
|
May 23, 2025
|
|
Using Nsight Systems to profile GPU workload
|
|
12
|
35205
|
April 30, 2025
|
|
Intel GPU & CPU Enabling Status and Feature Plan – 2025 H1 Update
|
|
1
|
738
|
April 16, 2025
|
|
Possible to use custom backend to create TensorImpl that allows custom datatype?
|
|
2
|
474
|
April 11, 2025
|
|
Embrace tensor subclass as a Python device registration API
|
|
5
|
599
|
March 28, 2025
|
|
Does PyTorch support RTX 5090?
|
|
1
|
1934
|
March 14, 2025
|
|
Custom kernels Intel XPU backend in LibTorch C++ API
|
|
13
|
528
|
March 2, 2025
|
|
Question of using DeviceGuard regarding the exclusive use of a device
|
|
0
|
117
|
December 20, 2024
|
|
[tac] Follow up: Inductor HW backend implementation
|
|
7
|
1120
|
November 16, 2024
|
|
[RFC] Adding Triton Backend for Aten operators
|
|
0
|
665
|
November 4, 2024
|
|
DTensor random RNG state support for non-CUDA backends
|
|
2
|
217
|
October 18, 2024
|
|
How profiling Pytorch Using Nsight Compute?
|
|
2
|
652
|
October 16, 2024
|
|
Prior art on implementing a "print" op on hardware
|
|
0
|
60
|
October 3, 2024
|
|
RFC Proposal for CUDA-Accelerated Dynamic Time Warping (DTW) Implementation in PyTorch
|
|
2
|
583
|
September 20, 2024
|
|
Why so many HW backend and nobody cooperate?
|
|
6
|
700
|
September 17, 2024
|
|
OpenCL backend dev - questions/support
|
|
4
|
431
|
August 29, 2024
|
|
Backend Fallbacks
|
|
1
|
3597
|
August 22, 2024
|
|
Intel GPU Enabling Status and Feature Plan
|
|
0
|
703
|
August 16, 2024
|
|
find_package(Torch REQUIRED) fails - 2.3.1 and nightly
|
|
10
|
968
|
August 6, 2024
|
|
ROCm vs OpenCL/dlprimitives
|
|
0
|
430
|
August 5, 2024
|
|
What are these `_foreach_` operators and why there is no fallback?
|
|
6
|
1099
|
June 25, 2024
|
|
Overlapping device to host copy with GPU collectives
|
|
5
|
779
|
June 4, 2024
|
|
MPS working group?
|
|
2
|
451
|
March 12, 2024
|
|
Implementing OpenCL backend for pytorch
|
|
14
|
16719
|
March 1, 2024
|
|
Implementing _copy_from_and_resize for OpenCL backend
|
|
0
|
355
|
November 21, 2023
|