About the hardware-backends category
|
|
0
|
946
|
January 22, 2021
|
Embrace tensor subclass as a Python device registration API
|
|
1
|
59
|
February 7, 2025
|
OpenCL Backend - Important Updates
|
|
13
|
6387
|
February 7, 2025
|
Custom kernels Intel XPU backend in LibTorch C++ API
|
|
8
|
91
|
February 6, 2025
|
Question of using DeviceGuard regarding the exclusive use of a device
|
|
0
|
43
|
December 20, 2024
|
OpenGL interoperability
|
|
1
|
185
|
December 19, 2024
|
[tac] Follow up: Inductor HW backend implementation
|
|
7
|
592
|
November 16, 2024
|
[RFC] Adding Triton Backend for Aten operators
|
|
0
|
326
|
November 4, 2024
|
DTensor random RNG state support for non-CUDA backends
|
|
2
|
118
|
October 18, 2024
|
How profiling Pytorch Using Nsight Compute?
|
|
2
|
267
|
October 16, 2024
|
Prior art on implementing a "print" op on hardware
|
|
0
|
36
|
October 3, 2024
|
RFC Proposal for CUDA-Accelerated Dynamic Time Warping (DTW) Implementation in PyTorch
|
|
2
|
317
|
September 20, 2024
|
Why so many HW backend and nobody cooperate?
|
|
6
|
369
|
September 17, 2024
|
OpenCL backend dev - questions/support
|
|
4
|
194
|
August 29, 2024
|
Using Nsight Systems to profile GPU workload
|
|
11
|
28944
|
August 22, 2024
|
Backend Fallbacks
|
|
1
|
3315
|
August 22, 2024
|
Intel GPU Enabling Status and Feature Plan
|
|
0
|
464
|
August 16, 2024
|
find_package(Torch REQUIRED) fails - 2.3.1 and nightly
|
|
10
|
530
|
August 6, 2024
|
ROCm vs OpenCL/dlprimitives
|
|
0
|
203
|
August 5, 2024
|
What are these `_foreach_` operators and why there is no fallback?
|
|
6
|
714
|
June 25, 2024
|
Overlapping device to host copy with GPU collectives
|
|
5
|
519
|
June 4, 2024
|
MPS working group?
|
|
2
|
330
|
March 12, 2024
|
Implementing OpenCL backend for pytorch
|
|
14
|
15783
|
March 1, 2024
|
Implementing _copy_from_and_resize for OpenCL backend
|
|
0
|
294
|
November 21, 2023
|
Possible to use custom backend to create TensorImpl that allows custom datatype?
|
|
0
|
318
|
October 25, 2023
|
Custom TensorImpl and TorchDynamo
|
|
1
|
499
|
September 10, 2023
|
Weight sharing on cuda
|
|
12
|
1947
|
August 31, 2023
|
Unable to port with isfinite(isxxx) series functions
|
|
1
|
461
|
July 31, 2023
|
How to support heterogeneous memories
|
|
0
|
503
|
June 29, 2023
|
How to share CUcontext with other application?
|
|
0
|
652
|
June 12, 2023
|