Topic | Replies | Views | Activity
About the hardware-backends category | 0 | 948 | January 22, 2021
Custom kernels Intel XPU backend in LibTorch C++ API | 9 | 114 | February 21, 2025
OpenCL Backend - Important Updates | 16 | 6516 | February 15, 2025
Embrace tensor subclass as a Python device registration API | 2 | 127 | February 12, 2025
Question of using DeviceGuard regarding the exclusive use of a device | 0 | 47 | December 20, 2024
OpenGL interoperability | 1 | 204 | December 19, 2024
[tac] Follow up: Inductor HW backend implementation | 7 | 647 | November 16, 2024
[RFC] Adding Triton Backend for Aten operators | 0 | 357 | November 4, 2024
DTensor random RNG state support for non-CUDA backends | 2 | 124 | October 18, 2024
How profiling Pytorch Using Nsight Compute? | 2 | 298 | October 16, 2024
Prior art on implementing a "print" op on hardware | 0 | 37 | October 3, 2024
RFC Proposal for CUDA-Accelerated Dynamic Time Warping (DTW) Implementation in PyTorch | 2 | 336 | September 20, 2024
Why so many HW backend and nobody cooperate? | 6 | 385 | September 17, 2024
OpenCL backend dev - questions/support | 4 | 211 | August 29, 2024
Using Nsight Systems to profile GPU workload | 11 | 29295 | August 22, 2024
Backend Fallbacks | 1 | 3327 | August 22, 2024
Intel GPU Enabling Status and Feature Plan | 0 | 481 | August 16, 2024
find_package(Torch REQUIRED) fails - 2.3.1 and nightly | 10 | 554 | August 6, 2024
ROCm vs OpenCL/dlprimitives | 0 | 223 | August 5, 2024
What are these `_foreach_` operators and why there is no fallback? | 6 | 720 | June 25, 2024
Overlapping device to host copy with GPU collectives | 5 | 540 | June 4, 2024
MPS working group? | 2 | 339 | March 12, 2024
Implementing OpenCL backend for pytorch | 14 | 15855 | March 1, 2024
Implementing _copy_from_and_resize for OpenCL backend | 0 | 296 | November 21, 2023
Possible to use custom backend to create TensorImpl that allows custom datatype? | 0 | 325 | October 25, 2023
Custom TensorImpl and TorchDynamo | 1 | 510 | September 10, 2023
Weight sharing on cuda | 12 | 1969 | August 31, 2023
Unable to port with isfinite(isxxx) series functions | 1 | 463 | July 31, 2023
How to support heterogeneous memories | 0 | 505 | June 29, 2023
How to share CUcontext with other application? | 0 | 652 | June 12, 2023