About the hardware-backends category
|
|
0
|
966
|
January 22, 2021
|
OpenCL Backend - Important Updates
|
|
18
|
7161
|
May 23, 2025
|
Using Nsight Systems to profile GPU workload
|
|
12
|
32333
|
April 30, 2025
|
Intel GPU & CPU Enabling Status and Feature Plan – 2025 H1 Update
|
|
1
|
372
|
April 16, 2025
|
Possible to use custom backend to create TensorImpl that allows custom datatype?
|
|
2
|
401
|
April 11, 2025
|
Embrace tensor subclass as a Python device registration API
|
|
5
|
402
|
March 28, 2025
|
Does PyTorch support RTX 5090?
|
|
1
|
1474
|
March 14, 2025
|
Custom kernels Intel XPU backend in LibTorch C++ API
|
|
13
|
278
|
March 2, 2025
|
Question of using DeviceGuard regarding the exclusive use of a device
|
|
0
|
68
|
December 20, 2024
|
OpenGL interoperability
|
|
1
|
352
|
December 19, 2024
|
[tac] Follow up: Inductor HW backend implementation
|
|
7
|
869
|
November 16, 2024
|
[RFC] Adding Triton Backend for Aten operators
|
|
0
|
501
|
November 4, 2024
|
DTensor random RNG state support for non-CUDA backends
|
|
2
|
160
|
October 18, 2024
|
How profiling Pytorch Using Nsight Compute?
|
|
2
|
465
|
October 16, 2024
|
Prior art on implementing a "print" op on hardware
|
|
0
|
42
|
October 3, 2024
|
RFC Proposal for CUDA-Accelerated Dynamic Time Warping (DTW) Implementation in PyTorch
|
|
2
|
458
|
September 20, 2024
|
Why so many HW backend and nobody cooperate?
|
|
6
|
513
|
September 17, 2024
|
OpenCL backend dev - questions/support
|
|
4
|
305
|
August 29, 2024
|
Backend Fallbacks
|
|
1
|
3458
|
August 22, 2024
|
Intel GPU Enabling Status and Feature Plan
|
|
0
|
592
|
August 16, 2024
|
find_package(Torch REQUIRED) fails - 2.3.1 and nightly
|
|
10
|
756
|
August 6, 2024
|
ROCm vs OpenCL/dlprimitives
|
|
0
|
330
|
August 5, 2024
|
What are these `_foreach_` operators and why there is no fallback?
|
|
6
|
854
|
June 25, 2024
|
Overlapping device to host copy with GPU collectives
|
|
5
|
652
|
June 4, 2024
|
MPS working group?
|
|
2
|
388
|
March 12, 2024
|
Implementing OpenCL backend for pytorch
|
|
14
|
16320
|
March 1, 2024
|
Implementing _copy_from_and_resize for OpenCL backend
|
|
0
|
311
|
November 21, 2023
|
Custom TensorImpl and TorchDynamo
|
|
1
|
561
|
September 10, 2023
|
Weight sharing on cuda
|
|
12
|
2041
|
August 31, 2023
|
Unable to port with isfinite(isxxx) series functions
|
|
1
|
473
|
July 31, 2023
|