Topic | Replies | Views | Activity
About the hardware-backends category | 0 | 959 | January 22, 2021
Using Nsight Systems to profile GPU workload | 12 | 31113 | April 30, 2025
Intel GPU & CPU Enabling Status and Feature Plan – 2025 H1 Update | 1 | 210 | April 16, 2025
Possible to use custom backend to create TensorImpl that allows custom datatype? | 2 | 377 | April 11, 2025
Embrace tensor subclass as a Python device registration API | 5 | 316 | March 28, 2025
Does PyTorch support RTX 5090? | 1 | 796 | March 14, 2025
Custom kernels Intel XPU backend in LibTorch C++ API | 13 | 240 | March 2, 2025
OpenCL Backend - Important Updates | 16 | 6894 | February 15, 2025
Question of using DeviceGuard regarding the exclusive use of a device | 0 | 57 | December 20, 2024
OpenGL interoperability | 1 | 290 | December 19, 2024
[tac] Follow up: Inductor HW backend implementation | 7 | 782 | November 16, 2024
[RFC] Adding Triton Backend for Aten operators | 0 | 433 | November 4, 2024
DTensor random RNG state support for non-CUDA backends | 2 | 141 | October 18, 2024
How profiling Pytorch Using Nsight Compute? | 2 | 390 | October 16, 2024
Prior art on implementing a "print" op on hardware | 0 | 40 | October 3, 2024
RFC Proposal for CUDA-Accelerated Dynamic Time Warping (DTW) Implementation in PyTorch | 2 | 415 | September 20, 2024
Why so many HW backend and nobody cooperate? | 6 | 457 | September 17, 2024
OpenCL backend dev - questions/support | 4 | 262 | August 29, 2024
Backend Fallbacks | 1 | 3408 | August 22, 2024
Intel GPU Enabling Status and Feature Plan | 0 | 540 | August 16, 2024
find_package(Torch REQUIRED) fails - 2.3.1 and nightly | 10 | 667 | August 6, 2024
ROCm vs OpenCL/dlprimitives | 0 | 288 | August 5, 2024
What are these `_foreach_` operators and why there is no fallback? | 6 | 793 | June 25, 2024
Overlapping device to host copy with GPU collectives | 5 | 594 | June 4, 2024
MPS working group? | 2 | 366 | March 12, 2024
Implementing OpenCL backend for pytorch | 14 | 16126 | March 1, 2024
Implementing _copy_from_and_resize for OpenCL backend | 0 | 306 | November 21, 2023
Custom TensorImpl and TorchDynamo | 1 | 544 | September 10, 2023
Weight sharing on cuda | 12 | 2016 | August 31, 2023
Unable to port with isfinite(isxxx) series functions | 1 | 473 | July 31, 2023