artyom-beilis / dlprimitivesLinks
Deep Learning Primitives and Mini-Framework for OpenCL
☆204Updated last year
Alternatives and similar repositories for dlprimitives
Users that are interested in dlprimitives are comparing it to the libraries listed below
Sorting:
- DLPrimitives/OpenCL out of tree backend for pytorch☆374Updated last year
- HIPIFY: Convert CUDA to Portable C++ Code☆628Updated this week
- AMD's graph optimization engine.☆262Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆303Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆387Updated this week
- ☆271Updated last week
- ☆127Updated last week
- 8-bit CUDA functions for PyTorch☆66Updated last month
- Development repository for the Triton language and compiler☆136Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆147Updated last week
- Implementation of OpenCL 3.0 on Vulkan☆413Updated last week
- ☆61Updated 2 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆245Updated this week
- A collection of examples for the ROCm software stack☆251Updated this week
- Tuned OpenCL BLAS☆1,154Updated last month
- build scripts for ROCm☆187Updated last year
- Tensor Tiling Library☆37Updated last month
- ☆153Updated this week
- A tool which profiles OpenCL devices to find their peak capacities☆474Updated 4 months ago
- Print all known information about all available OpenCL platforms and devices in the system☆363Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆114Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆254Updated last week
- GPUOcelot: A dynamic compilation framework for PTX☆211Updated 9 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆266Updated last week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆260Updated 9 months ago
- Make PyTorch models at least run on APUs.☆57Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆136Updated last week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆425Updated 9 months ago
- A small OpenCL benchmark program to measure peak GPU/CPU performance.☆259Updated 2 weeks ago
- Fast and memory-efficient exact attention☆198Updated 2 weeks ago