artyom-beilis / dlprimitivesLinks
Deep Learning Primitives and Mini-Framework for OpenCL
☆204Updated last year
Alternatives and similar repositories for dlprimitives
Users that are interested in dlprimitives are comparing it to the libraries listed below
Sorting:
- DLPrimitives/OpenCL out of tree backend for pytorch☆377Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code☆634Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆389Updated this week
- ☆127Updated last week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆305Updated this week
- ☆273Updated last week
- 8-bit CUDA functions for PyTorch☆68Updated 2 months ago
- Development repository for the Triton language and compiler☆137Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆216Updated 9 months ago
- AMD's graph optimization engine.☆266Updated this week
- A small OpenCL benchmark program to measure peak GPU/CPU performance.☆265Updated last week
- A collection of examples for the ROCm software stack☆259Updated this week
- A tool which profiles OpenCL devices to find their peak capacities☆474Updated 5 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆147Updated this week
- Implementation of OpenCL 3.0 on Vulkan☆414Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆114Updated this week
- OpenAI Triton backend for Intel® GPUs☆221Updated this week
- ☆155Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆269Updated last week
- High-Performance SGEMM on CUDA devices☆112Updated 10 months ago
- ☆61Updated 2 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆246Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆103Updated this week
- Super fast FP32 matrix multiplication on RDNA3☆79Updated 8 months ago
- LLM training in simple, raw C/HIP for AMD GPUs☆54Updated last year
- Tuned OpenCL BLAS☆1,161Updated 2 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆255Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆129Updated this week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆260Updated 10 months ago
- Print all known information about all available OpenCL platforms and devices in the system☆366Updated 5 months ago