artyom-beilis / pytorch_dlprimLinks
DLPrimitives/OpenCL out of tree backend for pytorch
☆356Updated 10 months ago
Alternatives and similar repositories for pytorch_dlprim
Users that are interested in pytorch_dlprim are comparing it to the libraries listed below
Sorting:
- Deep Learning Primitives and Mini-Framework for OpenCL☆199Updated 10 months ago
- HIPIFY: Convert CUDA to Portable C++ Code☆597Updated this week
- A collection of examples for the ROCm software stack☆225Updated last week
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆222Updated this week
- ☆111Updated last week
- 8-bit CUDA functions for PyTorch☆53Updated 3 weeks ago
- build scripts for ROCm☆186Updated last year
- Tuned OpenCL BLAS☆1,123Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆382Updated this week
- Development repository for the Triton language and compiler☆125Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆290Updated last week
- A tool which profiles OpenCL devices to find their peak capacities☆458Updated last month
- AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs.☆555Updated last week
- ☆356Updated 3 months ago
- OpenAI Triton backend for Intel® GPUs☆191Updated this week
- ☆264Updated this week
- ☆143Updated this week
- ☆233Updated 2 years ago
- AMD's graph optimization engine.☆228Updated this week
- Implementation of OpenCL 3.0 on Vulkan☆399Updated 2 weeks ago
- ☆430Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆233Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆201Updated 5 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆437Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆405Updated 6 months ago
- A small OpenCL benchmark program to measure peak GPU/CPU performance.☆230Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆110Updated this week
- AI Tensor Engine for ROCm☆232Updated this week
- Print all known information about all available OpenCL platforms and devices in the system☆353Updated 3 weeks ago
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆260Updated this week