artyom-beilis / dlprimitives
Deep Learning Primitives and Mini-Framework for OpenCL
☆197 · Updated 9 months ago
Alternatives and similar repositories for dlprimitives
Users who are interested in dlprimitives are comparing it to the libraries listed below.
- DLPrimitives/OpenCL out of tree backend for pytorch ☆352 · Updated 9 months ago
- Next generation BLAS implementation for ROCm platform ☆382 · Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code ☆590 · Updated this week
- A collection of examples for the ROCm software stack ☆224 · Updated this week
- AMD's graph optimization engine. ☆223 · Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs. ☆284 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆246 · Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆230 · Updated this week
- Development repository for the Triton language and compiler ☆125 · Updated this week
- ☆261 · Updated this week
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆106 · Updated 5 months ago
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime ☆258 · Updated this week
- GPUOcelot: A dynamic compilation framework for PTX ☆195 · Updated 4 months ago
- A tool which profiles OpenCL devices to find their peak capacities ☆455 · Updated 2 weeks ago
- ROCm BLAS marshalling library ☆144 · Updated this week
- OpenAI Triton backend for Intel® GPUs ☆191 · Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆427 · Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo. ☆100 · Updated last month
- Implementation of OpenCL 3.0 on Vulkan ☆396 · Updated last week
- ☆108 · Updated last week
- AI Tensor Engine for ROCm ☆208 · Updated this week
- Tuned OpenCL BLAS ☆1,116 · Updated 2 months ago
- ☆60 · Updated last year
- ☆48 · Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆404 · Updated 5 months ago
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm ☆177 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆106 · Updated this week
- Python SYCL bindings and SYCL-based Python Array API library ☆113 · Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators ☆102 · Updated last month
- rocWMMA ☆117 · Updated this week