amd / ZenDNNLinks
☆123Updated last week
Alternatives and similar repositories for ZenDNN
Users that are interested in ZenDNN are comparing it to the libraries listed below
Sorting:
- AMD's graph optimization engine.☆249Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆246Updated this week
- oneAPI Collective Communications Library (oneCCL)☆244Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆351Updated last week
- oneCCL Bindings for Pytorch*☆102Updated last month
- OpenAI Triton backend for Intel® GPUs☆206Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆460Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆148Updated last week
- Development repository for the Triton language and compiler☆130Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆111Updated this week
- AI Tensor Engine for ROCm☆267Updated this week
- Bandwidth test for ROCm☆65Updated last week
- ☆267Updated this week
- ROCm Communication Collectives Library (RCCL)☆363Updated this week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆45Updated 3 weeks ago
- A collection of examples for the ROCm software stack☆238Updated this week
- Ahead of Time (AOT) Triton Math Library☆76Updated last week
- GPUOcelot: A dynamic compilation framework for PTX☆206Updated 7 months ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆134Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆382Updated this week
- MLIR-based partitioning system☆125Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆62Updated 2 months ago
- Benchmarks to capture important workloads.☆31Updated 7 months ago
- Assembler for NVIDIA Volta and Turing GPUs☆229Updated 3 years ago
- ☆62Updated 8 months ago
- RCCL Performance Benchmark Tests☆75Updated 3 weeks ago
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆43Updated 7 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆152Updated this week
- rocWMMA☆128Updated this week
- ☆420Updated this week