HabanaAI / SynapseAI_Core
SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi
☆42Updated 11 months ago
Alternatives and similar repositories for SynapseAI_Core
Users who are interested in SynapseAI_Core are comparing it to the libraries listed below.
- ☆135Updated last week
- Bandwidth test for ROCm☆73Updated last week
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last month
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated last week
- oneAPI Collective Communications Library (oneCCL)☆253Updated last month
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 7 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆149Updated this week
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆25Updated 9 months ago
- Development repository for the Triton language and compiler☆140Updated this week
- AMD's graph optimization engine.☆272Updated last week
- Benchmarks to capture important workloads.☆32Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆86Updated last week
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆48Updated 5 months ago
- A Data-Centric Compiler for Machine Learning☆85Updated last month
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆154Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆254Updated this week
- A GPU-driven system framework for scalable AI applications☆124Updated 11 months ago
- Ahead of Time (AOT) Triton Math Library☆88Updated this week
- OpenAI Triton backend for Intel® GPUs☆225Updated this week
- An extension library of WMMA API (Tensor Core API)☆109Updated last year
- ☆59Updated this week
- ☆50Updated last year
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆104Updated last month
- MAD (Model Automation and Dashboarding)☆31Updated 2 weeks ago
- ☆164Updated this week
- An IR for efficiently simulating distributed ML computation.☆32Updated 2 years ago
- ☆60Updated 2 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆375Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Updated 2 years ago