HabanaAI / SynapseAI_CoreLinks
SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi
☆42Updated 11 months ago
Alternatives and similar repositories for SynapseAI_Core
Users that are interested in SynapseAI_Core are comparing it to the libraries listed below
Sorting:
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last week
- ☆132Updated 3 weeks ago
- Bandwidth test for ROCm☆73Updated this week
- oneAPI Collective Communications Library (oneCCL)☆252Updated 3 weeks ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 6 months ago
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆24Updated 8 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆148Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆253Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated this week
- OpenAI Triton backend for Intel® GPUs☆223Updated this week
- An extension library of WMMA API (Tensor Core API)☆109Updated last year
- ☆55Updated this week
- Benchmarks to capture important workloads.☆31Updated 11 months ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆169Updated this week
- ☆50Updated last year
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆48Updated 4 months ago
- ☆71Updated 9 months ago
- A Data-Centric Compiler for Machine Learning☆85Updated 3 weeks ago
- AMD's graph optimization engine.☆269Updated this week
- A GPU-driven system framework for scalable AI applications☆123Updated 11 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆154Updated last month
- ☆278Updated this week
- An IR for efficiently simulating distributed ML computation.☆32Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆130Updated 2 weeks ago
- Development repository for the Triton language and compiler☆140Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆55Updated this week
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆27Updated last year
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆92Updated 2 years ago
- RCCL Performance Benchmark Tests☆85Updated last month