HabanaAI / SynapseAI_Core
SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi
☆38Updated last month
Alternatives and similar repositories for SynapseAI_Core:
Users that are interested in SynapseAI_Core are comparing it to the libraries listed below
- Bandwidth test for ROCm☆54Updated 2 weeks ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆79Updated this week
- oneAPI Collective Communications Library (oneCCL)☆227Updated this week
- RCCL Performance Benchmark Tests☆60Updated 2 weeks ago
- Benchmarks to capture important workloads.☆30Updated last month
- End to End steps for adding custom ops in PyTorch.☆21Updated 4 years ago
- ☆138Updated this week
- An extension library of WMMA API (Tensor Core API)☆91Updated 8 months ago
- OpenAI Triton backend for Intel® GPUs☆170Updated this week
- ☆25Updated this week
- ☆106Updated 3 weeks ago
- oneCCL Bindings for Pytorch*☆91Updated 2 weeks ago
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆21Updated 3 months ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing. By pro…☆70Updated this week
- MLIR-based partitioning system☆74Updated this week
- rocWMMA☆104Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆62Updated 3 weeks ago
- AMD's graph optimization engine.☆213Updated this week
- ☆61Updated 3 months ago
- ☆73Updated 4 months ago
- ☆49Updated last year
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated last week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆138Updated this week
- RAND library for HIP programming language☆117Updated this week
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆59Updated last week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated last week
- ☆60Updated last year
- Standalone Flash Attention v2 kernel without libtorch dependency☆106Updated 6 months ago
- AMD’s C++ library for accelerating tensor primitives☆39Updated this week