HabanaAI / Setup_and_Install
Setup and Installation Instructions for Habana binaries, docker image creation
☆25 Updated last month
Alternatives and similar repositories for Setup_and_Install:
Users who are interested in Setup_and_Install compare it to the libraries listed below.
- Reference models for Intel(R) Gaudi(R) AI Accelerator ☆161 Updated last month
- ☆37 Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note… ☆62 Updated 3 weeks ago
- Large Language Model Text Generation Inference on Habana Gaudi ☆32 Updated last week
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev… ☆59 Updated this week
- oneCCL Bindings for Pytorch* ☆91 Updated 2 weeks ago
- Development repository for the Triton language and compiler ☆114 Updated this week
- ☆20 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆62 Updated this week
- ☆25 Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU) ☆180 Updated this week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications. ☆167 Updated this week
- OpenVINO Tokenizers extension ☆31 Updated this week
- OpenAI Triton backend for Intel® GPUs ☆172 Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆13 Updated last month
- ☆19 Updated last week
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training ☆13 Updated 3 months ago
- Bandwidth test for ROCm ☆54 Updated 2 weeks ago
- oneAPI Collective Communications Library (oneCCL) ☆227 Updated last week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools ☆38 Updated 3 weeks ago
- Intel® SHMEM - Device initiated shared memory based communication library ☆23 Updated 4 months ago
- ☆60 Updated last year
- OpenVINO NPU Plugin ☆47 Updated this week
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types… ☆86 Updated 5 months ago
- RCCL Performance Benchmark Tests ☆60 Updated 2 weeks ago
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona… ☆84 Updated this week
- CloudAI Benchmark Framework ☆59 Updated last week
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆138 Updated this week
- Documentation for vLLM Dev Channel releases ☆9 Updated 3 months ago
- LLM SDK for OnnxRuntime GenAI (OGA) ☆117 Updated this week