HabanaAI / Setup_and_Install
Setup and Installation Instructions for Habana binaries, docker image creation
☆25Updated last month
Alternatives and similar repositories for Setup_and_Install:
Users that are interested in Setup_and_Install are comparing it to the libraries listed below
- Large Language Model Text Generation Inference on Habana Gaudi☆31Updated last week
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆57Updated last week
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆159Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆171Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆56Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆60Updated 2 months ago
- ☆34Updated this week
- OpenAI Triton backend for Intel® GPUs☆165Updated this week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆163Updated this week
- Development repository for the Triton language and compiler☆107Updated this week
- AMD SMI☆54Updated this week
- ☆19Updated 2 months ago
- ☆19Updated last month
- oneAPI Collective Communications Library (oneCCL)☆222Updated 3 weeks ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆12Updated 2 months ago
- oneCCL Bindings for Pytorch*☆88Updated last month
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types…☆83Updated 4 months ago
- ☆17Updated this week
- ☆105Updated 3 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆38Updated 2 months ago
- oneAPI Technical Advisory Board (TAB) Meeting Notes☆72Updated last year
- Bandwidth test for ROCm☆54Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week
- ☆27Updated 2 weeks ago
- ☆43Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆303Updated this week
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆35Updated this week
- ☆99Updated 3 weeks ago
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week
- OpenVINO Tokenizers extension☆29Updated this week