HabanaAI / Setup_and_InstallLinks
Setup and Installation Instructions for Habana binaries, docker image creation
☆27Updated 3 weeks ago
Alternatives and similar repositories for Setup_and_Install
Users that are interested in Setup_and_Install are comparing it to the libraries listed below
Sorting:
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆169Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 5 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 9 months ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆518Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆202Updated this week
- ☆130Updated last week
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆56Updated this week
- Bandwidth test for ROCm☆72Updated last week
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆216Updated 2 weeks ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆103Updated this week
- The Triton backend for the PyTorch TorchScript models.☆167Updated this week
- MLPerf™ logging library☆37Updated this week
- Intel® Extension for TensorFlow*☆350Updated last month
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆323Updated 2 months ago
- ☆58Updated last year
- ☆111Updated 11 months ago
- ROCm Communication Collectives Library (RCCL)☆404Updated last week
- A validation and profiling tool for AI infrastructure☆352Updated this week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆396Updated 6 months ago
- Inference server benchmarking tool☆130Updated 2 months ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆62Updated 3 months ago
- ☆67Updated this week
- ☆127Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆367Updated this week
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types…☆115Updated 5 months ago
- oneAPI Collective Communications Library (oneCCL)☆252Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆14Updated 3 months ago
- oneCCL Bindings for Pytorch* (deprecated)☆102Updated last month
- Example code for AWS Neuron SDK developers building inference and training applications☆152Updated this week