HabanaAI / Setup_and_InstallLinks
Setup and Installation Instructions for Habana binaries, docker image creation
☆28Updated last month
Alternatives and similar repositories for Setup_and_Install
Users that are interested in Setup_and_Install are comparing it to the libraries listed below
Sorting:
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 10 months ago
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆60Updated 2 weeks ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆170Updated last month
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆64Updated 7 months ago
- ☆137Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆205Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- OpenVINO Tokenizers extension☆48Updated this week
- ☆134Updated this week
- The Triton backend for the PyTorch TorchScript models.☆173Updated this week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 6 months ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆532Updated this week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆204Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆380Updated this week
- oneAPI Collective Communications Library (oneCCL)☆254Updated last week
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆18Updated last year
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last month
- Issues related to MLPerf® Inference policies, including rules and suggested changes☆63Updated this week
- AMD related optimizations for transformer models☆97Updated 3 months ago
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆221Updated this week
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types…☆119Updated 7 months ago
- Development repository for the Triton language and compiler☆140Updated last week
- A top-like tool for monitoring GPUs in a cluster☆84Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆86Updated 2 weeks ago
- A utility for stressing GPUs by driving utilization (and thus power consumption) up and down in user-defined cycle intervals. It will als…☆26Updated 2 years ago
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- ☆24Updated 4 months ago
- Repository of model demos using TT-Buda☆63Updated 10 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆14Updated last month
- A validation and profiling tool for AI infrastructure☆360Updated this week