HabanaAI / Setup_and_Install
Setup and Installation Instructions for Habana binaries, docker image creation
☆23Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Setup_and_Install
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆155Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆152Updated this week
- Large Language Model Text Generation Inference on Habana Gaudi☆26Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆57Updated 2 months ago
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆25Updated this week
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆54Updated this week
- AMD related optimizations for transformer models☆57Updated this week
- Development repository for the Triton language and compiler☆92Updated this week
- This repo contains documents of the OPEA project☆26Updated this week
- ☆39Updated last month
- AMD SMI☆41Updated this week
- oneCCL Bindings for Pytorch*☆86Updated last week
- ☆83Updated 5 months ago
- The no-code AI toolchain☆74Updated 2 weeks ago
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆37Updated last year
- ☆16Updated this week
- Libraries and tools to support Transfer Learning☆18Updated last month
- Home for OctoML PyTorch Profiler☆107Updated last year
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆145Updated this week
- OpenAI Triton backend for Intel® GPUs☆143Updated this week
- The Triton backend for the PyTorch TorchScript models.☆123Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆268Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆101Updated last week
- Computation using data flow graphs for scalable machine learning☆66Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆41Updated this week
- ☆29Updated this week
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types…☆76Updated 3 weeks ago
- Bandwidth test for ROCm☆47Updated this week
- ☆44Updated last month
- ☆100Updated last month