HabanaAI / Setup_and_Install

Setup and Installation Instructions for Habana binaries, docker image creation

☆26

Alternatives and similar repositories for Setup_and_Install

Users who are interested in Setup_and_Install are comparing it to the repositories listed below.

huggingface / tgi-gaudi
Large Language Model Text Generation Inference on Habana Gaudi
☆34 Updated 7 months ago
pytorch / test-infra
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …
☆102 Updated this week
intel / ai-containers
This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …
☆52 Updated last week
NVIDIA / nim-anywhere
Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench
☆180 Updated 5 months ago
HabanaAI / Model-References
Reference models for Intel(R) Gaudi(R) AI Accelerator
☆165 Updated 3 weeks ago
amd / ZenDNN
☆127 Updated last week
ROCm / triton
Development repository for the Triton language and compiler
☆135 Updated this week
intel / intel-extension-for-deepspeed
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…
☆63 Updated 3 months ago
AI-Hypercomputer / JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…
☆384 Updated 4 months ago
huggingface / optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
☆501 Updated this week
HabanaAI / vllm-fork
A high-throughput and memory-efficient inference and serving engine for LLMs
☆83 Updated this week
huggingface / optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
☆200 Updated this week
aws-neuron / transformers-neuronx
☆110 Updated 9 months ago
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆357 Updated this week
ROCm / rocm_bandwidth_test
Bandwidth test for ROCm
☆66 Updated this week
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆162 Updated last week
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆114 Updated 2 years ago
ROCm / hipBLASLt
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆115 Updated last week
aws-neuron / aws-neuron-sdk
Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i…
☆548 Updated this week
openvinotoolkit / openvino_tokenizers
OpenVINO Tokenizers extension
☆42 Updated last week
triton-inference-server / perf_analyzer
☆114 Updated last week
huggingface / optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…
☆318 Updated 3 weeks ago
HabanaAI / Gaudi-tutorials
Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…
☆61 Updated last month
onnx / steering-committee
Notes and artifacts from the ONNX steering committee
☆26 Updated this week
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆160 Updated this week
amd / ryzen-ai-documentation
Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…
☆82 Updated last week
groq / mlagility
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
☆40 Updated 2 months ago
uxlfoundation / oneAPI-spec
oneAPI Specification source files
☆207 Updated last week
NVIDIA / GPUStressTest
GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types…
☆110 Updated 3 months ago
intel / intel-xpu-backend-for-triton
OpenAI Triton backend for Intel® GPUs
☆211 Updated this week