Easy and lightning fast training of ๐ค Transformers on Habana Gaudi processor (HPU)
โ214Jun 23, 2026Updated last week
Alternatives and similar repositories for optimum-habana
Users that are interested in optimum-habana are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Large Language Model Text Generation Inference on Habana Gaudiโ34Mar 20, 2025Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.โ14Jan 8, 2026Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMsโ88Updated this week
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudiโ46Feb 3, 2025Updated last year
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://devโฆโ65Sep 18, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean โข AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ๐ค Optimum Intel: Accelerate inference with Intel optimization toolsโ602Updated this week
- Provides the examples to write and build Habana custom kernels using the HabanaToolsโ26Apr 15, 2025Updated last year
- Blazing fast training of ๐ค Transformers on Graphcore IPUsโ88May 26, 2026Updated last month
- โ26Oct 9, 2025Updated 8 months ago
- Intelยฎ Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Noteโฆโ65May 27, 2026Updated last month
- GenAI components at micro-service level; GenAI service composer to create mega-serviceโ196Updated this week
- โ178Updated this week
- โ24May 26, 2026Updated last month
- Intel Gaudi's Megatron DeepSpeed Large Language Models for trainingโ18Dec 19, 2024Updated last year
- Virtual machines for every use case on DigitalOcean โข AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- โ14Mar 1, 2025Updated last year
- ๐ Accelerate inference and training of ๐ค Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationโฆโ3,426Jun 22, 2026Updated last week
- Training and inference on AWS Trainium and Inferentia chips.โ268Jun 15, 2026Updated 2 weeks ago
- Accelerated inference of ๐ค models using FuriosaAI NPU chips.โ28May 26, 2026Updated last month
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.โ96May 28, 2026Updated last month
- โ10Dec 15, 2022Updated 3 years ago
- โ27Updated this week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open dataโ24Jul 30, 2024Updated last year
- Repository for CPU Kernel Generation for LLM Inferenceโ28Jul 13, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Openโฆโ736Updated this week
- An innovative library for efficient LLM inference via low-bit quantizationโ353Aug 30, 2024Updated last year
- Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safetyโฆโ41Apr 6, 2026Updated 2 months ago
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUsโ76Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platformโ2,015Mar 30, 2026Updated 2 months ago
- โ97Updated this week
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, โฆโ2,666Updated this week
- Github action to connect to tailscaleโ21Jun 8, 2026Updated 3 weeks ago
- MedConceptsQA: Open source medical concepts QA benchmarkโ19Dec 30, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits โข AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- โ18May 6, 2026Updated last month
- ๐๏ธ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Oโฆโ338May 26, 2026Updated last month
- AMD related optimizations for transformer modelsโ101May 26, 2026Updated last month
- Setup and Installation Instructions for Habana binaries, docker image creationโ28Apr 14, 2026Updated 2 months ago
- Manage scalable open LLM inference endpoints in Slurm clustersโ288Jul 11, 2024Updated last year
- Optimize with SigOpt with this standalone SigOpt client driver.โ12May 18, 2026Updated last month
- A Streamlit app to add structured tags to a dataset cardโ23Jun 30, 2022Updated 3 years ago