Easy and lightning fast training of ๐ค Transformers on Habana Gaudi processor (HPU)
โ207Mar 3, 2026Updated this week
Alternatives and similar repositories for optimum-habana
Users that are interested in optimum-habana are comparing it to the libraries listed below
Sorting:
- Large Language Model Text Generation Inference on Habana Gaudiโ34Mar 20, 2025Updated 11 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.โ14Jan 8, 2026Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMsโ85Mar 3, 2026Updated last week
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudiโ42Feb 3, 2025Updated last year
- Reference models for Intel(R) Gaudi(R) AI Acceleratorโ170Jan 8, 2026Updated 2 months ago
- Provides the examples to write and build Habana custom kernels using the HabanaToolsโ25Apr 15, 2025Updated 10 months ago
- ๐ค Optimum Intel: Accelerate inference with Intel optimization toolsโ542Mar 2, 2026Updated last week
- โ24Oct 9, 2025Updated 5 months ago
- Intelยฎ Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Noteโฆโ65Jun 30, 2025Updated 8 months ago
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use casesโ13Dec 2, 2024Updated last year
- Training and inference on AWS Trainium and Inferentia chips.โ261Updated this week
- โ14Mar 1, 2025Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.โ91Jan 9, 2026Updated 2 months ago
- A starter kit for evaluating benchmarks on the ๐ค Hubโ16Dec 29, 2023Updated 2 years ago
- ๐ Accelerate inference and training of ๐ค Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationโฆโ3,310Feb 9, 2026Updated last month
- Hugging Face Jobsโ19Jul 11, 2025Updated 7 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for trainingโ18Dec 19, 2024Updated last year
- โ155Updated this week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open dataโ23Jul 30, 2024Updated last year
- Hugging Face and Pyserini interoperabilityโ19May 18, 2023Updated 2 years ago
- โ34Feb 1, 2023Updated 3 years ago
- ๐๏ธ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Oโฆโ331Sep 25, 2025Updated 5 months ago
- โ56Jun 26, 2025Updated 8 months ago
- โ24Feb 24, 2026Updated 2 weeks ago
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUsโ68Updated this week
- โ44Mar 3, 2023Updated 3 years ago
- Manage scalable open LLM inference endpoints in Slurm clustersโ282Jul 11, 2024Updated last year
- A Streamlit app to add structured tags to a dataset cardโ22Jun 30, 2022Updated 3 years ago
- ๐ง ResNet: Deep Residual Learning for Image Recognitionโ10Sep 18, 2021Updated 4 years ago
- decontaminationโ26Updated this week
- OpenVINO LLM Benchmarkโ11Dec 7, 2023Updated 2 years ago
- โ17Updated this week
- Accelerated inference of ๐ค models using FuriosaAI NPU chips.โ27Jun 9, 2025Updated 9 months ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).โ27Feb 14, 2023Updated 3 years ago
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.โ38Dec 2, 2025Updated 3 months ago
- Materials for workshops on the Hugging Face ecosystemโ152May 16, 2023Updated 2 years ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platformโ2,013Feb 13, 2026Updated 3 weeks ago
- โก Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Plโฆโ2,175Oct 8, 2024Updated last year
- AMD related optimizations for transformer modelsโ101Oct 16, 2025Updated 4 months ago