Large Language Model Text Generation Inference on Habana Gaudi
β34Mar 20, 2025Updated last year
Alternatives and similar repositories for tgi-gaudi
Users that are interested in tgi-gaudi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Easy and lightning fast training of π€ Transformers on Habana Gaudi processor (HPU)β209Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ88Updated this week
- β23Apr 17, 2026Updated last month
- β167May 18, 2026Updated last week
- Automatically derive Python dunder methods for your Rust codeβ26Apr 7, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β93Apr 15, 2026Updated last month
- π€ Optimum Intel: Accelerate inference with Intel optimization toolsβ588Updated this week
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.β39Dec 2, 2025Updated 5 months ago
- GenAI components at micro-service level; GenAI service composer to create mega-serviceβ195May 11, 2026Updated 2 weeks ago
- Reference models for Intel(R) Gaudi(R) AI Acceleratorβ171Jan 8, 2026Updated 4 months ago
- Nightly release storeβ23Updated this week
- A framework for few-shot evaluation of language models.β36Apr 3, 2026Updated last month
- Python interface to NUMA Linux libraryβ27Nov 5, 2019Updated 6 years ago
- Chunk Dedupe Estimationβ20Nov 5, 2024Updated last year
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Openβ¦β733Updated this week
- β17Updated this week
- Mini-Engine Demonstration of Combining XeSS with VRS Tier 2.β14Jan 26, 2026Updated 4 months ago
- Explainable AI Tooling (XAI). XAI is used to discover and explain a model's prediction in a way that is interpretable to the user. Relevaβ¦β38Sep 22, 2025Updated 8 months ago
- β209Apr 22, 2026Updated last month
- β14Jan 21, 2025Updated last year
- oneAPI Deep Neural Network Library (oneDNN)β22Updated this week
- β13Feb 13, 2021Updated 5 years ago
- Helper Files for IDCβ44Oct 23, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Test-time-training on nearest neighbors for large language modelsβ50Apr 18, 2024Updated 2 years ago
- β11Nov 20, 2024Updated last year
- β20Oct 5, 2025Updated 7 months ago
- InnerEye dataset creation tool for InnerEye-DeepLearning library. Transforms DICOM data into mask for training Deep Learning models.β21Mar 21, 2024Updated 2 years ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.β13Sep 13, 2024Updated last year
- A huge dataset for Document Visual Question Answeringβ23Jul 29, 2024Updated last year
- β26May 19, 2026Updated last week
- Sample Callback Server written in Nodeβ12Sep 22, 2018Updated 7 years ago
- An innovative library for efficient LLM inference via low-bit quantizationβ353Aug 30, 2024Updated last year
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- eBPF tool to collect BOLT profileβ14Apr 9, 2026Updated last month
- β29Nov 18, 2025Updated 6 months ago
- Automatically check repositories health and quality and build reports that help us understand the current state of Sauce Labs repositorieβ¦β13Apr 10, 2023Updated 3 years ago
- β15Mar 3, 2025Updated last year
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zigβ14Apr 7, 2025Updated last year
- π³ Docker CI image for Leon projects.β13Nov 6, 2021Updated 4 years ago
- Pragmatic approach to parsing import profiles for CI'sβ12Jul 1, 2024Updated last year