Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆91 · Jan 9, 2026 · Updated last month
Alternatives and similar repositories for huggingface-inference-toolkit
Users who are interested in huggingface-inference-toolkit are comparing it to the libraries listed below.
- ☆24 · Feb 24, 2026 · Updated last week
- ☆21 · Jan 21, 2026 · Updated last month
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more ☆17 · May 1, 2023 · Updated 2 years ago
- AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models. ☆37 · Dec 2, 2025 · Updated 3 months ago
- FlexiTokens ☆18 · Dec 27, 2025 · Updated 2 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings ☆13 · May 22, 2025 · Updated 9 months ago
- 🤝 Trade any tensors over the network ☆31 · Sep 27, 2023 · Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Mar 20, 2024 · Updated last year
- ☆16 · Jul 23, 2024 · Updated last year
- Large Language Model Text Generation Inference on Habana Gaudi ☆34 · Mar 20, 2025 · Updated 11 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆206 · Aug 31, 2024 · Updated last year
- Repository for initial POC NLP based SQL adapter using LLM. ☆10 · May 6, 2025 · Updated 9 months ago
- 🤗 Collection of examples on how to train, deploy and monitor HuggingFace models in Google Cloud Vertex AI ☆22 · Feb 26, 2024 · Updated 2 years ago
- ☆18 · Sep 5, 2024 · Updated last year
- The official evaluation suite and dynamic data release for MixEval. ☆11 · Sep 23, 2024 · Updated last year
- ☆16 · Sep 4, 2025 · Updated 6 months ago
- MCP server for Liveblocks. ☆15 · Feb 14, 2026 · Updated 2 weeks ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning" ☆13 · Jun 22, 2025 · Updated 8 months ago
- 8-bit floating point types for Rust ☆63 · Feb 4, 2026 · Updated last month
- Small python package to measure OCR quality and other related metrics. ☆27 · Feb 19, 2024 · Updated 2 years ago
- DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets. ☆14 · Mar 9, 2022 · Updated 3 years ago
- Github action to connect to tailscale ☆18 · Dec 1, 2025 · Updated 3 months ago
- Repository for opt-out requests. ☆10 · Mar 25, 2024 · Updated last year
- A course on building Large Language Models ☆11 · Mar 24, 2025 · Updated 11 months ago
- A Rust crate offering similar functionality to the Python transformers package using Candle. ☆14 · Nov 19, 2024 · Updated last year
- Homebrew MCP : Comprehensive brew support for installing, upgrading, searching, and maintaining macOS packages. ☆25 · Jun 23, 2025 · Updated 8 months ago
- Let's build better datasets, together! ☆271 · Dec 20, 2024 · Updated last year
- ☆13 · Dec 21, 2025 · Updated 2 months ago
- 🪐 Jupyter Kernel Client through HTTP and WebSocket. ☆17 · Feb 11, 2026 · Updated 3 weeks ago
- ☆15 · Apr 11, 2024 · Updated last year
- Chunk Dedupe Estimation ☆20 · Nov 5, 2024 · Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆282 · Jul 11, 2024 · Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 · Sep 19, 2025 · Updated 5 months ago
- A framework for few-shot evaluation of language models. ☆36 · Mar 18, 2025 · Updated 11 months ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation ☆16 · Aug 3, 2025 · Updated 7 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,314 · Feb 20, 2026 · Updated last week
- ☆130 · Oct 1, 2024 · Updated last year
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs … ☆63 · Feb 6, 2025 · Updated last year
- A massively multilingual modern encoder language model ☆131 · Jan 20, 2026 · Updated last month