huggingface / huggingface-inference-toolkitLinks
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆87Updated last month
Alternatives and similar repositories for huggingface-inference-toolkit
Users that are interested in huggingface-inference-toolkit are comparing it to the libraries listed below
Sorting:
- ☆49Updated 8 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
- Python library to use Pleias-RAG models☆63Updated 5 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆68Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆116Updated 2 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 11 months ago
- A massively multilingual modern encoder language model☆100Updated last week
- ☆136Updated 2 months ago
- Pre-train Static Word Embeddings☆87Updated last month
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year
- Datamodels for hugging face tokenizers☆85Updated 3 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated this week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆110Updated 6 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- ☆55Updated 11 months ago
- ☆79Updated 3 months ago
- PyLate efficient inference engine☆66Updated last month
- ☆124Updated 11 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- ☆62Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆97Updated this week
- ☆49Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated 11 months ago
- ☆14Updated 3 months ago
- ☆80Updated last year
- ☆31Updated 11 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated last year
- Chunk your text using gpt4o-mini more accurately☆44Updated last year