huggingface / huggingface-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆61 Updated 2 weeks ago
Alternatives and similar repositories for huggingface-inference-toolkit:
Users who are interested in huggingface-inference-toolkit compare it to the repositories listed below.
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated 10 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆101 Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆63 Updated 2 months ago
- ☆48 Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated 8 months ago
- ☆30 Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆53 Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆79 Updated 10 months ago
- Pre-train Static Word Embeddings ☆42 Updated this week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆56 Updated 3 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆52 Updated 11 months ago
- ☆39 Updated this week
- ☆42 Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 6 months ago
- Using multiple LLMs for ensemble Forecasting ☆16 Updated last year
- ☆62 Updated 6 months ago
- ☆110 Updated 4 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆58 Updated 5 months ago
- ☆24 Updated last year
- The first dense retrieval model that can be prompted like an LM ☆64 Updated 4 months ago
- ☆57 Updated 4 months ago
- Code for KaLM-Embedding models ☆68 Updated 2 weeks ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 Updated last month
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- ☆31 Updated 7 months ago
- Set of scripts to finetune LLMs ☆36 Updated 10 months ago
- ☆37 Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆59 Updated 5 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆74 Updated 3 months ago