philschmid / text-generation-inference-testsLinks
☆19Updated last year
Alternatives and similar repositories for text-generation-inference-tests
Users that are interested in text-generation-inference-tests are comparing it to the libraries listed below
Sorting:
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆105Updated last month
- ☆55Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs☆118Updated last year
- ☆197Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆91Updated last year
- Tutorial for building LLM router☆233Updated last year
- Routing on Random Forest (RoRF)☆218Updated last year
- ☆138Updated 2 months ago
- ☆64Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 7 months ago
- Synthetic Data for LLM Fine-Tuning☆119Updated last year
- ☆23Updated 2 years ago
- ☆170Updated 8 months ago
- Let's build better datasets, together!☆263Updated 10 months ago
- ☆43Updated last year
- ☆67Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆83Updated last year
- manage histories of LLM applied applications☆90Updated last year
- ☆46Updated 2 years ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆61Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated last year
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆73Updated 7 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- ☆146Updated last year
- ☆79Updated 9 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year