withmartian / routerbench
The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System
☆101Updated 7 months ago
Alternatives and similar repositories for routerbench:
Users that are interested in routerbench are comparing it to the libraries listed below
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆156Updated 3 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆114Updated 7 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆100Updated last month
- Evaluating LLMs with CommonGen-Lite☆88Updated 10 months ago
- ☆151Updated 5 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆155Updated this week
- Automatic Evals for LLMs☆128Updated this week
- ☆50Updated 2 months ago
- ☆98Updated this week
- Evaluating LLMs with fewer examples☆141Updated 9 months ago
- ☆116Updated 3 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆249Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs☆117Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 4 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆158Updated 2 weeks ago
- LOFT: A 1 Million+ Token Long-Context Benchmark☆168Updated 3 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆100Updated 3 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Functional Benchmarks and the Reasoning Gap☆82Updated 3 months ago
- ☆87Updated last week
- ReLM is a Regular Expression engine for Language Models☆103Updated last year
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆215Updated 9 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆157Updated 2 weeks ago
- ☆30Updated 6 months ago
- ☆47Updated 2 months ago
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation☆150Updated 10 months ago
- ☆56Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 4 months ago
- Code repository for the c-BTM paper☆105Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆64Updated 5 months ago