withmartian / routerbenchView external linksLinks
The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System
☆153Jun 13, 2024Updated last year
Alternatives and similar repositories for routerbench
Users that are interested in routerbench are comparing it to the libraries listed below
Sorting:
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆29Jul 24, 2025Updated 6 months ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Nov 4, 2025Updated 3 months ago
- ☆56Jun 26, 2025Updated 7 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 4 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 4 months ago
- ☆54Jan 15, 2026Updated 3 weeks ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard☆21Aug 13, 2024Updated last year
- Tutorial for building LLM router☆244Jul 19, 2024Updated last year
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,581Aug 10, 2024Updated last year
- ☆23Jul 10, 2023Updated 2 years ago
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks☆12Jul 29, 2023Updated 2 years ago
- A large-scale simulation framework for LLM inference☆530Jul 25, 2025Updated 6 months ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Nov 4, 2025Updated 3 months ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,897Jan 21, 2024Updated 2 years ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆820Jul 15, 2025Updated 6 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Jul 13, 2025Updated 7 months ago
- ☆10Jul 15, 2024Updated last year
- ☆16Mar 6, 2024Updated last year
- ☆12Mar 18, 2024Updated last year
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints☆15Jan 25, 2026Updated 2 weeks ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆369Dec 9, 2023Updated 2 years ago
- ExpressJS server for the GitWit React IDE.☆16May 28, 2024Updated last year
- AG2 (formerly AutoGen) is a programming framework for agentic AI. Join the community at: https://discord.gg/pAbnFJrkgZ☆13Jan 14, 2025Updated last year
- Jigsawstack Python SDK☆18Dec 1, 2025Updated 2 months ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆507Aug 26, 2024Updated last year
- Official PyTorch implementation of QA-LoRA☆145Mar 13, 2024Updated last year
- Connectors for your agent☆18Dec 7, 2025Updated 2 months ago
- State of What Art? A Call for Multi-Prompt LLM Evaluation☆15Jul 10, 2024Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆20Jun 16, 2024Updated last year
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"☆26Feb 19, 2025Updated 11 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆737Apr 10, 2024Updated last year
- Verbosity control for AI agents☆66May 23, 2024Updated last year
- Tools for merging pretrained large language models.☆19Jun 12, 2024Updated last year
- ☆23Dec 18, 2024Updated last year
- BenchBench is a Python package to evaluate multi-task benchmarks.☆18Oct 12, 2025Updated 4 months ago