microsoft/best-route-llm

Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting costs by up to 60% with <1% performance drop. From the paper: https://arxiv.org/abs/2506.22716

☆44

Alternatives and similar repositories for best-route-llm

Users interested in best-route-llm are comparing it to the repositories listed below.

zikuicai / aegisllm
☆33 · Feb 17, 2026 · Updated 2 weeks ago
eth-easl / deltazip
Compression for Foundation Models
☆35 · Jul 21, 2025 · Updated 7 months ago
IPADS-SAI / WaferAI-SIM
The wafer-native AI accelerator simulation platform and inference engine.
☆50 · Jan 1, 2026 · Updated 2 months ago
hao-ai-lab / MuxServe
☆87 · Oct 17, 2025 · Updated 4 months ago
ilsilfverskiold / ai-personalized-tech-reports-discord
Build an AI bot in Discord to serve user's personalized reports on what's up in tech
☆28 · Sep 14, 2025 · Updated 5 months ago
TieJianKuDan / FHCCS
FHCCS based chaotic image encryption.
☆11 · Oct 7, 2023 · Updated 2 years ago
RistovaIvona / Bank-Marketing
A Data-Driven Approach to Predict the Success of Bank Telemarketing
☆10 · Apr 27, 2021 · Updated 4 years ago
yule-BUAA / MergeLLM
Codes for Merging Large Language Models
☆35 · Aug 7, 2024 · Updated last year
felixbinder / introspection_self_prediction
Code for experiments on self-prediction as a way to measure introspection in LLMs
☆16 · Dec 10, 2024 · Updated last year
ronakdm / ml-interviews
Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).
☆11 · Dec 28, 2022 · Updated 3 years ago
GinsengHoney / Nerf_study
☆10 · Jul 16, 2023 · Updated 2 years ago
chanind / linear-relational
Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch
☆10 · Aug 7, 2024 · Updated last year
spectraldani / thindeepgps
Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)
☆14 · Nov 25, 2024 · Updated last year
ag8 / sha-transformer
☆12 · Jul 8, 2024 · Updated last year
NikLever / model-viewer-course
Resources for my <model-viewer> course
☆11 · Jul 25, 2023 · Updated 2 years ago
lzhangbv / acpsgd
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10 · Apr 28, 2023 · Updated 2 years ago
alexrs / herd
Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
☆12 · Feb 11, 2024 · Updated 2 years ago
ajhalthor / stock-price-prediction
Predicting the Stock Market - Can we do it?
☆10 · Jul 24, 2021 · Updated 4 years ago
duykhuongnguyen / MAT-Steer
☆15 · Aug 19, 2025 · Updated 6 months ago
Hamme122 / gaussian-flow
Unofficial implementation of "Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle"
☆13 · Jul 3, 2024 · Updated last year
MraDonkey / rethinking_prompting
[ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…
☆16 · Aug 15, 2025 · Updated 6 months ago
lirundong / quant-pack
[Ongoing Project] Codebase for network quantization study.
☆12 · May 20, 2020 · Updated 5 years ago
CSHaitao / CaseGen
A Benchmark for Multi-Stage Legal Case Documents Generation
☆15 · Feb 24, 2025 · Updated last year
zuoqing1988 / train-ssd
train ssd
☆10 · Apr 30, 2019 · Updated 6 years ago
sands-lab / splitbud
SplitBud is a Split Learning framework built upon Flower
☆14 · Mar 22, 2025 · Updated 11 months ago
xingdi1990 / FaceAttributePrediction
This repository is on the way to state of art face attribute prediction method
☆10 · Mar 22, 2018 · Updated 7 years ago
jiangycTarheel-zz / TPT-Summ
☆11 · Jun 24, 2021 · Updated 4 years ago
eth-easl / pccheck
☆12 · Nov 8, 2024 · Updated last year
AvisP / LM_Finetune
Repo containing few notebooks on fine tuning of Language Models
☆13 · Apr 29, 2024 · Updated last year
coffee4j / coffee4j
A Java-based framework for combinatorial test input generation, fault characterization and automated test execution.
☆11 · Jan 22, 2024 · Updated 2 years ago
Vermeille / motion-capture
Animate a SVG avatar through facial Motion Capture
☆11 · Oct 3, 2023 · Updated 2 years ago
fiveai / understanding_safety_finetuning
Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)
☆12 · Oct 31, 2024 · Updated last year
huybery / GDPnet
GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)
☆11 · Nov 21, 2021 · Updated 4 years ago
agentic-learning-ai-lab / anticipatory-recovery
Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"
☆11 · Oct 27, 2025 · Updated 4 months ago
smduan / FedMix
The implementation of FedMix
☆11 · Aug 18, 2022 · Updated 3 years ago
CSU-NetLab / A2TP-Eurosys2023
☆11 · Mar 13, 2023 · Updated 2 years ago
google-deepmind / exedec
☆13 · May 9, 2024 · Updated last year
MANGA-UOFA / PTfer
☆11 · Nov 13, 2024 · Updated last year
Surrey-EEEM071-CVDL / CourseWork
The course work repo for UoSurrey EEEM071 (2023 Spring)
☆11 · May 9, 2023 · Updated 2 years ago