microsoft / best-route-llmView external linksLinks
Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting costs by up to 60% with <1% performance drop. From the paper//arxiv.org/abs/2506.22716
☆42Aug 6, 2025Updated 6 months ago
Alternatives and similar repositories for best-route-llm
Users that are interested in best-route-llm are comparing it to the libraries listed below
Sorting:
- ☆33Sep 26, 2025Updated 4 months ago
- The wafer-native AI accelerator simulation platform and inference engine.☆50Jan 1, 2026Updated last month
- A Data-Driven Approach to Predict the Success of Bank Telemarketing☆10Apr 27, 2021Updated 4 years ago
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 5 months ago
- Codes for Merging Large Language Models☆35Aug 7, 2024Updated last year
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- Resources for my <model-viewer> course☆11Jul 25, 2023Updated 2 years ago
- ☆12Jul 8, 2024Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- A collection of demos and utilities prepared ahead of the Vector Institute Privacy Enhancing Techniques (PETs) Bootcamp.☆15Sep 22, 2022Updated 3 years ago
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).☆11Dec 28, 2022Updated 3 years ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆13Nov 25, 2024Updated last year
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- Code for SIGKDD2025 paper: An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem☆14Jan 28, 2025Updated last year
- A platform that provides users with easy access to AI services developed by Montimage and usage of explainable AI techniques (e.g., LIME,…☆10Dec 4, 2025Updated 2 months ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- ☆11Jun 24, 2021Updated 4 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- The implementation of FedMix☆11Aug 18, 2022Updated 3 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆14May 28, 2025Updated 8 months ago
- a jax benchmark for ad hoc teamwork☆17Updated this week
- train ssd☆10Apr 30, 2019Updated 6 years ago
- ☆11Mar 13, 2023Updated 2 years ago
- 1st Place Team Crane: @aswinkumar1999 @rathull @kyolebu☆29Sep 8, 2025Updated 5 months ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 3 months ago
- Predicting the Stock Market - Can we do it?☆10Jul 24, 2021Updated 4 years ago
- [Ongoing Project] Codebase for network quantization study.☆12May 20, 2020Updated 5 years ago
- This repository is on the way to state of art face attribute prediction method☆10Mar 22, 2018Updated 7 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆12Jun 19, 2017Updated 8 years ago
- (ICCV'25) TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models (Au…☆14Aug 22, 2025Updated 5 months ago
- ☆15Feb 10, 2023Updated 3 years ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆15Aug 15, 2025Updated 5 months ago
- Animate a SVG avatar through facial Motion Capture☆11Oct 3, 2023Updated 2 years ago
- ☆13May 9, 2024Updated last year
- ☆12Oct 22, 2024Updated last year
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- An evaluation framework for data center traffic engineering.☆13Jul 28, 2024Updated last year
- Repo containing few notebooks on fine tuning of Language Models☆13Apr 29, 2024Updated last year
- machine learning specilization course 2☆12Dec 23, 2018Updated 7 years ago