A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)
☆132Mar 6, 2026Updated 2 months ago
Alternatives and similar repositories for Awesome-Routing-LLMs
Users that are interested in Awesome-Routing-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆31Sep 25, 2025Updated 7 months ago
- [Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing☆61Apr 6, 2026Updated last month
- A curated list of awesome approaches to AI model routing☆199Mar 24, 2025Updated last year
- ☆14Nov 19, 2024Updated last year
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆22Nov 17, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆33Jan 26, 2026Updated 3 months ago
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated last year
- [EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda…☆17Dec 13, 2024Updated last year
- The code of RouterDC☆72Apr 14, 2025Updated last year
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆17Aug 15, 2025Updated 9 months ago
- Dumpy: A Compact and Adaptive Index for Large Data Series Collections (SIGMOD'23)☆13Dec 12, 2023Updated 2 years ago
- [CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…☆48Oct 29, 2025Updated 6 months ago
- ROS package for iniVation Dynamic Vision System's dv-sdk.☆13Jan 11, 2022Updated 4 years ago
- ☆15Jan 24, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- We introduce EfficientRAG, an efficient retriever for multi-hop question answering. EfficientRAG iteratively generates new queries withou…☆17Mar 4, 2025Updated last year
- ☆23Feb 28, 2025Updated last year
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 11 months ago
- ☆136Aug 11, 2025Updated 9 months ago
- Route LLM requests to the best model for the task at hand.☆269May 7, 2026Updated 2 weeks ago
- Raptor - Monocular Arbitrary Moving Object Discovery and Segmentation - official code☆11Dec 15, 2025Updated 5 months ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆16Oct 24, 2022Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An extensive and commented list of resources on Learned Sparse Retrieval.☆57Apr 27, 2026Updated 3 weeks ago
- Efficiency/Effectiveness Trade-offs in Learning to Rank☆12Sep 11, 2018Updated 7 years ago
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆13Nov 11, 2024Updated last year
- 基于OpenAI的同声传译应用,包含GUi界面,安卓app,命令行功能☆66Dec 11, 2025Updated 5 months ago
- an External Function Auto-Completion Tool to Strengthen the Static Binary Lifting☆13May 13, 2024Updated 2 years ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆160Jun 13, 2024Updated last year
- Medical Concept Embedding with Multiple Ontological Representations (IJCAI-19)☆10Jul 21, 2020Updated 5 years ago
- This repo contains my customised style python based plots for NLP papers, and includes my reproduction for my favourite papers' plots☆39Mar 4, 2024Updated 2 years ago
- ☆23May 2, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Mar 6, 2026Updated 2 months ago
- RL Recommendation System☆13Aug 30, 2019Updated 6 years ago
- Chinese word segmentation with the neural seq2seq model implement in pytorch☆10Dec 13, 2017Updated 8 years ago
- CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022☆13Dec 10, 2022Updated 3 years ago
- Formal implementation of Robust Domain Misinformation Detection via Multi-modal Feature Alignment☆12Dec 8, 2023Updated 2 years ago
- ☆113Apr 13, 2026Updated last month
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago