MilkThink-Lab / Awesome-Routing-LLMs
A curated list of awesome works in the Routing LLMs paradigm (Welcome to submit your contributions to this code repository)
⭐ 52 · Updated last month
Alternatives and similar repositories for Awesome-Routing-LLMs
Users that are interested in Awesome-Routing-LLMs are comparing it to the libraries listed below
- The related works and background techniques about OpenAI o1 ⭐ 224 · Updated 7 months ago
- ⭐ 95 · Updated last week
- A live reading list for LLM data synthesis (Updated to July, 2025). ⭐ 366 · Updated last week
- ⭐ 52 · Updated last week
- a-m-team's exploration in large language modeling ⭐ 186 · Updated 3 months ago
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ⭐ 151 · Updated last year
- An Awesome List of Agentic Models trained with Reinforcement Learning ⭐ 420 · Updated this week
- ⭐ 318 · Updated 2 months ago
- A Comprehensive Survey on Long Context Language Modeling ⭐ 180 · Updated last month
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ⭐ 269 · Updated 2 years ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (… ⭐ 298 · Updated this week
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future ⭐ 461 · Updated 7 months ago
- ⭐ 31 · Updated 3 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"… ⭐ 80 · Updated last year
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ⭐ 243 · Updated 3 weeks ago
- ⭐ 145 · Updated last year
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble" ⭐ 116 · Updated last month
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ⭐ 173 · Updated 2 months ago
- ⭐ 274 · Updated 3 months ago
- ⭐ 83 · Updated last year
- An implementation of the paper "Improve Mathematical Reasoning in Language Models by Automated Process Supervision" from Google de… ⭐ 39 · Updated last month
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ⭐ 134 · Updated 11 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ⭐ 103 · Updated 3 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ⭐ 568 · Updated 4 months ago
- Collection of papers for scalable automated alignment. ⭐ 93 · Updated 10 months ago
- Awesome papers for role-playing with language models ⭐ 199 · Updated 9 months ago
- Awesome Agent Training ⭐ 215 · Updated 3 weeks ago
- ⭐ 158 · Updated 7 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA ⭐ 139 · Updated 9 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo… ⭐ 385 · Updated 2 months ago