MilkThink-Lab / Awesome-Routing-LLMs
A curated list of awesome works in the Routing LLMs paradigm (contributions to this repository are welcome)
☆70 Updated 3 weeks ago
Alternatives and similar repositories for Awesome-Routing-LLMs
Users who are interested in Awesome-Routing-LLMs are comparing it to the repositories listed below
- ☆369 Updated 3 weeks ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆282 Updated 2 weeks ago
- The related works and background techniques about OpenAI o1 ☆223 Updated 10 months ago
- ☆307 Updated 5 months ago
- Awesome List for Agentic RL ☆531 Updated 3 weeks ago
- ☆134 Updated last week
- ☆70 Updated 2 weeks ago
- ☆415 Updated 3 weeks ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (… ☆384 Updated this week
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ☆133 Updated 5 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆132 Updated 8 months ago
- A comprehensive collection of process reward models. ☆116 Updated last month
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering ☆207 Updated 6 months ago
- A Comprehensive Survey on Long Context Language Modeling ☆199 Updated 4 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆85 Updated 5 months ago
- A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond ☆312 Updated 2 weeks ago
- Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search, where AI agents plan, search, and reason to… ☆46 Updated 2 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆254 Updated 2 months ago
- ☆452 Updated 3 months ago
- a-m-team's exploration in large language modeling ☆191 Updated 5 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆56 Updated 11 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆144 Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning" ☆139 Updated 2 weeks ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆326 Updated last week
- A live reading list for LLM data synthesis (Updated to July, 2025). ☆399 Updated 2 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench. ☆192 Updated 6 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"… ☆81 Updated 2 years ago
- ☆280 Updated 4 months ago
- ☆147 Updated last week
- Official Repository of "Learning to Reason under Off-Policy Guidance" ☆360 Updated last month