ulab-uiuc / FusionFactory
"FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jiaxuan You
☆19 Updated last month
Alternatives and similar repositories for FusionFactory
Users who are interested in FusionFactory are comparing it to the repositories listed below.
- Code for paper: Optimizing Length Compression in Large Reasoning Models ☆27 Updated 3 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models" ☆65 Updated 3 weeks ago
- JudgeLRM: Large Reasoning Models as a Judge ☆41 Updated last week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆29 Updated 4 months ago
- ☆77 Updated 3 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆64 Updated 3 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning ☆115 Updated last month
- ☆30 Updated 4 months ago
- ☆33 Updated 6 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac… ☆34 Updated 2 weeks ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ☆13 Updated last year
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback ☆38 Updated 7 months ago
- MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents ☆40 Updated this week
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments ☆48 Updated last month
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time ☆89 Updated 8 months ago
- [ICLR 2026] Geometric-Mean Policy Optimization ☆100 Updated 2 weeks ago
- Official implementation of MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems ☆73 Updated 7 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ☆40 Updated 6 months ago
- ☆56 Updated 2 weeks ago
- ☆75 Updated 7 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆22 Updated 5 months ago
- DELT: Data Efficacy for Language Model Training ☆43 Updated 2 weeks ago
- Scaling Agentic Environments Automatically. ☆47 Updated 2 weeks ago
- ☆35 Updated 4 months ago
- ☆43 Updated 5 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning ☆60 Updated 3 months ago
- ☆51 Updated 9 months ago
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024). ☆42 Updated last year
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆71 Updated 8 months ago
- ☆36 Updated 4 months ago