ulab-uiuc / FusionFactoryLinks
"FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jiaxuan You
☆17Updated 2 months ago
Alternatives and similar repositories for FusionFactory
Users that are interested in FusionFactory are comparing it to the libraries listed below
Sorting:
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆80Updated last month
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆21Updated 3 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated 2 weeks ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆62Updated 2 months ago
- ☆15Updated 6 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆88Updated 10 months ago
- ☆41Updated 4 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆82Updated this week
- Official implementation of MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems☆70Updated 6 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆97Updated last year
- ☆140Updated 3 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆25Updated 4 months ago
- ☆31Updated 4 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆126Updated 8 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆93Updated last month
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆67Updated 7 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆86Updated 6 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆72Updated 8 months ago
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆61Updated last month
- ☆51Updated 7 months ago
- ☆95Updated 9 months ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆26Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 6 months ago
- Implementation of the MATRIX framework (ICML 2024)☆60Updated last year
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆28Updated 6 months ago
- Reinforced Multi-LLM Agents training☆60Updated 6 months ago
- ☆75Updated last month
- ☆175Updated 3 weeks ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆114Updated 4 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆54Updated this week