☆21Feb 22, 2026Updated last week
Alternatives and similar repositories for SplitReason
Users that are interested in SplitReason are comparing it to the libraries listed below
Sorting:
- ☆32Oct 13, 2025Updated 4 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆12Sep 22, 2025Updated 5 months ago
- ☆15Nov 7, 2024Updated last year
- ☆46Sep 27, 2025Updated 5 months ago
- ☆21Jul 18, 2024Updated last year
- CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving (NAACL 2024 Findings))☆16Apr 26, 2024Updated last year
- ☆20May 14, 2025Updated 9 months ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆57Dec 26, 2025Updated 2 months ago
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆30Jul 6, 2025Updated 7 months ago
- PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration☆41Jan 7, 2026Updated last month
- ☆179Dec 5, 2025Updated 2 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆22Oct 10, 2024Updated last year
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆27May 13, 2025Updated 9 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆57Nov 5, 2025Updated 3 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆30Jan 10, 2026Updated last month
- ☆23Feb 18, 2025Updated last year
- ☆31Sep 12, 2025Updated 5 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆25Dec 16, 2024Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆80May 30, 2025Updated 9 months ago
- ☆33Nov 18, 2025Updated 3 months ago
- ☆60Jan 12, 2026Updated last month
- ☆28May 24, 2025Updated 9 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆28Jun 23, 2025Updated 8 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ☆55Jul 7, 2025Updated 7 months ago
- Distributed Optimization Infra for learning CLIP models☆27Oct 3, 2024Updated last year
- ☆39May 20, 2025Updated 9 months ago
- ☆31Feb 9, 2025Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"☆86May 27, 2025Updated 9 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- ☆55Jun 4, 2025Updated 8 months ago
- ☆34May 9, 2025Updated 9 months ago
- [ICLR 2026] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆41May 20, 2025Updated 9 months ago
- Code for paper "Stylized Dialogue Response Generation Using Stylized Unpaired Texts"☆31Aug 18, 2022Updated 3 years ago