hkust-nlp / CodeIO
[ICML 2025 Spotlight] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
☆515Updated 2 months ago
Alternatives and similar repositories for CodeIO:
Users that are interested in CodeIO are comparing it to the libraries listed below
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆512Updated last month
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆808Updated last week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆495Updated 2 weeks ago
- AN O1 REPLICATION FOR CODING☆333Updated 4 months ago
- Large Reasoning Models☆804Updated 5 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆312Updated 3 weeks ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆373Updated last week
- ☆524Updated 3 weeks ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆839Updated last month
- ☆924Updated 3 months ago
- ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates☆382Updated last month
- ☆287Updated last month
- A series of technical report on Slow Thinking with LLM☆659Updated 3 weeks ago
- TTRL: Test-Time Reinforcement Learning☆407Updated last week
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆178Updated last month
- LIMO: Less is More for Reasoning☆927Updated last month
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆253Updated 2 months ago
- ☆255Updated 3 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆693Updated last month
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆244Updated 3 weeks ago
- Muon is Scalable for LLM Training☆1,039Updated last month
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆296Updated 6 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆220Updated last month
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆450Updated last month
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆180Updated this week
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆458Updated this week
- ☆739Updated 2 weeks ago
- OLMoE: Open Mixture-of-Experts Language Models☆739Updated last month
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆458Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆186Updated 3 weeks ago