hkust-nlp / CodeIO
[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
☆537 Updated 2 months ago
Alternatives and similar repositories for CodeIO
Users who are interested in CodeIO are comparing it to the repositories listed below
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆566 Updated 3 months ago
- AN O1 REPLICATION FOR CODING ☆335 Updated 7 months ago
- ☆585 Updated 2 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆420 Updated last month
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling ☆410 Updated last month
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆157 Updated this week
- Large Reasoning Models ☆805 Updated 7 months ago
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code" ☆582 Updated 3 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆503 Updated 3 months ago
- ☆609 Updated last month
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆1,064 Updated last month
- SkyRL: A Modular Full-stack RL Library for LLMs ☆574 Updated this week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆588 Updated last month
- [COLM 2025] LIMO: Less is More for Reasoning ☆977 Updated this week
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling ☆447 Updated last week
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆728 Updated 3 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ☆976 Updated 2 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆251 Updated this week
- ☆796 Updated last month
- ☆460 Updated 2 weeks ago
- ☆946 Updated 5 months ago
- ☆319 Updated 9 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆527 Updated last month
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't" ☆244 Updated 2 months ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e… ☆499 Updated 4 months ago
- ☆728 Updated last month
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038) ☆185 Updated 3 months ago
- Scaling Data for SWE-agents ☆283 Updated this week
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆643 Updated 2 weeks ago
- ☆228 Updated last month