hkust-nlp / CodeIO
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
☆484Updated last month
Alternatives and similar repositories for CodeIO:
Users that are interested in CodeIO are comparing it to the libraries listed below
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆474Updated last week
- Large Reasoning Models☆800Updated 3 months ago
- ☆485Updated last week
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning☆395Updated this week
- ☆518Updated last week
- AN O1 REPLICATION FOR CODING☆329Updated 3 months ago
- ☆913Updated 2 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆748Updated 3 weeks ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆376Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆915Updated this week
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆664Updated last week
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆177Updated 2 weeks ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆133Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆338Updated 9 months ago
- LIMO: Less is More for Reasoning☆875Updated last month
- ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates☆353Updated last week
- Automatic evals for LLMs☆346Updated this week
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆293Updated 5 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆334Updated last month
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆229Updated last month
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆403Updated 2 weeks ago
- ☆312Updated 6 months ago
- ☆559Updated 2 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆310Updated 3 months ago
- A series of technical report on Slow Thinking with LLM☆595Updated last week
- ☆262Updated last week
- The RedStone repository includes code for preparing extensive datasets used in training large language models.☆120Updated last month
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆395Updated last month
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆231Updated last month
- Understanding R1-Zero-Like Training: A Critical Perspective☆568Updated this week