ADaM-BJTU/O1-CODER

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ADaM-BJTU/O1-CODER)

ADaM-BJTU / O1-CODER

AN O1 REPLICATION FOR CODING

☆332

Alternatives and similar repositories for O1-CODER

Users that are interested in O1-CODER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ADaM-BJTU / OpenRFT
View on GitHub
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
☆157Dec 24, 2024Updated last year
ADaM-BJTU / W2SG
View on GitHub
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆17Feb 26, 2024Updated 2 years ago
openreasoner / openr
View on GitHub
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
☆1,848Jan 17, 2025Updated last year
ATH-MaaS / Marco-o1
View on GitHub
An Open Large Reasoning Model for Real-World Solutions
☆1,537Jun 17, 2026Updated last month
SimpleBerry / LLaMA-O1
View on GitHub
Large Reasoning Models
☆803Dec 3, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
GAIR-NLP / O1-Journey
View on GitHub
O1 Replication Journey
☆2,001Jan 14, 2025Updated last year
Open-Source-O1 / Open-O1
View on GitHub
☆1,340Nov 21, 2024Updated last year
zhentingqi / rStar
View on GitHub
☆972Jan 23, 2025Updated last year
ADaM-BJTU / Mind_with_eyes_Awesome_MLLMs_Reasoning
View on GitHub
This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!
☆56Mar 21, 2025Updated last year
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
krystalan / DRT
View on GitHub
Deep Reasoning Translation (DRT) Project
☆242Sep 1, 2025Updated 10 months ago
ADaM-BJTU / MemAct
View on GitHub
☆30Nov 29, 2025Updated 8 months ago
hijkzzz / Awesome-LLM-Strawberry
View on GitHub
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
☆6,893Dec 17, 2025Updated 7 months ago
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,855Jul 14, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,740Updated this week
PRIME-RL / PRIME
View on GitHub
Scalable RL solution for advanced reasoning of language models
☆1,866Mar 18, 2025Updated last year
ADaM-BJTU / AutoCoA
View on GitHub
AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…
☆132Mar 18, 2025Updated last year
RUCAIBox / Slow_Thinking_with_LLMs
View on GitHub
A series of technical report on Slow Thinking with LLM
☆767Aug 13, 2025Updated 11 months ago
Open-Reasoner-Zero / Open-Reasoner-Zero
View on GitHub
Official Repo for Open-Reasoner-Zero
☆2,096Jun 2, 2025Updated last year
NEUIR / COAST
View on GitHub
Official repository for the paper "COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis".
☆18Feb 19, 2025Updated last year
YuxiXie / MCTS-DPO
View on GitHub
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331Jan 29, 2026Updated 6 months ago
lqtrung1998 / mwp_ReFT
View on GitHub
☆554Jan 2, 2025Updated last year
swtheing / PF-PPO-RLHF
View on GitHub
☆34Sep 14, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
bigcode-project / selfcodealign
View on GitHub
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
☆323Feb 24, 2025Updated last year
THUDM / ReST-MCTS
View on GitHub
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆709Jan 20, 2025Updated last year
efficientscaling / Z1
View on GitHub
[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"
☆69Apr 11, 2025Updated last year
GAIR-NLP / ProX
View on GitHub
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆272Jul 8, 2025Updated last year
facebookresearch / swe-rl
View on GitHub
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆712Mar 16, 2025Updated last year
huggingface / search-and-learn
View on GitHub
Recipes to scale inference-time compute of open models
☆1,130May 26, 2026Updated 2 months ago
build-with-groq / g1
View on GitHub
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
☆4,173Dec 30, 2025Updated 6 months ago
ezelikman / quiet-star
View on GitHub
Code for Quiet-STaR
☆739Aug 21, 2024Updated last year
ganler / code-r1
View on GitHub
Reproducing R1 for Code with Reliable Rewards
☆313May 5, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Ablustrund / APPS_Plus
View on GitHub
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
☆73Aug 31, 2024Updated last year
trotsky1997 / MathBlackBox
View on GitHub
☆1,033Dec 17, 2024Updated last year
SkyworkAI / skywork-o1-prm-inference
View on GitHub
☆69Nov 26, 2024Updated last year
FreedomIntelligence / HuatuoGPT-o1
View on GitHub
Medical o1, Towards medical complex reasoning with LLMs
☆1,344Jan 20, 2025Updated last year
aorwall / moatless-tree-search
View on GitHub
☆141Jun 6, 2025Updated last year
hkust-nlp / simpleRL-reason
View on GitHub
Simple RL training for reasoning
☆3,871Dec 23, 2025Updated 7 months ago
GAIR-NLP / LIMR
View on GitHub
☆221Feb 20, 2025Updated last year