The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆97Apr 9, 2025Updated 10 months ago
Alternatives and similar repositories for AceCoder
Users that are interested in AceCoder are comparing it to the libraries listed below
Sorting:
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Jul 1, 2025Updated 7 months ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]☆38Feb 1, 2026Updated 3 weeks ago
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- Reproducing R1 for Code with Reliable Rewards☆288May 5, 2025Updated 9 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated 11 months ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆41Jul 21, 2025Updated 7 months ago
- ☆17Aug 1, 2025Updated 6 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆221Nov 27, 2025Updated 3 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆182Jun 5, 2025Updated 8 months ago
- ☆21Jul 21, 2025Updated 7 months ago
- A version of verl to support diverse tool use☆879Feb 19, 2026Updated last week
- ☆19Aug 4, 2025Updated 6 months ago
- official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"☆60Dec 20, 2023Updated 2 years ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆180Jul 8, 2025Updated 7 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated 3 weeks ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆568May 6, 2025Updated 9 months ago
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- R3: Robust Rubric-Agnostic Reward Models☆20Jul 12, 2025Updated 7 months ago
- ☆35May 16, 2025Updated 9 months ago
- ☆342Jun 5, 2025Updated 8 months ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆21Feb 27, 2025Updated last year
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 7 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆114Jan 23, 2025Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆121May 6, 2025Updated 9 months ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Sep 18, 2025Updated 5 months ago
- Preparing for ML Interviews.☆54Jan 12, 2026Updated last month
- ☆53Feb 11, 2025Updated last year
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆38May 15, 2024Updated last year
- Official Repo for Open-Reasoner-Zero☆2,084Jun 2, 2025Updated 8 months ago
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI☆479Jan 3, 2026Updated last month
- GenRM-CoT: Data release for verification rationales☆68Oct 16, 2024Updated last year
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆633Jan 29, 2026Updated last month
- ☆15Feb 12, 2026Updated 2 weeks ago
- ☆25Aug 19, 2025Updated 6 months ago
- ☆16Feb 22, 2025Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆51Jul 15, 2025Updated 7 months ago
- ☆14Mar 20, 2025Updated 11 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆107Mar 6, 2025Updated 11 months ago
- X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests☆79Feb 7, 2026Updated 3 weeks ago