✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
☆311Sep 6, 2025Updated 5 months ago
Alternatives and similar repositories for kodcode
Users that are interested in kodcode are comparing it to the libraries listed below
Sorting:
- Reproducing R1 for Code with Reliable Rewards☆288May 5, 2025Updated 9 months ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 10 months ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated 10 months ago
- ☆20Oct 10, 2025Updated 4 months ago
- Muon fsdp 2☆53Aug 8, 2025Updated 6 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆677Mar 16, 2025Updated 11 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Official Repo for Open-Reasoner-Zero☆2,084Jun 2, 2025Updated 8 months ago
- Heuristic filtering framework for RefineCode☆82Mar 13, 2025Updated 11 months ago
- Recipes to train the self-rewarding reasoning LLMs.☆231Mar 2, 2025Updated 11 months ago
- ☆50Aug 21, 2025Updated 6 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆282Sep 25, 2025Updated 5 months ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆34Jan 16, 2026Updated last month
- ☆13Dec 12, 2025Updated 2 months ago
- Backdooring Neural Code Search☆14Sep 8, 2023Updated 2 years ago
- Async pipelined version of Verl☆124Apr 8, 2025Updated 10 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆120Dec 10, 2024Updated last year
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆104Sep 24, 2025Updated 5 months ago
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 3 months ago
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…☆13Aug 8, 2025Updated 6 months ago
- a-m-team's exploration in large language modeling☆194May 29, 2025Updated 8 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆72Feb 25, 2025Updated last year
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆741Jun 6, 2025Updated 8 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆577Updated this week
- ☆33Sep 14, 2025Updated 5 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆243Jul 13, 2025Updated 7 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆166Oct 11, 2024Updated last year
- Align Anything: Training All-modality Model with Feedback☆4,636Nov 27, 2025Updated 3 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆183Jul 23, 2025Updated 7 months ago
- Your efficient and accurate answer verification system for RL training.☆41Jun 23, 2025Updated 8 months ago
- Run AI models end-to-end encrypted.☆3,060Feb 10, 2025Updated last year
- 💰唯一正版💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy 矿池抽水 矿池代理 矿池中转 矿池抽…☆3,882Feb 2, 2026Updated 3 weeks ago
- Visual Storytelling post-edit dataset☆18Sep 27, 2019Updated 6 years ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated 11 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆152Sep 19, 2025Updated 5 months ago
- [COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.☆109Apr 4, 2025Updated 10 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆134Jan 31, 2026Updated 3 weeks ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆391Jan 19, 2025Updated last year
- Scalable RL solution for advanced reasoning of language models☆1,806Mar 18, 2025Updated 11 months ago