CP-Zero follows the R1-Zero method, training with rule-based rewards and GRPO on the Code Contests dataset.
☆18 · Apr 22, 2025 · Updated 11 months ago
Alternatives and similar repositories for CP-Zero
Users who are interested in CP-Zero are comparing it to the libraries listed below.
- All-in-one benchmarking platform for evaluating LLMs. ☆15 · Nov 12, 2025 · Updated 4 months ago
- ☆20 · Oct 10, 2025 · Updated 5 months ago
- ☆77 · Mar 6, 2026 · Updated 2 weeks ago
- Reproducing R1 for Code with Reliable Rewards ☆297 · May 5, 2025 · Updated 10 months ago
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live! ☆171 · Mar 9, 2026 · Updated 2 weeks ago
- Competitive Programming Code Template ☆11 · Nov 6, 2022 · Updated 3 years ago
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs" ☆16 · May 24, 2025 · Updated 9 months ago
- A Scrapy crawler that crawls problems and their best solutions on codeforces.com ☆13 · Feb 25, 2016 · Updated 10 years ago
- CAN Bus Voltage Dataset for the SIMPLE paper ☆11 · Oct 2, 2019 · Updated 6 years ago
- OpenAI GPT For Python Developers ☆12 · Jun 9, 2023 · Updated 2 years ago
- ☆42 · Nov 8, 2025 · Updated 4 months ago
- Training Segment Anything Model (SAM) by MetaAI from scratch and fine-tuning it with NDIS Park (Night and Day Instance Segmented Park) data… ☆13 · Jun 21, 2025 · Updated 9 months ago
- Async pipelined version of Verl ☆124 · Apr 8, 2025 · Updated 11 months ago
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings ☆67 · Feb 3, 2025 · Updated last year
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs ☆22 · Apr 24, 2025 · Updated 10 months ago
- A First Look at Conventional Commits Classification ☆13 · Nov 18, 2024 · Updated last year
- ☆51 · Mar 9, 2026 · Updated 2 weeks ago
- Labs of 2019 Web Information Processing and Application in USTC. ☆11 · Jan 15, 2020 · Updated 6 years ago
- ☆35 · Jan 25, 2026 · Updated last month
- ☆20 · May 24, 2025 · Updated 9 months ago
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology ☆10 · Nov 19, 2020 · Updated 5 years ago
- Repository with details for an ECUPrint dataset (CAN logs and CAN voltage samples) ☆20 · Oct 2, 2022 · Updated 3 years ago
- ☆17 · Oct 31, 2023 · Updated 2 years ago
- 2022 USTC 011705 (OSH) Course Project of Runikraft Group ☆13 · Jul 22, 2022 · Updated 3 years ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework ☆313 · Sep 6, 2025 · Updated 6 months ago
- On Memorization of Large Language Models in Logical Reasoning ☆76 · Mar 29, 2025 · Updated 11 months ago
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning ☆36 · Jul 15, 2025 · Updated 8 months ago
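- Re-identification of campus bicycles under surveillance-camera image quality, including ReID model recognition, vector database retrieval, and a UI display ☆11 · Feb 13, 2024 · Updated 2 years ago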
- Minimal Tensorflow implementation of the paper "Neural Architecture Search With Reinforcement Learning" presented at ICLR 2017 ☆41 · Dec 5, 2017 · Updated 8 years ago
- ☆16 · Feb 6, 2024 · Updated 2 years ago
- ☆11 · Mar 15, 2024 · Updated 2 years ago
- Official implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation". ☆20 · Oct 29, 2025 · Updated 4 months ago
- ☆11 · Dec 15, 2025 · Updated 3 months ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 · Jul 16, 2025 · Updated 8 months ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning ☆39 · Jan 16, 2026 · Updated 2 months ago
- ☆19 · Jun 19, 2022 · Updated 3 years ago
- A cooler tour the same as scala language ☆19 · Mar 19, 2016 · Updated 10 years ago
- ☆12 · Jun 15, 2023 · Updated 2 years ago
- RepoLaunch is an agentic SWE tool aimed at automating the build, execution and test of GitHub repositories across programming languages a… ☆53 · Mar 8, 2026 · Updated 2 weeks ago