Ablustrund/APPS_Plus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ablustrund/APPS_Plus)

Ablustrund / APPS_Plus

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

☆73

Alternatives and similar repositories for APPS_Plus

Users that are interested in APPS_Plus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Zyq-scut / RLTF
View on GitHub
Accepted by Transactions on Machine Learning Research (TMLR)
☆135Oct 5, 2024Updated last year
reddy-lab-code-research / PPOCoder
View on GitHub
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆116Jan 9, 2024Updated 2 years ago
hbin0701 / Self-Explore
View on GitHub
[𝐄𝐌𝐍𝐋𝐏 𝐅𝐢𝐧𝐝𝐢𝐧𝐠𝐬 𝟐𝟎𝟐𝟒 & 𝐀𝐂𝐋 𝟐𝟎𝟐𝟒 𝐍𝐋𝐑𝐒𝐄 𝐎𝐫𝐚𝐥] 𝘌𝘯𝘩𝘢𝘯𝘤𝘪𝘯𝘨 𝘔𝘢𝘵𝘩𝘦𝘮𝘢𝘵𝘪𝘤𝘢𝘭 𝘙𝘦𝘢𝘴𝘰𝘯𝘪𝘯…
☆52May 4, 2024Updated 2 years ago
zorazrw / odex
View on GitHub
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆49Dec 22, 2023Updated 2 years ago
martin-wey / CodeUltraFeedback
View on GitHub
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)
☆76Jun 25, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Romelium / mpatch
View on GitHub
Applies diffs based on context, not line numbers. Useful for AI-generated code.
☆19Jun 2, 2026Updated last month
microsoft / MageBench
View on GitHub
Official Repo for MageBench: Bridging Large Multimodal Models to Agents
☆22Jan 8, 2025Updated last year
niansong1996 / lever
View on GitHub
Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)
☆90Jul 5, 2023Updated 3 years ago
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
RUCAIBox / SWE-World
View on GitHub
☆49Mar 6, 2026Updated 4 months ago
multi-swe-bench / MagentLess
View on GitHub
☆13Jul 31, 2025Updated 11 months ago
OSU-NLP-Group / Pangu
View on GitHub
☆12Jul 10, 2023Updated 3 years ago
icip-cas / awesome-auto-alignment
View on GitHub
Collection of papers for scalable automated alignment.
☆92Oct 22, 2024Updated last year
salesforce / CodeRL
View on GitHub
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…
☆573Jun 2, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
shachardon / naturally_occurring_feedback
View on GitHub
☆14Dec 1, 2025Updated 7 months ago
giganticode / run_bug_run
View on GitHub
The RunBugRun dataset of executable bugs
☆25Sep 24, 2025Updated 10 months ago
RickySkywalker / LeanOfThought-Official
View on GitHub
This is the official implementation for MA-LoT.
☆20Aug 4, 2025Updated 11 months ago
dhgottesman / keen_estimating_knowledge_in_llms
View on GitHub
☆18Nov 5, 2025Updated 8 months ago
newfacade / LeetCodeDataset
View on GitHub
LeetCode Training and Evaluation Dataset
☆53Apr 22, 2025Updated last year
qishenghu / InstructCoder
View on GitHub
InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw
☆66Oct 4, 2024Updated last year
rookie-joe / FormalAlign
View on GitHub
☆17Jul 12, 2025Updated last year
zhangmiaosen2000 / Towards-On-Policy-SFT
View on GitHub
☆19Mar 26, 2026Updated 4 months ago
Miracle-Messi / Isa-AutoFormal
View on GitHub
☆17Oct 27, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
bammt / Learn-to-check
View on GitHub
the datasets of our paper
☆11Feb 26, 2024Updated 2 years ago
albertqjiang / MMA
View on GitHub
The official repository for the paper Multilingual Mathematical Autoformalization
☆39May 20, 2024Updated 2 years ago
zhangmiaosen2000 / Phi-Ground
View on GitHub
Home page for Microsoft Phi-Ground tech-report
☆22Sep 8, 2025Updated 10 months ago
multimodal-art-projection / CriticLean
View on GitHub
☆50Aug 5, 2025Updated 11 months ago
CodeEditorBench / CodeEditorBench
View on GitHub
☆58May 28, 2024Updated 2 years ago
LiaoMengqi / E3-RL4LLMs
View on GitHub
[ EMNLP 2025 Main ] Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
☆17Nov 7, 2025Updated 8 months ago
YihongDong / CDD-TED4LLMs
View on GitHub
☆16Nov 26, 2024Updated last year
zhaoxlpku / SubgoalXL
View on GitHub
☆26Aug 23, 2024Updated last year
yilunzhao / RobuT
View on GitHub
Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"
☆15Feb 8, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ArjunPanickssery / self_recognition
View on GitHub
☆10May 17, 2024Updated 2 years ago
yeahrmek / pylean
View on GitHub
Python wrapper for lean-gym
☆13Apr 5, 2023Updated 3 years ago
julianmichael / debate
View on GitHub
Debate interface, experiments, etc.
☆11Mar 12, 2024Updated 2 years ago
GAIR-NLP / OlympicArena
View on GitHub
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆106Mar 6, 2025Updated last year
uclaml / COPS
View on GitHub
The official implementation of Cross-Task Experience Sharing (COPS)
☆29Oct 23, 2024Updated last year
XiaojuanTang / Mars
View on GitHub
a benchmark to evaluate the situated inductive reasoning
☆16Jan 7, 2025Updated last year
GavinZhengOI / LiveCodeBench-Pro
View on GitHub
☆176Dec 13, 2025Updated 7 months ago