hendrycks/apps

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hendrycks/apps)

hendrycks / apps

APPS: Automated Programming Progress Standard (NeurIPS 2021)

☆534

Alternatives and similar repositories for apps

Users that are interested in apps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

salesforce / CodeRL
View on GitHub
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…
☆573Jun 2, 2026Updated last month
openai / human-eval
View on GitHub
Code for the paper "Evaluating Large Language Models Trained on Code"
☆3,310Jan 17, 2025Updated last year
reddy-lab-code-research / PPOCoder
View on GitHub
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆116Jan 9, 2024Updated 2 years ago
evalplus / evalplus
View on GitHub
Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
☆1,782Oct 2, 2025Updated 9 months ago
google-deepmind / code_contests
View on GitHub
☆2,203Oct 3, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
microsoft / CodeXGLUE
View on GitHub
CodeXGLUE
☆1,831Apr 23, 2024Updated 2 years ago
shunzh / Code-AI-Tree-Search
View on GitHub
☆118Jul 17, 2024Updated 2 years ago
hendrycks / math
View on GitHub
The MATH Dataset (NeurIPS 2021)
☆1,375Sep 6, 2025Updated 10 months ago
xlang-ai / DS-1000
View on GitHub
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
☆275Oct 30, 2024Updated last year
mahimanzum / FixEval
View on GitHub
We introduce FixEval , a dataset for competitive programming bug fixing along with a comprehensive test suite and show the necessity of e…
☆26Aug 31, 2022Updated 3 years ago
dpfried / incoder
View on GitHub
Generative model for code infilling and synthesis
☆312Sep 9, 2023Updated 2 years ago
CoderEval / CoderEval
View on GitHub
A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI.
☆158Dec 25, 2024Updated last year
microsoft / CodeT
View on GitHub
☆677Nov 1, 2024Updated last year
llylly / RANUM
View on GitHub
[ICSE 2023] Differentiable interpretation and failure-inducing input generation for neural network numerical bugs.
☆13Jan 5, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
abacaj / code-eval
View on GitHub
Run evaluation on LLMs using human-eval benchmark
☆429Sep 12, 2023Updated 2 years ago
bigcode-project / bigcode-evaluation-harness
View on GitHub
A framework for the evaluation of autoregressive code generation language models.
☆1,052Jul 22, 2025Updated 11 months ago
ethancaballero / description2code
View on GitHub
☆90Mar 21, 2022Updated 4 years ago
amazon-science / recode
View on GitHub
Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"
☆58Mar 20, 2024Updated 2 years ago
nuprl / MultiPL-E
View on GitHub
A multi-programming language benchmark for LLMs
☆312Apr 12, 2026Updated 3 months ago
reddy-lab-code-research / XLCoST
View on GitHub
Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence
☆95Jan 21, 2025Updated last year
yuewang-cuhk / awesome-programming-language-pretraining-papers
View on GitHub
Recent Advances in Programming Language Pre-Trained Models (PL-PTMs)
☆60Dec 17, 2021Updated 4 years ago
BuiltOntheRock / FSE22_BuiltOntheRock
View on GitHub
☆26Jul 19, 2022Updated 4 years ago
Zyq-scut / RLTF
View on GitHub
Accepted by Transactions on Machine Learning Research (TMLR)
☆135Oct 5, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
terryyz / ice-score
View on GitHub
[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
☆79Jun 16, 2024Updated 2 years ago
wasiahmad / PLBART
View on GitHub
Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].
☆186Mar 1, 2022Updated 4 years ago
floatai / HumanEval-XL
View on GitHub
[LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
☆42Mar 7, 2025Updated last year
FlagOpen / TACO
View on GitHub
☆239Feb 28, 2026Updated 4 months ago
FudanSELab / ClassEval
View on GitHub
Benchmark ClassEval for class-level code generation.
☆151Oct 24, 2024Updated last year
github / CodeSearchNet
View on GitHub
Datasets, tools, and benchmarks for representation learning of code.
☆2,442Jan 31, 2022Updated 4 years ago
reddy-lab-code-research / MuST-CoST
View on GitHub
Code and data for AAAI 2022 paper "Multilingual Code Snippets Training for Program Translation"
☆11Mar 7, 2022Updated 4 years ago
agemagician / CodeTrans
View on GitHub
Pretrained Language Models for Source code
☆257Jun 1, 2021Updated 5 years ago
huangd1999 / EffiCoder
View on GitHub
[ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning
☆16May 24, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
salesforce / CodeT5
View on GitHub
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
☆3,101Jun 25, 2026Updated 3 weeks ago
nyu-mll / ILF-for-code-generation
View on GitHub
☆81Mar 24, 2025Updated last year
microsoft / CodeBERT
View on GitHub
CodeBERT
☆2,786Jul 9, 2023Updated 3 years ago
ARiSE-Lab / CYCLE_OOPSLA_24
View on GitHub
Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"
☆10Mar 8, 2024Updated 2 years ago
facebookresearch / CodeGen
View on GitHub
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d…
☆776Mar 12, 2026Updated 4 months ago
IBM / Project_CodeNet
View on GitHub
This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX
☆1,687Dec 21, 2025Updated 6 months ago
zysszy / CAT
View on GitHub
Improving Machine Translation Systems via Isotopic Replacement
☆12Apr 14, 2023Updated 3 years ago