ethancaballero / description2codeLinks

☆86

Alternatives and similar repositories for description2code

Users that are interested in description2code are comparing it to the libraries listed below

Sorting:

hendrycks / apps
APPS: Automated Programming Progress Standard (NeurIPS 2021)
☆479Updated last year
amazon-science / mxeval
☆110Updated last year
neulab / code-bert-score
CodeBERTScore: an automatic metric for code generation, based on BERTScore
☆196Updated last year
madaan / pie-perf
Training language models to make programs faster
☆91Updated last year
shuyanzhou / docprompting
Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023
☆248Updated last year
ntunlp / xCodeEval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
☆86Updated 10 months ago
niansong1996 / lever
Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)
☆89Updated 2 years ago
openai / human-eval-infilling
Code for the paper "Efficient Training of Language Models to Fill in the Middle"
☆183Updated 2 years ago
Zyq-scut / RLTF
Accepted by Transactions on Machine Learning Research (TMLR)
☆130Updated 10 months ago
GammaTauAI / leetcode-hard-gym
A hard gym for programming
☆160Updated last year
salesforce / CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…
☆541Updated 6 months ago
reddy-lab-code-research / PPOCoder
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆114Updated last year
rajasagashe / JuICe
Code for generating the JuICe dataset.
☆37Updated 3 years ago
dpfried / incoder
Generative model for code infilling and synthesis
☆304Updated last year
terryyz / ice-score
[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
☆76Updated last year
nuprl / MultiPL-E
A multi-programming language benchmark for LLMs
☆267Updated 3 weeks ago
facebookresearch / cruxeval
CRUXEval: Code Reasoning, Understanding, and Execution Evaluation
☆151Updated 9 months ago
allenai / Lila
A unified benchmark for math reasoning
☆88Updated 2 years ago
xlang-ai / DS-1000
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
☆251Updated 9 months ago
wellecks / naturalprover
NaturalProver: Grounded Mathematical Proof Generation with Language Models
☆38Updated 2 years ago
amazon-science / recode
Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"
☆52Updated last year
csebuetnlp / CoDesc
A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.
☆53Updated 3 years ago
asaparov / prontoqa
Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.
☆147Updated 9 months ago
amazon-science / cceval
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
☆153Updated last year
wasiahmad / AVATAR
Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.
☆55Updated last year
ntunlp / ExecEval
A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.
☆56Updated 9 months ago
shunzh / Code-AI-Tree-Search
☆119Updated last year
Leolty / repobench
✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024
☆169Updated 11 months ago
bigcode-project / bigcode-analysis
Repository for analysis and experiments in the BigCode project.
☆121Updated last year
sriniiyer / concode
Mapping Language to Code in a Programmatic Context
☆80Updated 4 years ago