facebookresearch / CodeGen
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.
☆756 · Updated last year
Alternatives and similar repositories for CodeGen
Users who are interested in CodeGen are comparing it to the libraries listed below.
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ☆534 · Updated 5 months ago
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf ☆1,714 · Updated 3 years ago
- Generative model for code infilling and synthesis ☆302 · Updated last year
- Pretrained Language Models for Source code ☆254 · Updated 4 years ago
- CodeBERT ☆2,548 · Updated last year
- CodeXGLUE ☆1,692 · Updated last year
- CodeBERTScore: an automatic metric for code generation, based on BERTScore ☆193 · Updated last year
- Website for "A Survey of Machine Learning for Big Code and Naturalness" ☆290 · Updated 4 months ago
- Guide to using pre-trained large language models of source code ☆1,827 · Updated 11 months ago
- APPS: Automated Programming Progress Standard (NeurIPS 2021) ☆470 · Updated last year
- NaturalCC: An Open-Source Toolkit for Code Intelligence ☆299 · Updated 2 weeks ago
- ☆657 · Updated 7 months ago
- Code Generation using GPT-J! ☆517 · Updated 3 years ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021]. ☆187 · Updated 3 years ago
- Code for the paper "Evaluating Large Language Models Trained on Code" ☆2,797 · Updated 5 months ago
- Ongoing research training transformer models at scale ☆389 · Updated 10 months ago
- ☆462 · Updated 10 months ago
- Datasets, tools, and benchmarks for representation learning of code. ☆2,326 · Updated 3 years ago
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence ☆74 · Updated 5 months ago
- CodeGen2 models for program synthesis ☆273 · Updated 2 years ago
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor… ☆156 · Updated last year
- [ICML 2020] DrRepair: Learning to Repair Programs from Error Messages ☆195 · Updated 4 years ago
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning ☆167 · Updated 3 years ago
- Python bindings to the Tree-sitter parsing library ☆1,110 · Updated last week
- Implementation of the paper "Language-agnostic representation learning of source code from structure and context". ☆169 · Updated 3 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation" ☆75 · Updated last year
- A multi-programming language benchmark for LLMs ☆254 · Updated last week
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation ☆3,017 · Updated last year
- A library for mining of path-based representations of code (and more) ☆290 · Updated last year
- Aix-bench, the Java benchmark for code synthesis problem. ☆51 · Updated 2 years ago