facebookresearch / CodeGenLinks
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.
☆768Updated last year
Alternatives and similar repositories for CodeGen
Users that are interested in CodeGen are comparing it to the libraries listed below
Sorting:
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆558Updated last year
- Generative model for code infilling and synthesis☆313Updated 2 years ago
- CodeXGLUE☆1,799Updated last year
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf☆1,727Updated 4 years ago
- APPS: Automated Programming Progress Standard (NeurIPS 2021)☆501Updated last year
- Guide to using pre-trained large language models of source code☆1,841Updated last year
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆207Updated last year
- Minimal library to train LLMs on TPU in JAX with pjit().☆301Updated 2 years ago
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆91Updated last year
- Pretrained Language Models for Source code☆253Updated 4 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆79Updated 2 years ago
- ☆671Updated last year
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆186Updated 3 years ago
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation☆3,099Updated 2 years ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Updated 11 months ago
- A static analysis library for computing graph representations of Python programs suitable for use with graph neural networks.☆340Updated 2 years ago
- Code Generation using GPT-J!☆516Updated 3 years ago
- Implementation of the paper "Language-agnostic representation learning of source code from structure and context".☆172Updated 3 years ago
- A pre-trained GPT model for Python code completion and generation☆282Updated 2 years ago
- CodeGen2 models for program synthesis☆271Updated 2 years ago
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor…☆172Updated 2 years ago
- ☆489Updated last year
- CodeBERT☆2,726Updated 2 years ago
- Training language models to make programs faster☆98Updated last year
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"☆565Updated 6 months ago
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆313Updated 2 weeks ago
- TensorFlow code for the neural network presented in the paper: "Structural Language Models of Code" (ICML'2020)☆91Updated 3 years ago
- A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering☆730Updated 3 months ago
- Ongoing research training transformer models at scale☆395Updated last year
- Aix-bench, the Java benchmark for code synthesis problem.☆51Updated 3 years ago