facebookresearch / CodeGen
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.
☆768 Updated last year
Alternatives and similar repositories for CodeGen
Users who are interested in CodeGen are comparing it to the repositories listed below.
- Generative model for code infilling and synthesis ☆310 Updated 2 years ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ☆561 Updated 11 months ago
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf ☆1,724 Updated 4 years ago
- CodeXGLUE ☆1,795 Updated last year
- APPS: Automated Programming Progress Standard (NeurIPS 2021) ☆498 Updated last year
- Pretrained Language Models for Source code ☆252 Updated 4 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation" ☆77 Updated last year
- CodeBERTScore: an automatic metric for code generation, based on BERTScore ☆206 Updated last year
- Guide to using pre-trained large language models of source code ☆1,840 Updated last year
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence ☆90 Updated 11 months ago
- Code Generation using GPT-J! ☆516 Updated 3 years ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021]. ☆187 Updated 3 years ago
- ☆672 Updated last year
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code" ☆564 Updated 5 months ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness" ☆292 Updated 10 months ago
- Minimal library to train LLMs on TPU in JAX with pjit(). ☆300 Updated 2 years ago
- Implementation of the paper "Language-agnostic representation learning of source code from structure and context". ☆172 Updated 3 years ago
- CodeBERT ☆2,712 Updated 2 years ago
- Training language models to make programs faster ☆97 Updated last year
- 🐙 OctoPack: Instruction Tuning Code Large Language Models ☆479 Updated 10 months ago
- ☆486 Updated last year
- NaturalCC: An Open-Source Toolkit for Code Intelligence ☆311 Updated 3 months ago
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation ☆3,092 Updated last year
- A library for mining of path-based representations of code (and more) ☆299 Updated last month
- A static analysis library for computing graph representations of Python programs suitable for use with graph neural networks. ☆339 Updated 2 years ago
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor… ☆172 Updated 2 years ago
- Ongoing research training transformer models at scale ☆394 Updated last year
- A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering ☆726 Updated last month
- TensorFlow code for the neural network presented in the paper: "Structural Language Models of Code" (ICML'2020) ☆91 Updated 3 years ago
- Fine-tune SantaCoder for Code/Text Generation. ☆194 Updated 2 years ago