facebookresearch / CodeGen
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.
☆743Updated last year
Alternatives and similar repositories for CodeGen:
Users that are interested in CodeGen are comparing it to the libraries listed below
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf☆1,708Updated 3 years ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆525Updated 3 months ago
- CodeXGLUE☆1,655Updated last year
- Generative model for code infilling and synthesis☆300Updated last year
- Pretrained Language Models for Source code☆254Updated 3 years ago
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆190Updated last year
- APPS: Automated Programming Progress Standard (NeurIPS 2021)☆461Updated 10 months ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆187Updated 3 years ago
- A static analysis library for computing graph representations of Python programs suitable for use with graph neural networks.☆332Updated last year
- Implementation of the paper "Language-agnostic representation learning of source code from structure and context".☆169Updated 3 years ago
- Contrastive Code Representation Learning: functionality-based JavaScript embeddings through self-supervised learning☆165Updated 3 years ago
- A multi-programming language benchmark for LLMs☆241Updated 3 months ago
- This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX☆1,583Updated 2 years ago
- ☆649Updated 5 months ago
- [ICML 2021] Break-It-Fix-It: Unsupervised Learning for Program Repair☆113Updated 2 years ago
- CodeGen2 models for program synthesis☆274Updated last year
- Datasets, tools, and benchmarks for representation learning of code.☆2,288Updated 3 years ago
- Replication Code for "Self-Supervised Bug Detection and Repair" NeurIPS 2021☆111Updated 2 years ago
- Minimal library to train LLMs on TPU in JAX with pjit().☆283Updated last year
- Ongoing research training transformer models at scale☆386Updated 8 months ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆243Updated last year
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆70Updated 3 months ago
- A framework for the evaluation of autoregressive code generation language models.☆932Updated 5 months ago
- A multi-lingual program repair benchmark set based on the Quixey Challenge☆113Updated 2 years ago
- Benchmark ClassEval for class-level code generation.☆141Updated 6 months ago
- Training language models to make programs faster☆87Updated last year
- CodeBERT☆2,482Updated last year
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Updated 2 months ago
- Code Generation using GPT-J!☆516Updated 2 years ago
- Extract and combine multiple source code views using tree-sitter☆132Updated 4 months ago