facebookresearch / CodeGen
Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.
☆749Updated last year
Alternatives and similar repositories for CodeGen
Users that are interested in CodeGen are comparing it to the libraries listed below
Sorting:
- Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf☆1,709Updated 3 years ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆531Updated 3 months ago
- Generative model for code infilling and synthesis☆302Updated last year
- CodeGen2 models for program synthesis☆275Updated last year
- CodeXGLUE☆1,670Updated last year
- Pretrained Language Models for Source code☆253Updated 3 years ago
- ☆651Updated 6 months ago
- Minimal library to train LLMs on TPU in JAX with pjit().☆285Updated last year
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆190Updated last year
- APPS: Automated Programming Progress Standard (NeurIPS 2021)☆462Updated 10 months ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆464Updated 3 months ago
- This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX☆1,587Updated last week
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆73Updated 3 months ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆188Updated 3 years ago
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.☆996Updated 9 months ago
- A static analysis library for computing graph representations of Python programs suitable for use with graph neural networks.☆333Updated last year
- A pre-trained GPT model for Python code completion and generation☆275Updated last year
- ☆451Updated 9 months ago
- Guide to using pre-trained large language models of source code☆1,825Updated 10 months ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆74Updated last year
- Ongoing research training transformer models at scale☆387Updated 8 months ago
- A library for mining of path-based representations of code (and more)☆287Updated last year
- A multi-programming language benchmark for LLMs☆246Updated 3 months ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Updated 3 months ago
- Fine-tune SantaCoder for Code/Text Generation.☆192Updated 2 years ago
- Implementation of the paper "Language-agnostic representation learning of source code from structure and context".☆169Updated 3 years ago
- Code Generation using GPT-J!☆518Updated 2 years ago
- ☆1,472Updated 2 years ago
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"☆557Updated 9 months ago
- CodeBERT☆2,503Updated last year