microsoft / CodeBERTLinks
CodeBERT
☆2,532Updated last year
Alternatives and similar repositories for CodeBERT
Users that are interested in CodeBERT are comparing it to the libraries listed below
Sorting:
- CodeXGLUE☆1,680Updated last year
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code"☆1,134Updated last year
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"☆557Updated 9 months ago
- Code for the paper "Evaluating Large Language Models Trained on Code"☆2,770Updated 4 months ago
- Datasets, tools, and benchmarks for representation learning of code.☆2,309Updated 3 years ago
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation☆3,008Updated last year
- Aligning pretrained language models with instruction data generated by themselves.☆4,389Updated 2 years ago
- CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.☆5,089Updated 4 months ago
- Toolkit for creating, sharing and using natural language prompts.☆2,872Updated last year
- Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024☆1,474Updated 2 weeks ago
- ☆235Updated last year
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d…☆750Updated last year
- NaturalCC: An Open-Source Toolkit for Code Intelligence☆296Updated this week
- A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering☆710Updated 10 months ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆533Updated 4 months ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].☆188Updated 3 years ago
- Train transformer language models with reinforcement learning.☆14,046Updated this week
- A library for mining of path-based representations of code (and more)☆288Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,478Updated 9 months ago
- A framework for the evaluation of autoregressive code generation language models.☆950Updated 7 months ago
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor…☆154Updated last year
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,008Updated last week
- Python bindings to the Tree-sitter parsing library☆1,079Updated this week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,659Updated last year
- APPS: Automated Programming Progress Standard (NeurIPS 2021)☆466Updated 11 months ago
- Generative model for code infilling and synthesis☆302Updated last year
- ☆2,824Updated this week
- Website for "A Survey of Machine Learning for Big Code and Naturalness"☆291Updated 4 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,475Updated 11 months ago
- BERT score for text generation☆1,752Updated 10 months ago