microsoft / CodeBERT
CodeBERT
☆2,704 Updated 2 years ago
Alternatives and similar repositories for CodeBERT
Users who are interested in CodeBERT are comparing it to the repositories listed below.
- CodeXGLUE ☆1,785 Updated last year
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation ☆3,083 Updated last year
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d… ☆765 Updated last year
- Datasets, tools, and benchmarks for representation learning of code. ☆2,396 Updated 3 years ago
- Code for the paper "Evaluating Large Language Models Trained on Code" ☆3,045 Updated 10 months ago
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code" ☆1,142 Updated 2 years ago
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code" ☆564 Updated 4 months ago
- ☆25 Updated 3 years ago
- A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering ☆725 Updated last month
- NaturalCC: An Open-Source Toolkit for Code Intelligence ☆310 Updated 2 months ago
- APPS: Automated Programming Progress Standard (NeurIPS 2021) ☆497 Updated last year
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor… ☆172 Updated 2 years ago
- Guide to using pre-trained large language models of source code ☆1,841 Updated last year
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021]. ☆186 Updated 3 years ago
- This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX ☆1,647 Updated 2 months ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ☆558 Updated 10 months ago
- Generative model for code infilling and synthesis ☆308 Updated 2 years ago
- A library for mining of path-based representations of code (and more) ☆299 Updated last month
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence ☆89 Updated 10 months ago
- ☆239 Updated last year
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. ☆5,157 Updated last month
- Website for "A Survey of Machine Learning for Big Code and Naturalness" ☆292 Updated 10 months ago
- Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024 ☆1,640 Updated 2 months ago
- GraphGen4Code: a toolkit for creating code knowledge graphs based on WALA code analysis and extraction of documentation and forum content… ☆324 Updated this week
- Python bindings to the Tree-sitter parsing library ☆1,291 Updated 2 weeks ago
- ☆672 Updated last year
- Pip compatible CodeBLEU metric implementation available for linux/macos/win ☆126 Updated 8 months ago
- A Database of Real Faults and an Experimental Infrastructure to Enable Controlled Experiments in Software Engineering Research ☆903 Updated last month
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI. ☆155 Updated 11 months ago
- [TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets. ☆3,110 Updated 2 weeks ago