microsoft / CodeBERT
CodeBERT
☆2,447 Updated last year
Alternatives and similar repositories for CodeBERT:
Users who are interested in CodeBERT are comparing it to the repositories listed below
- CodeXGLUE ☆1,642 Updated 11 months ago
- Home of CodeT5: Open Code LLMs for Code Understanding and Generation ☆2,943 Updated last year
- Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code" ☆558 Updated 7 months ago
- TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Representations of Code" ☆1,127 Updated last year
- Datasets, tools, and benchmarks for representation learning of code. ☆2,281 Updated 3 years ago
- Code for the paper "Evaluating Large Language Models Trained on Code" ☆2,661 Updated 2 months ago
- Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021]. ☆187 Updated 3 years ago
- NaturalCC: An Open-Source Toolkit for Code Intelligence ☆288 Updated last month
- APPS: Automated Programming Progress Standard (NeurIPS 2021) ☆451 Updated 9 months ago
- CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. ☆5,052 Updated 2 months ago
- Website for "A Survey of Machine Learning for Big Code and Naturalness" ☆290 Updated last month
- Generative model for code infilling and synthesis ☆300 Updated last year
- ☆235 Updated last year
- Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from d… ☆735 Updated last year
- A framework for the evaluation of autoregressive code generation language models. ☆915 Updated 5 months ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ☆521 Updated 2 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,610 Updated last year
- A library for mining of path-based representations of code (and more) ☆287 Updated last year
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models ☆3,009 Updated 8 months ago
- ☆2,130 Updated last year
- A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering ☆708 Updated 8 months ago
- methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositor… ☆146 Updated last year
- A modular RL library to fine-tune language models to human preferences ☆2,294 Updated last year
- BERT score for text generation ☆1,712 Updated 8 months ago
- Official implementation of our work, A Transformer-based Approach for Source Code Summarization [ACL 2020]. ☆192 Updated 2 years ago
- Efficient few-shot learning with Sentence Transformers ☆2,424 Updated 2 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.) ☆7,273 Updated last year
- ☆2,776 Updated this week
- 🤗 Evaluate: A library for easily evaluating machine learning models and datasets. ☆2,161 Updated 2 months ago
- Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks. ☆1,772 Updated last year