waingram / code-embeddings
A Comparative Study of Various Code Embeddings in Software Semantic Matching
☆15 · Updated 2 years ago
Alternatives and similar repositories for code-embeddings:
Users who are interested in code-embeddings are comparing it to the repositories listed below.
- Models and datasets for annotated code search. ☆35 · Updated last year
- Code for the paper "A Structural Model for Contextual Code Changes" ☆31 · Updated last year
- Semantic Code Search ☆35 · Updated 2 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation" ☆73 · Updated last year
- ☆24 · Updated 3 years ago
- ☆15 · Updated 3 years ago
- CoditT5: Pretraining for Source Code and Natural Language Editing ☆28 · Updated 3 months ago
- Code Snippet Recommendation from Stack Overflow Post ☆18 · Updated 3 years ago
- ☆23 · Updated 2 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies. ☆53 · Updated 3 years ago
- ☆42 · Updated 2 months ago
- ☆66 · Updated 2 years ago
- A toolkit for pre-processing large source code corpora ☆47 · Updated 2 years ago
- Neural Code Translator provides instructions, datasets, and a deep learning infrastructure (based on seq2seq) that aims at learning code … ☆38 · Updated 6 years ago
- ☆36 · Updated 3 years ago
- CD4Py: Code De-Duplication for Python ☆23 · Updated 4 years ago
- Incremental Python parser for constrained generation of code by LLMs. ☆16 · Updated 7 months ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f… ☆22 · Updated 4 years ago
- Recent Advances in Programming Language Pre-Trained Models (PL-PTMs) ☆58 · Updated 3 years ago
- code and data for paper "BASHEXPLAINER: Retrieval-Augmented Bash Code Comment Generation based on Fine-tuned CodeBERT", which accepted in… ☆12 · Updated 2 years ago
- MODIT: On Multi-Modal Learning of Editing Source Code. ☆20 · Updated 4 years ago
- ☆28 · Updated 2 years ago
- ☆17 · Updated last year
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W… ☆88 · Updated 3 years ago
- ☆13 · Updated last year
- ☆17 · Updated 2 years ago
- ☆44 · Updated 2 years ago
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so… ☆55 · Updated last year
- Hoppity ☆59 · Updated 4 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code. ☆16 · Updated 4 years ago