JetBrains-Research / embeddings-for-trees
Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.
☆22Updated 3 years ago
Alternatives and similar repositories for embeddings-for-trees:
Users that are interested in embeddings-for-trees are comparing it to the libraries listed below
- C# Data Extraction for "Learning to Represent Edits"☆26Updated 6 years ago
- Learning to Update Natural Language Comments Based on Code Changes: Artifact☆33Updated 4 years ago
- Mining tool and large-scale datasets of single statement bug fixes in Python☆15Updated last year
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Updated 4 years ago
- PyTorch's implementation of the code2seq model.☆61Updated 5 months ago
- an implementation of "code2vec: Learning Distributed Representations of Code"☆29Updated 6 months ago
- [AAAI 2021] - TreeCaps: Tree-based Capsule Network for Source Code Processing☆22Updated last year
- Models and datasets for annotated code search.☆35Updated last year
- CoditT5: Pretraining for Source Code and Natural Language Editing☆28Updated this week
- ☆35Updated 2 years ago
- ☆23Updated last year
- Re-implementation of "CODE2SEQ: GENERATING SEQUENCES FROM STRUCTURED REPRESENTATIONS OF CODE"☆44Updated 5 months ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Updated 3 years ago
- A toolkit for pre-processing large source code corpora☆46Updated 2 years ago
- PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets suitable for graph-based machine learning. W…☆87Updated 2 years ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆53Updated 5 months ago
- ☆10Updated 4 years ago
- Paper Artifacts for "Aroma: Code Recommendation via Structural Code Search"☆57Updated 3 years ago
- Deep Just-In-Time Inconsistency Detection Between Comments and Source Code: Artifact☆21Updated 2 years ago
- ESEC/FSE'21: Prediction-Preserving Program Simplification☆10Updated 2 years ago
- ☆24Updated 2 years ago
- Semantic Code Search☆34Updated last year
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆14Updated 2 years ago
- A redistributable subset of the ETH Py150 corpus [https://www.sri.inf.ethz.ch/py150], introduced in the ICML 2020 paper 'Learning and Eva…☆30Updated 4 years ago
- An IntelliJ-based IDE plugin for Python AST transformations☆18Updated last year
- VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning☆38Updated 2 years ago
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆12Updated 3 months ago
- ML models often mispredict, and it is hard to tell when and why. We present a data mining based approach to discover whether there is a c…☆18Updated 2 years ago
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie…☆83Updated last year
- Utilities used by the Deep Program Understanding team☆102Updated last year