martysai / source-code-summarization
Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.
☆16Updated 4 years ago
Alternatives and similar repositories for source-code-summarization:
Users that are interested in source-code-summarization are comparing it to the libraries listed below
- ☆15Updated 3 years ago
- ☆42Updated 2 months ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆53Updated 3 years ago
- Models and datasets for annotated code search.☆35Updated last year
- PyTorch library for synthesizing programs from natural language☆18Updated 9 months ago
- Set of PyTorch modules for developing and evaluating different algorithms for embedding trees.☆22Updated 3 years ago
- ☆23Updated 2 years ago
- Code and data for AAAI 2022 paper "Multilingual Code Snippets Training for Program Translation"☆10Updated 3 years ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆54Updated 8 months ago
- A highly sophisticated sequence-to-sequence model for code generation☆40Updated 3 years ago
- Code for generating the JuICe dataset.☆37Updated 3 years ago
- Code for "Learning Structural Edits via Incremental Tree Transformations" (ICLR'21)☆41Updated 3 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆73Updated last year
- Learning to Model Editing Processes☆26Updated 2 years ago
- CoditT5: Pretraining for Source Code and Natural Language Editing☆28Updated 3 months ago
- Official implementation of our work, 'GypSum: Learning Hybrid Representations for Code Summarization'.☆14Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated 2 years ago
- ManyTypes4Py: A benchmark Python dataset for machine learning-based type inference☆22Updated 3 years ago
- code for "Natural Language to Code Translation with Execution"☆41Updated 2 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆15Updated 2 years ago
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Updated last year
- ☆14Updated 6 months ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated last year
- ☆44Updated 2 years ago
- Official code of our work, Summarize and Generate to Back-Translate: Unsupervised Translation of Programming Languages [arXiv].☆11Updated 2 years ago
- ☆23Updated 2 months ago
- BLANCA - Benchmarks for LANguage models on Coding Artifacts☆9Updated 3 years ago
- ☆29Updated last month
- ☆11Updated 2 years ago
- C# Data Extraction for "Learning to Represent Edits"☆26Updated 6 years ago