martysai / source-code-summarization
Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.
☆15Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for source-code-summarization
- ☆15Updated 3 years ago
- A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.☆52Updated 2 years ago
- ☆40Updated 2 months ago
- ☆23Updated last year
- code for "Natural Language to Code Translation with Execution"☆39Updated 2 years ago
- Models and datasets for annotated code search.☆33Updated last year
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.☆53Updated 3 months ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆13Updated last year
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code☆69Updated 4 months ago
- The dataset for the variable-misuse task, used in the ICLR 2020 paper 'Global Relational Models of Source Code' [https://openreview.net/f…☆22Updated 4 years ago
- code and data for paper "ComFormer: Code Comment Generation via Transformer and Fusion Method-based Hybrid Code Representation" accepted …☆14Updated 2 years ago
- Code for "Learning Structural Edits via Incremental Tree Transformations" (ICLR'21)☆40Updated 3 years ago
- Semantic Code Search☆34Updated last year
- CoditT5: Pretraining for Source Code and Natural Language Editing☆28Updated 2 weeks ago
- A redistributable subset of the ETH Py150 corpus [https://www.sri.inf.ethz.ch/py150], introduced in the ICML 2020 paper 'Learning and Eva…☆29Updated 4 years ago
- Code implementation for CoTexT: Multi-task Learning with Code-Text Transformer☆35Updated 3 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆56Updated last year
- Web queries dataset for code search☆30Updated last year
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- ☆17Updated 2 years ago
- ☆43Updated 2 years ago
- Code Generator☆23Updated last year
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆12Updated 6 months ago
- Tasks for describing differences between text distributions.☆16Updated 3 months ago
- Lyra: A Benchmark for Turducken-Style Code Generation☆15Updated 2 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an…☆14Updated 2 years ago
- Code for "StructCoder: Structure-Aware Transformer for Code Generation"☆67Updated 9 months ago
- ☆11Updated 8 months ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆21Updated 3 years ago