Evaluation of source authorship attribution tool
☆23Jun 5, 2021Updated 4 years ago
Alternatives and similar repositories for authorship-detection
Users that are interested in authorship-detection are comparing it to the libraries listed below
Sorting:
- Collected solutions from Google Code Jam programming competition (2008-2021).☆68Sep 19, 2024Updated last year
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Apr 24, 2021Updated 4 years ago
- ☆16May 23, 2023Updated 2 years ago
- ☆10Aug 25, 2020Updated 5 years ago
- An IntelliJ IDEA plugin that allows to get suggestions for better method names☆10Dec 4, 2019Updated 6 years ago
- The Tangled Genealogy of IoT Malware☆12Jan 5, 2021Updated 5 years ago
- 🔍 Code Search Tools & Experiments☆12Updated this week
- CodexLeaks: Privacy Leaks from Code Generation Language Models in GitHub Copilot☆11Jul 11, 2023Updated 2 years ago
- Graphs and grammars for Context-Free Path Querying algorithms evaluation.☆10Sep 11, 2024Updated last year
- A collection of publications that works on code models but beyond focusing on the accuracies.☆13Jun 30, 2023Updated 2 years ago
- Code for paper "Lancer: Your Code Tell Me What You Need"☆11Jun 17, 2022Updated 3 years ago
- Finding similar repositories on GitHub☆51Jan 12, 2023Updated 3 years ago
- A framework for the large scale analysis of programming language usage.☆30Jun 27, 2023Updated 2 years ago
- 基于CodeBert预训练模型,微调后/直接对目标数据集进行测试☆14Oct 19, 2021Updated 4 years ago
- AST factorization: transformation AST of Kotlin source code to a vector☆11Oct 17, 2019Updated 6 years ago
- Code for the paper "Embedding Java Classes with code2vec: Improvements from Variable Obfuscation" in MSR 2020☆32Mar 24, 2023Updated 2 years ago
- Web queries dataset for code search☆32Jun 3, 2023Updated 2 years ago
- Deep learning code semantic similarity☆66Jun 11, 2019Updated 6 years ago
- ☆13Jun 24, 2019Updated 6 years ago
- Scripts for the creation of the Kaggle Torrent☆14May 17, 2021Updated 4 years ago
- The collection of Context-Free Path Querying algorithms☆14Dec 16, 2025Updated 2 months ago
- Code for the ICPC 2020 paper Improved Source Code Summarization via a Graph Neural Network☆68Apr 9, 2021Updated 4 years ago
- ☆18Apr 14, 2021Updated 4 years ago
- A toolkit for pre-processing large source code corpora☆45Sep 30, 2022Updated 3 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- Bindings to Google's Compact Language Detector 3 to JVM Based Languages☆21Jun 2, 2024Updated last year
- CD4Py: Code De-Duplication for Python☆23Dec 13, 2020Updated 5 years ago
- Dataset and code corresponding to Associating Natural Language Comment and Source Code Entities (AAAI 2020)☆20Oct 24, 2020Updated 5 years ago
- ☆48Nov 19, 2025Updated 3 months ago
- Data and Code for Reproducing "Global Relational Models of Source Code"☆85May 10, 2021Updated 4 years ago
- A benchmark for evaluating embeddings of identifiers in source code.☆22Aug 23, 2021Updated 4 years ago
- Artifacts and other data for "Code Vectors: Understanding Programs Through Embedded Abstraced Symbolic Traces"☆22Jun 5, 2020Updated 5 years ago
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf☆20Dec 28, 2021Updated 4 years ago
- Automatic generation of reviews of scientific papers☆31Apr 3, 2025Updated 11 months ago
- A Python 3 module that provides functions for splitting identifiers found in source code files.☆48Jan 12, 2023Updated 3 years ago
- Plugin for checking license compatibility in IntelliJ IDEA☆25Jan 13, 2026Updated last month
- Code for "Typilus: Neural Type Hints" PLDI 2020☆62Feb 8, 2023Updated 3 years ago
- A Tool for Mining Rich Abstract Syntax Trees from Code☆61Oct 24, 2025Updated 4 months ago
- PyTorch's implementation of the code2seq model.☆62Jul 25, 2024Updated last year