A simple tool for detecting near-duplicate source code
☆104 · Oct 3, 2024 · Updated last year
Alternatives and similar repositories for near-duplicate-code-detector
Users that are interested in near-duplicate-code-detector are comparing it to the libraries listed below.
- Contains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlie… ☆84 · Mar 24, 2023 · Updated 3 years ago
- Text clustering algorithm, implemented in .NET ☆21 · Jun 22, 2023 · Updated 3 years ago
- ☆11 · Dec 31, 2019 · Updated 6 years ago
- Utilities used by the Deep Program Understanding team ☆104 · Jun 12, 2023 · Updated 3 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts. ☆51 · Sep 5, 2022 · Updated 3 years ago
- Too little variation - A tool to discover code duplication in various languages ☆11 · Jun 22, 2026 · Updated last week
- Code for "Generative Code Modeling with Graphs" (ICLR'19) ☆172 · Dec 8, 2022 · Updated 3 years ago
- Babelfish documentation (GitBook) ☆44 · Nov 12, 2019 · Updated 6 years ago
- Data and Code for Reproducing "Global Relational Models of Source Code" ☆86 · May 10, 2021 · Updated 5 years ago
- this repository is obsolete please go to our new repository ☆14 · Jan 12, 2018 · Updated 8 years ago
- ☆14 · Mar 1, 2020 · Updated 6 years ago
- Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure an… ☆15 · May 18, 2022 · Updated 4 years ago
- Machine Learning for Source Code Analysis ☆17 · Nov 20, 2023 · Updated 2 years ago
- Jacdac .NET library ☆15 · Aug 14, 2025 · Updated 10 months ago
- A GitHub Action for suggesting Python type annotations. ☆42 · Mar 23, 2023 · Updated 3 years ago
- A simple Python3 tool to detect similarities between files within a repository ☆208 · Jun 1, 2024 · Updated 2 years ago
- IST'21 & SANER'22: Semantic-Preserving Program Transformations ☆31 · Oct 25, 2022 · Updated 3 years ago
- Mapping Language to Code in a Programmatic Context ☆80 · Jan 27, 2021 · Updated 5 years ago
- Light-weight library to implement CQRS (Command Query Responsibility Segregation) pattern in dotnet. Inspired by the Mediatr library. ☆35 · Jun 19, 2026 · Updated 2 weeks ago
- Information related to OpenAPI usage at Microsoft ☆16 · Feb 6, 2026 · Updated 4 months ago
- ☆21 · Jul 5, 2021 · Updated 4 years ago
- ☆20 · Nov 6, 2019 · Updated 6 years ago
- A testing framework for Visual Studio extensions ☆22 · Jun 23, 2026 · Updated last week
- Binary object notation - a standard for representing JSON in a compact, efficient format ☆16 · Jun 12, 2023 · Updated 3 years ago
- ☆16 · Jul 8, 2024 · Updated last year
- ☆26 · Jun 23, 2026 · Updated last week
- [DEPRECATED] A simple example service, demonstrating gRPC integration with the Bond framework. ☆18 · Mar 7, 2022 · Updated 4 years ago
- Replication Code for "Self-Supervised Bug Detection and Repair" NeurIPS 2021 ☆110 · Aug 30, 2022 · Updated 3 years ago
- demonstration for our ACL 2018 paper, "On the Practical Computational Power of Finite Precision RNNs for Language Recognition" ☆11 · May 26, 2019 · Updated 7 years ago
- ☆14 · Feb 14, 2018 · Updated 8 years ago
- Re-implementation of "CODE2SEQ: GENERATING SEQUENCES FROM STRUCTURED REPRESENTATIONS OF CODE" ☆45 · Jul 25, 2024 · Updated last year
- A generic tool for automatic schema generation for a set of C# classes. ☆16 · Oct 14, 2019 · Updated 6 years ago
- Seq2seq Type Inference using Static Analysis and CodeT5 ☆32 · Jul 9, 2023 · Updated 2 years ago
- Database smell detector ☆13 · Jan 24, 2018 · Updated 8 years ago
- Python library to share machine learning models easily and reliably. ☆18 · Nov 5, 2019 · Updated 6 years ago
- A library for mining of path-based representations of code (and more) ☆299 · Nov 7, 2025 · Updated 7 months ago
- Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation. ☆59 · Jul 31, 2024 · Updated last year
- Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Sourc… ☆66 · Dec 3, 2021 · Updated 4 years ago
- A C# library that makes working with the enums more than 18 times faster without any memory allocation using the CSharp source generators… ☆20 · Jul 22, 2023 · Updated 2 years ago