suminb / winnowing
A Python implementation of Winnowing (local algorithms for document fingerprinting)
☆53 Updated 5 years ago
Alternatives and similar repositories for winnowing:
Users who are interested in winnowing are comparing it to the libraries listed below:
- ☆20 Updated 8 years ago
- statistical similarity of binaries (Esh) ☆73 Updated 8 years ago
- The public dataset in the paper "PatchDB: A Large-Scale Security Patch Dataset". This paper appears in the 51st Annual IEEE/IFIP Interna… ☆40 Updated last year
- Neural Variable Renaming for Decompiled Binaries ☆44 Updated 4 years ago
- the code for three models introduced in DYNAMIC NEURAL PROGRAM EMBEDDINGS FOR PROGRAM REPAIR (ICLR 18) ☆32 Updated 6 years ago
- MLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf) ☆39 Updated 6 years ago
- A set of tools for extracting tokens and ASTs from code ☆22 Updated 6 years ago
- TRACY ☆19 Updated 8 years ago
- Website for Learning from "Big Code" ☆29 Updated 3 years ago
- ☆9 Updated 10 years ago
- Babelfish Python client ☆16 Updated 5 years ago
- DeepBugs is a framework for learning bug detectors from an existing code corpus. ☆151 Updated 4 years ago
- Programmer De-anonymization via Code Stylometry ☆77 Updated 7 years ago
- Deep learning code semantic similarity ☆63 Updated 5 years ago
- An inter-procedural data-flow analysis framework using value-based context sensitivity ☆90 Updated 11 months ago
- code2vec: Learning Distributed Representations of Code ☆14 Updated 6 years ago
- APISan: Sanitizing API Usages through Semantic Cross-Checking ☆63 Updated 3 years ago
- A System for Debloating C/C++ Programs ☆31 Updated 3 years ago
- ☆12 Updated 2 years ago
- ☆53 Updated 7 years ago
- ☆14 Updated 2 years ago
- Probabilistic API Mining ☆53 Updated 7 years ago
- Pyc-cfg is a pure Python control flow graph builder for almost all of the ANSI C programming language. ☆53 Updated 7 years ago
- Static analysis tool to slice python programs ☆36 Updated 8 years ago
- A toolkit for pre-processing large source code corpora ☆47 Updated 2 years ago
- Neural Code Comprehension: A Learnable Representation of Code Semantics ☆211 Updated 5 months ago
- Taxonomy of Real Faults in Deep Learning Systems ☆16 Updated 5 years ago
- [ICSE'18] Hierarchical Learning of Cross-Language Mappings through Distributed Vector Representations for Code ☆22 Updated 6 years ago
- A pass that can generate PDG (in *.dot) for LLVM. ☆36 Updated 8 years ago
- ANTLR 3 fuzzy parser ☆48 Updated 12 years ago