s3curitybug / similarity-uniform-fuzzy-hashLinks
Similarity algorithm (computes the similarity between two files as a 0 to 1 score) with linear complexity, based on context triggered piecewise (fuzzy) hashes.
☆34Updated 7 years ago
Alternatives and similar repositories for similarity-uniform-fuzzy-hash
Users that are interested in similarity-uniform-fuzzy-hash are comparing it to the libraries listed below
Sorting:
- A Java library for byte pattern matching and searching☆41Updated 4 years ago
- String matching algorithm benchmark☆37Updated last year
- AST factorization: transformation AST of Kotlin source code to a vector☆11Updated 5 years ago
- Type discovery for Python☆24Updated 9 years ago
- This is a research project for dynamic fingerprinting, which means even some user change features of their computer, we can still fingerp…☆39Updated 6 years ago
- Java port of TLSH (Trend Micro Locality Sensitive Hash)☆20Updated 4 years ago
- Java implementation of Lempel-Ziv Jaccard Distance☆21Updated 7 years ago
- ☆21Updated last month
- Locality-sensitive hashing algorithm for text similarity comparisons☆58Updated 2 months ago
- SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex☆19Updated 2 years ago
- Python wrapper for ssdeep fuzzy hashing library☆151Updated 3 years ago
- A small tool which uses the CommonCrawl URL Index to download documents with certain file types or mime-types. This is used for mass-test…☆68Updated last week
- Java fuzz testing library for implementations of ABNF rules such as IETF RFCs☆33Updated 9 months ago
- Log tailing and parsing framework in Java☆26Updated 10 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 4 years ago
- Simple heuristic for measuring web page similarity (& data set)☆90Updated 7 years ago
- JITed Taint Tracking in V8☆15Updated 11 years ago
- This module contains an implementation of the Nilsimsa locality-sensitive hashing algorithm in Java.☆18Updated 6 years ago
- A tool for anomaly detection over streaming data based on sentiment analysis☆30Updated 6 years ago
- Assessing Source Code Semantic Similarity with Unsupervised Learning☆41Updated 7 years ago
- Detect memory leaks in minutes without a heap dump.☆17Updated 8 years ago
- This repository contains the code for our paper "Browser-based CPU Fingerprinting".☆40Updated 2 years ago
- Python Implementation of Super and Hyper Log Log Sketches☆49Updated 13 years ago
- a mutation testing engine for Java based on mutant schemata / metamutants / metaprogramming☆20Updated 2 years ago
- Advanced similarity and duplicate source code at scale.☆55Updated 6 years ago
- Programmer De-anonymization from Binary Executables☆86Updated 7 years ago
- A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.☆44Updated 3 years ago
- Pure Java implementation of XGBoost predictor for online prediction tasks.☆27Updated 2 years ago
- ShiftLeft OverflowDB☆124Updated last month
- AFL-like fuzzer for the Java Virtual Machine☆48Updated 6 years ago