rrenaud / Gibberish-Detector
A small program to detect gibberish using a Markov Chain
☆603 Updated last year
Alternatives and similar repositories for Gibberish-Detector
Users interested in Gibberish-Detector are comparing it to the libraries listed below
- Nostril: Nonsense String Evaluator ☆195 Updated 3 years ago
- Python wrapper for ssdeep fuzzy hashing library ☆151 Updated 3 years ago
- Sample DGA classifier ☆125 Updated 9 years ago
- Compare html similarity using structural and style metrics ☆212 Updated 2 years ago
- Package to facilitate URL clustering ☆67 Updated 9 years ago
- Simple heuristic for measuring web page similarity (& data set) ☆90 Updated 7 years ago
- Train a model, and detect gibberish strings with it. ☆62 Updated 3 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆375 Updated 2 years ago
- Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages ☆542 Updated 3 years ago
- ☆269 Updated 6 years ago
- Machine learning to classify Malicious (Spam)/Benign URLs ☆129 Updated 3 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons ☆58 Updated last month
- Fast multi-keyword search engine for text strings ☆255 Updated 8 months ago
- Categorization of IP Addresses ☆528 Updated 2 years ago
- Python client library for Google Safe Browsing API ☆84 Updated last year
- This project contains the source code of a tool for generating regular expressions for text extraction: 1. automatically, 2. based only … ☆951 Updated 4 years ago
- Python wrapper for RE2 ☆296 Updated 2 years ago
- Textpipe: clean and extract metadata from text ☆302 Updated 3 years ago
- NER toolkit for HTML data ☆259 Updated last year
- Fast URL decoder library ☆173 Updated 5 months ago
- The repository that contains the algorithms for generating domain names, dictionaries of malicious domain names. Developed to research th… ☆219 Updated 7 years ago
- Machine Learning and Security | Using machine learning to detect malicious URLs ☆269 Updated 2 years ago
- An Elasticsearch client exposing DataFrame API ☆284 Updated 2 years ago
- Tree edit distance using the Zhang Shasha algorithm ☆449 Updated 4 years ago
- Data Hacking Project ☆777 Updated 6 years ago
- Python library implementing a trie data structure. ☆823 Updated 4 years ago
- Automatic keyword extraction - no alchemy required! ☆169 Updated 9 years ago
- An efficient simhash implementation for python ☆125 Updated 5 years ago
- A collection of known Domain Generation Algorithms ☆66 Updated 9 years ago
- Fast Python Bloom Filter using Mmap ☆744 Updated 5 years ago