rrenaud / Gibberish-Detector
A small program to detect gibberish using a Markov Chain
☆603 Updated last year
Alternatives and similar repositories for Gibberish-Detector
Users who are interested in Gibberish-Detector are comparing it to the libraries listed below.
- Nostril: Nonsense String Evaluator ☆195 Updated 3 years ago
- A Python Implementation of Simhash Algorithm ☆1,019 Updated 3 years ago
- Compare html similarity using structural and style metrics ☆212 Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆374 Updated 2 years ago
- ☆269 Updated 6 years ago
- Sample DGA classifier ☆125 Updated 9 years ago
- The repository that contains the algorithms for generating domain names, dictionaries of malicious domain names. Developed to research th… ☆219 Updated 7 years ago
- Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages ☆542 Updated 3 years ago
- Python wrapper for ssdeep fuzzy hashing library ☆151 Updated 3 years ago
- Simple heuristic for measuring web page similarity (& data set) ☆90 Updated 7 years ago
- Package to facilitate URL clustering ☆68 Updated 9 years ago
- Train a model, and detect gibberish strings with it. ☆62 Updated 3 years ago
- Modern robots.txt Parser for Python ☆194 Updated last year
- The implementation of the Seq2Seq model for web attack detection. The Seq2Seq model is usually used in Neural Machine Translation. The ma… ☆157 Updated 3 years ago
- DGA Domains detection ☆66 Updated 7 years ago
- A collection of known Domain Generation Algorithms ☆66 Updated 9 years ago
- Machine Learning and Security | Using machine learning to detect malicious URLs ☆271 Updated 2 years ago
- Python wrapper for RE2 ☆297 Updated 2 years ago
- Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL). ☆1,905 Updated last month
- Extracts the top level domain (TLD) from the URL given. ☆182 Updated last month
- NER toolkit for HTML data ☆259 Updated last year
- An efficient simhash implementation for python ☆125 Updated 5 years ago
- Corpus of auto-labeled text for the cyber security domain ☆95 Updated 5 years ago
- Fast multi-keyword search engine for text strings ☆256 Updated 9 months ago
- Just the facts -- web page content extraction ☆1,267 Updated 11 months ago
- A Python library to detect and extract listing data from HTML pages. ☆108 Updated 8 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons ☆58 Updated 2 months ago
- Suspicious DGA from PDNS and Sandbox. ☆185 Updated 2 years ago
- A project to attempt to automatically login to a website given a single seed ☆124 Updated 2 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ☆272 Updated last year