rrenaud / Gibberish-Detector
A small program to detect gibberish using a Markov Chain
☆598Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for Gibberish-Detector
- Nostril: Nonsense String Evaluator☆190Updated 2 years ago
- Train a model, and detect gibberish strings with it.☆59Updated 2 years ago
- ☆270Updated 6 years ago
- Sample DGA classifier☆125Updated 9 years ago
- Machine learning to classify Malicious (Spam)/Benign URL's☆126Updated 3 years ago
- Compare html similarity using structural and style metrics☆210Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆365Updated last year
- The implementation of the Seq2Seq model for web attack detection. The Seq2Seq model is usually used in Neural Machine Translation. The ma…☆154Updated 2 years ago
- Machine Learning and Security | Using machine learning to detect malicious URLs☆266Updated 2 years ago
- Package to facilitate URL clustering☆68Updated 8 years ago
- Python wrapper for ssdeep fuzzy hashing library☆152Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆802Updated this week
- Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages☆539Updated 3 years ago
- Python implementation of TextRank algorithm for automatic keyword extraction and summarization using Levenshtein distance as relation bet…☆765Updated 2 years ago
- Heuristic based boilerplate removal tool☆729Updated 6 months ago
- Fast Python Bloom Filter using Mmap☆130Updated 6 months ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆243Updated 6 months ago
- An efficient simhash implementation for python☆125Updated 5 years ago
- Code for the paper URLNet - Learning a URL Representation with Deep Learning for Malicious URL Detection☆153Updated 3 years ago
- Exploring internet domain names with deep learning using vector embeddings☆21Updated 5 years ago
- Retrieve and parse whois data for IPv4 and IPv6 addresses☆555Updated last month
- The repository that contains the algorithms for generating domain names, dictionaries of malicious domain names. Developed to research th…☆219Updated 7 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons☆59Updated 3 years ago
- A python implementation of the Rapid Automatic Keyword Extraction☆975Updated 4 years ago
- Scalable Bloom Filter implemented in Python☆164Updated 2 years ago
- Simple heuristic for measuring web page similarity (& data set)☆89Updated 6 years ago
- Corpus of auto-labeled text for the cyber security domain☆91Updated 4 years ago
- Stringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidenta…☆164Updated 7 months ago
- Malicious Web Sites Detection using Suspicious URL☆72Updated 4 years ago
- Data Hacking Project☆775Updated 5 years ago