rrenaud / Gibberish-DetectorLinks
A small program to detect gibberish using a Markov Chain
☆605Updated last year
Alternatives and similar repositories for Gibberish-Detector
Users that are interested in Gibberish-Detector are comparing it to the libraries listed below
Sorting:
- Nostril: Nonsense String Evaluator☆198Updated 3 years ago
- Compare html similarity using structural and style metrics☆217Updated 2 years ago
- Python wrapper for ssdeep fuzzy hashing library☆151Updated 4 years ago
- Machine learning to classify Malicious (Spam)/Benign URL's☆133Updated 4 years ago
- ☆270Updated 7 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 3 years ago
- Package to facilitate URL clustering☆71Updated 9 years ago
- Machine Learning and Security | Using machine learning to detect malicious URLs☆273Updated 3 years ago
- Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages☆542Updated 4 years ago
- An efficient simhash implementation for python☆127Updated 6 years ago
- Sample DGA classifier☆128Updated 10 years ago
- Simple heuristic for measuring web page similarity (& data set)☆90Updated 7 years ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents☆292Updated 2 years ago
- ☆816Updated last year
- The implementation of the Seq2Seq model for web attack detection. The Seq2Seq model is usually used in Neural Machine Translation. The ma…☆156Updated 3 years ago
- Simhash and near-duplicate detection☆421Updated 2 years ago
- The repository that contains the algorithms for generating domain names, dictionaries of malicious domain names. Developed to research th…☆222Updated 8 years ago
- Example Python code for comparing documents using MinHash☆252Updated 6 years ago
- Python client library for Google Safe Browsing API☆84Updated 2 years ago
- Just the facts -- web page content extraction☆1,278Updated 6 months ago
- Heuristic based boilerplate removal tool☆810Updated 10 months ago
- This project contains the source code of a tool for generating regular expressions for text extraction: 1. automatically, 2. based only …☆951Updated 5 years ago
- Corpus of auto-labeled text for the cyber security domain☆97Updated 6 years ago
- Python wrapper for RE2☆297Updated 2 years ago
- A tool that parses emails by enhancing the Python standard library, extracting all details into a comprehensive object.☆428Updated 2 months ago
- A collection of common regular expressions bundled with an easy to use interface.☆1,582Updated 2 years ago
- A fork of boilerpipe with python 3 and small fixes, ported from source `https://pypi.python.org/pypi/boilerpipe-py3.☆45Updated 5 years ago
- Code for the paper URLNet - Learning a URL Representation with Deep Learning for Malicious URL Detection☆172Updated 4 years ago
- Suspicious DGA from PDNS and Sandbox.☆187Updated 3 years ago
- Stringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidenta…☆170Updated 7 months ago