casics / nostril
Nostril: Nonsense String Evaluator
☆190Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for nostril
- A small program to detect gibberish using a Markov Chain☆598Updated 9 months ago
- Stringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidenta…☆164Updated 7 months ago
- Train a model, and detect gibberish strings with it.☆59Updated 2 years ago
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆168Updated this week
- ☆165Updated 5 months ago
- Lightning Fast Language Prediction 🚀☆165Updated 5 years ago
- Textpipe: clean and extract metadata from text☆299Updated 3 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆149Updated last year
- Abydos NLP/IR library for Python☆183Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆365Updated last year
- ECMAScript parsing infrastructure for multipurpose analysis☆236Updated last year
- URL normalization for Python☆94Updated 2 years ago
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- python eml parser module☆215Updated 2 weeks ago
- Extracts the top level domain (TLD) from the URL given.☆179Updated last year
- A fast python implementation of the SimHash algorithm.☆27Updated 3 years ago
- Python wrapper for RE2☆99Updated 2 months ago
- Heuristic based boilerplate removal tool☆729Updated 6 months ago
- A lucene query parser generating ElasticSearch queries and more !☆189Updated 2 months ago
- Parse natural language time expressions in python☆131Updated last year
- Fast Python Bloom Filter using Mmap☆130Updated 6 months ago
- 📛 Fuzzy Name Matching with Machine Learning☆257Updated 5 months ago
- Fuzzy matching and more functionality for spaCy.☆252Updated 4 months ago
- Compare html similarity using structural and style metrics☆210Updated last year
- spellchecking library for python☆601Updated 5 months ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆75Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆193Updated last year
- Python module to generate regular all expression matches☆187Updated this week
- Accurately find/replace/remove emojis in text strings☆158Updated 11 months ago
- A compound word splitter for Python☆48Updated 3 years ago