casics / nostrilLinks
Nostril: Nonsense String Evaluator
β195Updated 3 years ago
Alternatives and similar repositories for nostril
Users that are interested in nostril are comparing it to the libraries listed below
Sorting:
- A small program to detect gibberish using a Markov Chainβ603Updated last year
- π A CPython extension for the Hyperscan regular expression matching library.β178Updated last week
- Python wrapper for ssdeep fuzzy hashing libraryβ151Updated 3 years ago
- Stringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidentaβ¦β167Updated 2 weeks ago
- β171Updated 2 months ago
- Abydos NLP/IR library for Pythonβ186Updated 2 years ago
- Compare html similarity using structural and style metricsβ212Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.β256Updated 11 months ago
- Find strings/words in text; convenience and C speedβ126Updated 2 years ago
- Simple heuristic for measuring web page similarity (& data set)β90Updated 7 years ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy dataβ¦β93Updated 3 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)β153Updated last year
- Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enoughβ285Updated 2 years ago
- spellchecking library for pythonβ609Updated 11 months ago
- Accurately find/replace/remove emojis in text stringsβ162Updated last year
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engineβ185Updated 4 years ago
- A simple fuzzy matching set for python stringsβ227Updated 9 months ago
- Pure python Aho-Corasick library.β215Updated 2 years ago
- Train a model, and detect gibberish strings with it.β62Updated 3 years ago
- Tokenizer for raw mailsβ387Updated last week
- A lucene query parser generating ElasticSearch queries and more !β191Updated 3 months ago
- Extracts the top level domain (TLD) from the URL given.β182Updated last week
- Text Mining and Topic Modeling Toolkit for Python with parallel processing powerβ190Updated 2 years ago
- π Additional lookup tables and data resources for spaCyβ105Updated this week
- Locality-sensitive hashing algorithm for text similarity comparisonsβ58Updated last month
- Character-based word embeddings model based on RNN for handling real worldΒ textsβ173Updated last year
- Fuzzy string matching, grouping, and evaluation.β764Updated last month
- A Fast Levenshtein Distance Library for Pythonβ83Updated 3 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.β263Updated last year
- A fully customisable language detection pipeline for spaCyβ93Updated 6 years ago