domanchi / gibberish-detector
Train a model, and detect gibberish strings with it.
☆61Updated 3 years ago
Alternatives and similar repositories for gibberish-detector:
Users that are interested in gibberish-detector are comparing it to the libraries listed below
- Language detection using Spacy and Fasttext☆55Updated last year
- Information extraction from English and German texts based on predicate logic☆135Updated last year
- Fuzzy matching and more functionality for spaCy.☆256Updated 8 months ago
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, …☆80Updated 3 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Multi-Langauge Identification☆29Updated 7 months ago
- Blazing fast topic modelling for short texts.☆31Updated 2 months ago
- spaCy entry points for Curated Transformers☆27Updated 5 months ago
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆121Updated 10 months ago
- ☆68Updated 3 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆69Updated last month
- 80x faster and 95% accurate language identification with Fasttext☆150Updated last year
- Python package for deduplication/entity resolution using active learning☆76Updated 6 months ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆16Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆154Updated 4 months ago
- An open-source package for python to clean raw text data☆69Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 11 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆97Updated 10 months ago
- Build and upload fastText Python wheels to PyPI☆23Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆158Updated 2 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆118Updated 11 months ago
- Google USE (Universal Sentence Encoder) for spaCy☆182Updated last year
- Targetted language identifier, based on FastText and Hunspell.☆34Updated last month
- 🌸 Train floret vectors☆18Updated last year
- Script for downloading GitHub.☆91Updated 8 months ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago