domanchi / gibberish-detector
Train a model, and detect gibberish strings with it.
☆61 · Updated 3 years ago
Alternatives and similar repositories for gibberish-detector
Users who are interested in gibberish-detector are comparing it to the libraries listed below.
- ☆69 · Updated 3 years ago
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- Google USE (Universal Sentence Encoder) for spaCy ☆184 · Updated 2 years ago
- Language detection using Spacy and Fasttext ☆55 · Updated last year
- Fuzzy matching and more functionality for spaCy. ☆256 · Updated 10 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆152 · Updated last year
- Legal document classification with EuroVoc descriptors on 22 languages. ☆26 · Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 · Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆161 · Updated 2 years ago
- Information extraction from English and German texts based on predicate logic ☆135 · Updated last year
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions. ☆123 · Updated 11 months ago
- 🧪 Cutting-edge experimental spaCy components and features ☆98 · Updated last year
- NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, … ☆82 · Updated 5 months ago
- spaCy match and replace, maintaining conjugation ☆35 · Updated 2 years ago
- spaCy entry points for Curated Transformers ☆30 · Updated 7 months ago
- An open-source package for python to clean raw text data ☆69 · Updated last year
- Multi-Language Identification ☆28 · Updated 9 months ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML ☆63 · Updated 3 months ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 · Updated 3 years ago
- SpaCy wrapper for ConceptNet ☆93 · Updated last year
- A python package to simulate typographical errors. ☆35 · Updated last year
- Abydos NLP/IR library for Python ☆186 · Updated 2 years ago
- Work with static vector models ☆28 · Updated 3 weeks ago
- ☆46 · Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 · Updated 2 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words. ☆33 · Updated 2 years ago
- Faster, modernized fork of the language identification tool langid.py ☆55 · Updated 5 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆122 · Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆214 · Updated 3 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 · Updated last year