prasanthg3 / cleantextLinks
An open-source package for python to clean raw text data
โ70Updated last year
Alternatives and similar repositories for cleantext
Users that are interested in cleantext are comparing it to the libraries listed below
Sorting:
- Python package for deduplication/entity resolution using active learningโ80Updated 9 months ago
- ๐ฅ Use Hugging Face text and token classification pipelines directly in spaCyโ63Updated last year
- โ๏ธ Parallel and distributed training with spaCy and Rayโ54Updated last year
- Easy PDF to text to spaCy text extraction in Python.โ39Updated 7 months ago
- ๐งฌ A VS Code extension for annotating data with Prodigyโ30Updated 3 years ago
- Language detection using Spacy and Fasttextโ55Updated last year
- Bag of, not words, but tricks!โ68Updated last year
- A python package to simulate typographical errors.โ35Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to iโฆโ46Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataโ161Updated 2 years ago
- Sentence transformers models for SpaCyโ107Updated 2 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.โ118Updated last year
- ๐งช Cutting-edge experimental spaCy components and featuresโ99Updated last year
- Information extraction from English and German texts based on predicate logicโ136Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality โฆโ106Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.โ155Updated last year
- A spaCy custom component that extracts and normalizes temporal expressionsโ54Updated 2 years ago
- โ55Updated last year
- โ17Updated 2 years ago
- Dataframe Integration with spaCy.โ102Updated 4 years ago
- Generate reports for spaCy models.โ29Updated 3 years ago
- โ69Updated 3 years ago
- spaCy match and replace, maintaining conjugationโ35Updated 2 years ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.โ62Updated this week
- ๐ซ SpaCy wrapper for ConceptNet ๐ซโ93Updated last year
- โ30Updated 2 years ago
- โ43Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.โ58Updated 2 years ago
- spaCy entry points for Curated Transformersโ31Updated this week
- Fuzzy matching and more functionality for spaCy.โ256Updated 10 months ago