Ce11an / spacy-cleaner
Easily clean text with spaCy!
☆34 Updated last year
Alternatives and similar repositories for spacy-cleaner
Users who are interested in spacy-cleaner are comparing it to the libraries listed below.
- NeatText a simple NLP package for cleaning textual data and text preprocessing ☆74 Updated 2 years ago
- Pipeline components that support partial_fit. ☆46 Updated last year
- Python package for deduplication/entity resolution using active learning ☆83 Updated last year
- A Streamlit component for annotating text by text selecting. ☆42 Updated last year
- A comprehensive tool for linguistic analysis of communities ☆49 Updated 4 years ago
- Fuzzy Topic Models ☆26 Updated last year
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs). ☆76 Updated last month
- SPEAR: Programmatically label and build training data quickly. ☆109 Updated last year
- Super Simple Similarities Service ☆155 Updated 8 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ☆120 Updated 2 months ago
- spaCy match and replace, maintaining conjugation ☆36 Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆59 Updated 2 years ago
- Streamline scikit-learn model comparison. ☆143 Updated 3 years ago
- XAI based human-in-the-loop framework for automatic rule-learning. ☆49 Updated last year
- Bag of, not words, but tricks! ☆68 Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 4 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects ☆82 Updated 3 years ago
- Creating class-based TF-IDF matrices ☆91 Updated 3 years ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆95 Updated 2 years ago
- Gzip and nearest neighbors for text classification ☆57 Updated 2 years ago
- A data labelling tool based on Streamlit. ☆23 Updated 4 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models. ☆25 Updated 4 years ago
- ☆24 Updated 4 years ago
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document … ☆186 Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆156 Updated last year
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size. ☆65 Updated 10 months ago
- Sentence transformers models for SpaCy ☆109 Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆79 Updated 3 years ago
- Vectorizers for a range of different data types ☆103 Updated 2 months ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. ☆30 Updated 4 years ago