Ce11an / spacy-cleaner
Easily clean text with spaCy!
☆32 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for spacy-cleaner
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated 9 months ago
- Have UV deal with all your Jupyter deps. ☆18 · Updated 2 months ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models. ☆25 · Updated 3 years ago
- Pipeline components that support partial_fit. ☆43 · Updated 4 months ago
- It's a cooler way to store simple linear models. ☆28 · Updated 4 months ago
- Record matching and entity resolution at scale in Spark ☆31 · Updated last year
- NeatText: a simple NLP package for cleaning textual data and text preprocessing ☆68 · Updated 11 months ago
- Using short models to classify long texts ☆20 · Updated last year
- A Python package to simulate typographical errors. ☆31 · Updated 11 months ago
- Python package for deduplication/entity resolution using active learning ☆78 · Updated 2 months ago
- A comprehensive tool for linguistic analysis of communities ☆48 · Updated 3 years ago
- MoodCat😼 classifies the mood of English sentences. ☆14 · Updated 2 years ago
- Just another sentiment wrapper. ☆17 · Updated 2 years ago
- Prune your sklearn models ☆19 · Updated 3 weeks ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more ☆16 · Updated last year
- spaCy entry points for Curated Transformers ☆25 · Updated last month
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 · Updated last year
- Library for evaluating RAG using Nuclia's models ☆14 · Updated 3 months ago
- A Python library for creating adversarial splits ☆13 · Updated 2 years ago
- BERT Probe: A Python package for probing attention-based robustness to character- and word-based adversarial evaluation. Also, with recipe… ☆18 · Updated 2 years ago
- 🌸 Train floret vectors ☆18 · Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Updated 8 months ago
- Repository for my master's thesis on automated string handling ☆16 · Updated 3 years ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 · Updated 2 years ago
- A visual labeling system implemented in Jupyter widgets. ☆11 · Updated 11 months ago
- A Python package to get useful information from documents using the TopicRank algorithm. ☆16 · Updated last year