pmbaumgartner / clabel
A utility for labeling clusters of text data.
☆28 Updated 3 years ago
Alternatives and similar repositories for clabel:
Users who are interested in clabel are comparing it to the libraries listed below.
- Comparing Polars to Pandas and a small introduction ☆43 Updated 3 years ago
- ☆30 Updated 2 years ago
- A curated list of ML awesome frameworks & libraries for text data ☆16 Updated 2 years ago
- Python package for deduplication/entity resolution using active learning ☆76 Updated 6 months ago
- Generate reports for spaCy models. ☆29 Updated 2 years ago
- Set-oriented Operations in Pandas ☆24 Updated 4 years ago
- A lightweight tool to measure the full memory of a Python session ☆19 Updated 3 weeks ago
- A Python library for creating adversarial splits ☆13 Updated 2 years ago
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file. ☆15 Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 Updated 2 years ago
- spaCy entry points for Curated Transformers ☆27 Updated 5 months ago
- Just another sentiment wrapper. ☆17 Updated 3 years ago
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 2 years ago
- How to do data science with Optimus, Spark and Python. ☆19 Updated 5 years ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 3 years ago
- Automated Jupyter notebook testing. 📙 ☆41 Updated last year
- 🔎 A Prodigy plugin for evaluating spaCy pipelines ☆13 Updated 11 months ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆53 Updated last year
- ☆70 Updated 2 years ago
- It's a cooler way to store simple linear models. ☆28 Updated 8 months ago
- A python module that will check for package updates. ☆28 Updated 3 years ago
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models. ☆21 Updated 2 years ago
- ☆15 Updated 6 years ago
- Lazy Profiler is a simple utility to collect CPU, GPU, RAM and GPU Memory stats while the program is running. ☆35 Updated 4 years ago
- A python package to simulate typographical errors. ☆32 Updated last year
- ☆29 Updated last year
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow… ☆11 Updated 2 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node ☆16 Updated 2 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ☆36 Updated last year
- ☆19 Updated 4 years ago