pmbaumgartner / clabel
A utility for labeling clusters of text data.
☆28Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for clabel
- Python package for deduplication/entity resolution using active learning☆79Updated 2 months ago
- ☆29Updated 2 years ago
- A curated list of ML awesome frameworks & libraries for text data☆16Updated last year
- spaCy entry points for Curated Transformers☆24Updated last month
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- ☆29Updated 10 months ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year
- A lightweight tool to measure the full memory of a Python session☆19Updated 3 weeks ago
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.☆15Updated last year
- A Toolbox for the Evaluation of machine learning Explanations☆15Updated 10 months ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow…☆11Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 2 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- A Python library for creating adversarial splits☆13Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 3 years ago
- Generate reports for spaCy models.☆28Updated 2 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆16Updated 2 years ago
- Pipeline components that support partial_fit.☆43Updated 3 months ago
- 🔎 A Prodigy plugin for evaluating spaCy pipelines☆12Updated 7 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Render Jupyter Notebooks With Metaflow Cards☆24Updated last month
- ☆18Updated 2 years ago
- Examples of vector DB indexing and query with various vector databases.☆12Updated 3 weeks ago
- Convenient access to `pynvml` (the library behind `nvidia-smi`)☆20Updated 3 weeks ago
- It's a cooler way to store simple linear models.☆28Updated 3 months ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆42Updated 4 years ago