pmbaumgartner / clabelLinks
A utility for labeling clusters of text data.
☆28Updated 4 years ago
Alternatives and similar repositories for clabel
Users that are interested in clabel are comparing it to the libraries listed below
Sorting:
- ☆30Updated 3 years ago
- A curated list of ML awesome frameworks & libraries for text data☆16Updated 2 years ago
- ☆70Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 3 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 4 years ago
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- Comparing Polars to Pandas and a small introduction☆44Updated 4 years ago
- Automatically check mismatch between code and comments using AI and ML☆54Updated 4 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆43Updated 5 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆80Updated last year
- Generate reports for spaCy models.☆29Updated 3 years ago
- ☆31Updated 2 years ago
- Compare different encoding methods to see how well they perform on a classification task. Determine if a reddit comment is from /r/StarWa…☆13Updated 3 years ago
- ☆15Updated 7 years ago
- Lazy Profiler is a simple utility to collect CPU, GPU, RAM and GPU Memory stats while the program is running.☆35Updated 4 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- The stand-alone training engine module for the ALOHA.eu project.☆15Updated 6 years ago
- SPEAR: Programmatically label and build training data quickly.☆109Updated last year
- An intelligent, flexible grammar of machine learning.☆82Updated 4 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆107Updated 2 years ago
- 🔤 Measure edit distance based on keyboard layout☆64Updated 4 months ago
- Pipeline components that support partial_fit.☆46Updated last year
- A bit of extra usability for sqlalchemy v2.☆78Updated last year
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow…☆11Updated 3 years ago
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr…☆53Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated 2 years ago
- Search system on top of Elasticsearch, Kubeflow and Katib☆29Updated 2 years ago