ml-tue / automated-string-cleaning
Repository for my master's thesis on automated string handling
☆16 · Updated 4 years ago
Alternatives and similar repositories for automated-string-cleaning
Users who are interested in automated-string-cleaning are comparing it to the libraries listed below.
- SPEAR: Programmatically label and build training data quickly. ☆107 · Updated last year
- Weakly Supervised End-to-End Learning (NeurIPS 2021) ☆157 · Updated 2 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects ☆81 · Updated 3 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. ☆30 · Updated 3 years ago
- Pipeline components that support partial_fit. ☆46 · Updated last year
- Super Simple Similarities Service ☆152 · Updated 3 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 · Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers. ☆214 · Updated 10 months ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 · Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated 2 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆53 · Updated 3 years ago
- Streamline scikit-learn model comparison. ☆144 · Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… ☆108 · Updated 10 months ago
- Bag of, not words, but tricks! ☆68 · Updated last year
- Explainable Zero-Shot Topic Extraction ☆63 · Updated 11 months ago
- Distributed skorch on Ray Train ☆58 · Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects ☆107 · Updated 2 years ago
- spaCy match and replace, maintaining conjugation ☆35 · Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition ☆31 · Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆36 · Updated 3 years ago
- ☆42 · Updated 2 years ago
- Metadata store for Production ML ☆88 · Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆56 · Updated 2 years ago
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system ☆77 · Updated 2 years ago
- Record matching and entity resolution at scale in Spark ☆35 · Updated last year
- causal-falsify: A Python library with algorithms for falsifying unconfoundedness assumption in a composite dataset from multiple sources. ☆26 · Updated 3 weeks ago
- A comprehensive reference for all topics related to building and maintaining microservices ☆67 · Updated 2 years ago
- Instant search for and access to many datasets in Pyspark. ☆34 · Updated 2 years ago
- ☆30 · Updated 3 years ago
- Python package for deduplication/entity resolution using active learning ☆81 · Updated 11 months ago