ml-tue / automated-string-cleaningLinks
Repository for my master thesis on automated string handling
β16Updated 4 years ago
Alternatives and similar repositories for automated-string-cleaning
Users that are interested in automated-string-cleaning are comparing it to the libraries listed below
Sorting:
- π Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projectsβ82Updated 3 years ago
- SPEAR: Programmatically label and build training data quickly.β108Updated last year
- Weakly Supervised End-to-End Learning (NeurIPS 2021)β156Updated 2 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.β30Updated 3 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkβ80Updated 3 years ago
- Pipeline components that support partial_fit.β46Updated last year
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API πβ53Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers.β214Updated 10 months ago
- Streamline scikit-learn model comparison.β144Updated 2 years ago
- π οΈ Tools for Transformers compression using PyTorch Lightning β‘β84Updated 9 months ago
- Super Simple Similarities Serviceβ153Updated 4 months ago
- Explainable Zero-Shot Topic Extractionβ63Updated last year
- A Python library aimed at dissecting and augmenting NER training data.β58Updated 2 years ago
- Metadata store for Production MLβ88Updated 2 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.β55Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)β152Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficientlyβ¦β108Updated 11 months ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.β36Updated 2 years ago
- Distributed skorch on Ray Trainβ58Updated 2 years ago
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning systemβ77Updated 2 years ago
- β30Updated 3 years ago
- β19Updated 4 years ago
- A PaaS End-to-End ML Setup with Metaflow, Serverless and SageMaker.β37Updated 4 years ago
- Repo contains Jupyter notebooks compiled during my review of the programming books listed.β13Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsβ¦β36Updated 3 years ago
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- Bag of, not words, but tricks!β68Updated last year
- β43Updated 2 years ago
- Code repository for the NAACL 2022 paper "ExSum: From Local Explanations to Model Understanding"β64Updated 3 years ago
- An efficient, to-the-point, and easy-to-use checklist to following when deploying an ML model into production.β30Updated 2 years ago