ml-tue / automated-string-cleaning
Repository for my master's thesis on automated string handling
☆16 Updated 3 years ago
Alternatives and similar repositories for automated-string-cleaning
Users who are interested in automated-string-cleaning are comparing it to the libraries listed below.
- SPEAR: Programmatically label and build training data quickly. ☆107 Updated last year
- Pipeline components that support partial_fit. ☆46 Updated 11 months ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆53 Updated 3 years ago
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects ☆80 Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… ☆108 Updated 10 months ago
- Explainable Zero-Shot Topic Extraction ☆62 Updated 10 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 3 years ago
- ☆30 Updated 3 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models. ☆25 Updated 4 years ago
- Python package for deduplication/entity resolution using active learning ☆80 Updated 10 months ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition ☆31 Updated 3 years ago
- A comprehensive tool for linguistic analysis of communities ☆49 Updated 3 years ago
- Distributed skorch on Ray Train ☆57 Updated 2 years ago
- ☆19 Updated 4 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. ☆30 Updated 3 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast ☆15 Updated 3 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included ☆28 Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆36 Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor ☆29 Updated 6 months ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently. ☆36 Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆87 Updated 3 months ago
- Prune your sklearn models ☆19 Updated 8 months ago
- Super Simple Similarities Service ☆149 Updated 3 months ago
- Execute arbitrary SQL queries on 🤗 Datasets ☆32 Updated last year
- ☆21 Updated 3 years ago
- Bag of, not words, but tricks! ☆68 Updated last year
- Instant search for and access to many datasets in Pyspark. ☆34 Updated 2 years ago
- ☆43 Updated 2 years ago
- ☆24 Updated 3 years ago