vraj-ucsd / ML-Data-Prep-Zoo
☆31 · Updated 4 years ago
Alternatives and similar repositories for ML-Data-Prep-Zoo
Users who are interested in ML-Data-Prep-Zoo are comparing it to the libraries listed below.
- this repo might get accepted ☆28 · Updated 4 years ago
- Inspect ML Pipelines in Python in the form of a DAG ☆70 · Updated last year
- 🐍 Material for PyData Global 2021 Presentation: Effective Testing for Machine Learning Projects ☆81 · Updated 4 years ago
- Pipeline components that support partial_fit. ☆46 · Updated last year
- It's a cooler way to store simple linear models. ☆27 · Updated last year
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. ☆30 · Updated 4 years ago
- ☆22 · Updated 2 years ago
- SPEAR: Programmatically label and build training data quickly. ☆109 · Updated last year
- Super Simple Similarities Service ☆155 · Updated 9 months ago
- Repository for my master thesis on automated string handling ☆16 · Updated 4 years ago
- HiPlot fetcher for experiments logged with MLflow ☆14 · Updated 3 years ago
- Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptio… ☆42 · Updated 2 years ago
- Record matching and entity resolution at scale in Spark ☆36 · Updated 2 years ago
- A library of Reversible Data Transforms ☆131 · Updated last week
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models. ☆41 · Updated 2 years ago
- Unified slicing for all Python data structures. ☆37 · Updated 5 months ago
- Bag of, not words, but tricks! ☆68 · Updated 2 years ago
- Python package for deduplication/entity resolution using active learning ☆83 · Updated last year
- Weakly Supervised End-to-End Learning (NeurIPS 2021) ☆156 · Updated 2 years ago
- Abstractions for feature engineering on large graphs of tabular data. ☆24 · Updated 2 months ago
- Instant search for and access to many datasets in Pyspark. ☆34 · Updated 3 years ago
- openclean - Data Cleaning and data profiling library for Python ☆83 · Updated 4 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆56 · Updated 2 years ago
- A proof of concept library for generating and running machine learning model tests ☆13 · Updated 5 years ago
- [Intemarché] Sales forecasting challenge ☆11 · Updated 4 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆53 · Updated 4 years ago
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system ☆76 · Updated 3 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects ☆107 · Updated 2 years ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆120 · Updated 6 months ago
- 🍦 Deployment tool for online machine learning models ☆98 · Updated 3 years ago