mfcabrera / hooqu
hooqu is a library built on top of Pandas-like DataFrames for defining "unit tests for data". It is a spiritual port of Apache Deequ to Python.
☆25Updated this week
Related projects
Alternatives and complementary repositories for hooqu
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- A software engineering framework to jump start your machine learning projects☆36Updated 4 months ago
- Primrose modeling framework for simple production models☆34Updated 7 months ago
- Record matching and entity resolution at scale in Spark☆31Updated last year
- A python library bakeoff for medium sized datasets☆24Updated last year
- Build your feature store with macros right within your dbt repository☆37Updated last year
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…☆42Updated 3 years ago
- ☀️🦶 A lightweight framework for collaborative, open-source feature engineering☆32Updated 3 years ago
- Inspect ML Pipelines in Python in the form of a DAG☆68Updated 8 months ago
- Delta reader for the Ray open-source toolkit for building ML applications☆42Updated 9 months ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆46Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆93Updated last month
- Helpers & syntactic sugar for PySpark.☆60Updated last year
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆90Updated 8 months ago
- 🎯 kettle is a CLI tool for creating and deploying cloud functions & docker containers for machine learning☆32Updated last year
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated last year
- A Toolbox for the Evaluation of machine learning Explanations☆15Updated 10 months ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated last month
- Dask integration for Snowflake☆30Updated 4 months ago
- Assessing whether data from database complies with reference information.☆42Updated this week
- A Delta Lake reader for Dask☆46Updated last month
- Repository for making a GitHub Action for deploying to Kubeflow.☆35Updated 2 years ago
- Self-contained demo using Kafka, Materialize and Metabase to check what's streaming on Twitch. All you need is Docker and Twitch access t…☆25Updated 2 years ago
- Read Delta tables without any Spark☆47Updated 8 months ago
- CLI for data platform☆19Updated 11 months ago