mfcabrera / hooqu
hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to Python
☆26Updated 2 months ago
Alternatives and similar repositories for hooqu:
Users that are interested in hooqu are comparing it to the libraries listed below
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 5 months ago
- An abstraction layer for parameter tuning☆35Updated 5 months ago
- Assessing whether data from database complies with reference information.☆42Updated this week
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆47Updated last year
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated 11 months ago
- Similarity encoding of dirty categorical variables (strings)☆20Updated 6 years ago
- ☀️🦶 A lightweight framework for collaborative, open-source feature engineering☆32Updated 3 years ago
- A python library bakeoff for medium sized datasets☆24Updated last year
- Record matching and entity resolution at scale in Spark☆34Updated last year
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…☆42Updated 3 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆44Updated last year
- Primrose modeling framework for simple production models☆33Updated 10 months ago
- Data Catalog for Databases and Data Warehouses☆32Updated last year
- 📈🔍 Lets Python do AB testing analysis☆76Updated 10 months ago
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆100Updated last month
- A software engineering framework to jump start your machine learning projects☆37Updated 7 months ago
- Prune your sklearn models☆19Updated 3 months ago
- Python package implementing transformers for pre processing steps for machine learning.☆54Updated this week
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated 3 weeks ago
- this repo might get accepted☆29Updated 4 years ago
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.☆35Updated 3 years ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated 4 months ago
- real-time data + ML pipeline☆54Updated 2 weeks ago
- ☆19Updated last year
- Fast, resilient and reproducible data analysis with cached SQL queries☆30Updated last year
- Session-based recommender system: Serenade☆80Updated 3 months ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago