deepyaman / jaffle-shopLinks
Example project for building scalable data pipelines with Kedro and Ibis.
☆13Updated this week
Alternatives and similar repositories for jaffle-shop
Users that are interested in jaffle-shop are comparing it to the libraries listed below
Sorting:
- Dask integration for Snowflake☆30Updated 4 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆119Updated 4 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou …☆114Updated last month
- Code examples showing flow deployment to various types of infrastructure☆111Updated 2 years ago
- A repository of runnable examples using ibis☆46Updated last year
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt☆60Updated this week
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆93Updated last year
- A place to provide Coiled feedback☆28Updated 9 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies☆34Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- Notebooks that support blog posts and tech talks on Dask / Coiled.☆48Updated 9 months ago
- Templates for your Kedro projects.☆80Updated this week
- An abstraction layer for parameter tuning☆35Updated last month
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Updated 2 years ago
- Write your dbt models using Ibis☆74Updated 8 months ago
- First-party plugins maintained by the Kedro team.☆110Updated this week
- ☆27Updated 3 years ago
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆14Updated 10 months ago
- Time based splits for cross validation☆39Updated 2 weeks ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆37Updated 7 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆229Updated last month
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- Jupyter Cell / Line Magics for DuckDB☆54Updated 2 months ago
- Build your feature store with macros right within your dbt repository☆39Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Swiple enables you to easily observe, understand, validate and improve the quality of your data☆84Updated this week
- Coming soon☆62Updated 2 years ago
- ☆11Updated 2 years ago