deepyaman / jaffle-shop
Example project for building scalable data pipelines with Kedro and Ibis.
☆13Updated last month
Alternatives and similar repositories for jaffle-shop
Users that are interested in jaffle-shop are comparing it to the libraries listed below
- Dask integration for Snowflake☆30Updated this week
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆84Updated 8 months ago
- Code examples showing flow deployment to various types of infrastructure☆109Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- The dbt of ML: Aligned describes data dependencies in ML systems and reduces technical data debt☆60Updated this week
- Demo of DuckDB with dashboarding tools: Evidence, Streamlit, and Rill☆17Updated last year
- A repository of runnable examples using ibis☆44Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆114Updated 2 weeks ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- Jupyter Cell / Line Magics for DuckDB☆51Updated last month
- Fake Pandas / PySpark DataFrame creator☆47Updated last year
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆34Updated 3 months ago
- ☆11Updated 2 years ago
- Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies☆33Updated last year
- Templates for your Kedro projects.☆77Updated last week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- A software engineering framework to jump start your machine learning projects☆37Updated last year
- Fast approximate joins on string columns for polars dataframes.☆13Updated 9 months ago
- Experimental MLflow plugin for Google Cloud Vertex AI☆38Updated 2 months ago
- ☆27Updated 11 months ago
- Full stack data engineering tools and infrastructure set-up☆55Updated 4 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆54Updated last month
- ☆27Updated 3 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 4 months ago
- SQLMesh example projects☆32Updated last month
- Time based splits for cross validation☆38Updated last week
- fsspec-compatible Azure Datalake and Azure Blob Storage access☆196Updated last week
- An abstraction layer for parameter tuning☆35Updated 11 months ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆24Updated last year