deepyaman / jaffle-shopLinks
Example project for building scalable data pipelines with Kedro and Ibis.
☆13Updated last month
Alternatives and similar repositories for jaffle-shop
Users that are interested in jaffle-shop are comparing it to the libraries listed below
Sorting:
- Code examples showing flow deployment to various types of infrastructure☆109Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 2 years ago
- A repository of runnable examples using ibis☆45Updated last year
- Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies☆33Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆115Updated last month
- Dask integration for Snowflake☆30Updated 3 weeks ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Jupyter Cell / Line Magics for DuckDB☆52Updated last week
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆34Updated 3 months ago
- Prefect integrations with SQLAlchemy.☆25Updated last year
- The DBT of ML, as Aligned describes data dependencies in ML systems, and reduce technical data debt☆60Updated this week
- A software engineering framework to jump start your machine learning projects☆37Updated last year
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆85Updated 8 months ago
- SQLMesh example projects☆33Updated last month
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆34Updated 7 months ago
- Prefect integrations for working with Docker☆43Updated last year
- A place to provide Coiled feedback☆19Updated 5 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- An abstraction layer for parameter tuning☆35Updated 11 months ago
- ☆11Updated 2 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆43Updated last week
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Updated last year
- ☆27Updated 3 years ago
- A cli for spinning up and managing Ray clusters for the Daft Query Engine.☆13Updated 6 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆54Updated last month
- Deploy a Prefect flow to serverless AWS Lambda function☆35Updated 2 years ago
- Write your dbt models using Ibis☆70Updated 5 months ago
- Time based splits for cross validation☆38Updated 3 weeks ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated 2 years ago