limx0 / prefect-lakefs
☆12Updated last year
Alternatives and similar repositories for prefect-lakefs
Users who are interested in prefect-lakefs are comparing it to the libraries listed below.
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆95Updated 10 months ago
- A playground for running duckdb as a stateless query engine over a data lake☆216Updated 2 years ago
- ☆23Updated last year
- Python stream processing for analytics☆41Updated this week
- ☆80Updated 2 years ago
- Parse dbt artifacts and search dbt models with Algolia☆52Updated 4 years ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS"☆30Updated 4 years ago
- ☆92Updated last year
- Pocket data flows orchestrated using Prefect☆48Updated 9 months ago
- Self-contained demo using Kafka, Materialize and Metabase to check what's streaming on Twitch. All you need is Docker and Twitch access t…☆25Updated 3 years ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆36Updated last year
- Build your feature store with macros right within your dbt repository☆39Updated 3 years ago
- Read Delta tables without any Spark☆47Updated last year
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆201Updated 2 months ago
- Write your dbt models using Ibis☆74Updated 9 months ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- A write-audit-publish implementation on a data lake without the JVM☆45Updated last year
- The Modern Data Stack in a (Smaller) Box☆12Updated 2 years ago
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆124Updated 11 months ago
- ☆81Updated 10 months ago
- dagster scikit-learn pipeline example.☆46Updated 2 years ago
- Azure extension for DuckDB☆70Updated 2 weeks ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆333Updated 2 years ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆38Updated 8 months ago
- A web extension to empower dbt users☆27Updated 3 years ago
- Tools for making Prefect work better for typical data science workflows☆18Updated 3 years ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆119Updated 5 months ago
- Read Apache Arrow batches from ODBC data sources in Python☆73Updated last month
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily☆108Updated this week
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)☆113Updated 10 months ago