tusharchou / local-data-platform
Python library for an Iceberg lakehouse on your local machine
☆14 Updated last month
Alternatives and similar repositories for local-data-platform
Users who are interested in local-data-platform are comparing it to the libraries listed below.
- Data Agents are intelligent assistants built by data engineers to help non-data professionals navigate the organization’s data infrastruc… ☆19 Updated 9 months ago
- Example files used in the DuckDB - Unity Catalog blog ☆10 Updated last year
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆96 Updated 11 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset ☆258 Updated last month
- ☆393 Updated last week
- ☆178 Updated 8 months ago
- A compute manifest and composable tools for ML, built on Ibis, DataFusion, and Arrow Flight. ☆484 Updated this week
- This repository serves as a comprehensive guide to effective data modeling and robust data quality assurance using popular open-source to… ☆39 Updated 2 years ago
- ☆41 Updated 9 months ago
- The smallest DuckDB SQL orchestrator on Earth. ☆336 Updated 2 months ago
- The logos on datastackdiagram.com ☆18 Updated 9 months ago
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe) ☆117 Updated 11 months ago
- DuckDB CronJob Extension ☆45 Updated last week
- Manage Unity Catalog tables with Pydantic Models ☆10 Updated 11 months ago
- A lightweight Python-based tool for extracting and analyzing column lineage for dbt projects ☆199 Updated this week
- Python Package for ducklake ☆20 Updated 8 months ago
- A Postgres Proxy Server in Python ☆316 Updated last year
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle… ☆124 Updated 10 months ago
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDB ☆321 Updated last week
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API… ☆20 Updated last year
- DuckDB HTTP API Server and Query Interface in a Community Extension ☆266 Updated last week
- End-to-end data engineering project to get insights from PyPI using Python, DuckDB, MotherDuck & Evidence ☆232 Updated 2 months ago
- Turning PySpark Into a Universal DataFrame API ☆485 Updated this week
- Quickstart for any service ☆167 Updated this week
- Declarative context engineering for agents ☆436 Updated this week
- ☆153 Updated 2 months ago
- Open Data Stack Platform: a collection of projects and pipelines built with open data stack tools for scalable, observable data platform… ☆22 Updated last month
- LLM based AI Agent to automate Data Analysis for dbt projects with remote MCP server ☆159 Updated 6 months ago
- 🚀 GizmoSQL — High-Performance SQL Server ☆282 Updated last week
- Pushdown compute from Snowflake to DuckDB running on your infrastructure ☆204 Updated 3 months ago