☆159 · Feb 25, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for simple-dataengineering-ai-stack
Users who are interested in simple-dataengineering-ai-stack are comparing it to the libraries listed below
- Cache the intermediate results of queries on timeseries data in DataFusion. ☆19 · Oct 29, 2024 · Updated last year
- Modern Data Stack in a box with dbt-duckdb and Apache Superset ☆16 · Mar 5, 2026 · Updated 2 weeks ago
- FUSE-based DuckDB file system 🦆 ☆49 · Jun 16, 2025 · Updated 9 months ago
- ☆10 · Feb 2, 2024 · Updated 2 years ago
- ☆22 · Dec 19, 2025 · Updated 3 months ago
- ☆16 · Nov 27, 2025 · Updated 3 months ago
- Trino Iceberg Metadata Insights via Streamlit ☆16 · Apr 9, 2025 · Updated 11 months ago
- High performance Privacy By Design using Matryoshka and Spark talk code ☆13 · May 21, 2019 · Updated 6 years ago
- ☆65 · Jan 20, 2026 · Updated 2 months ago
- Code samples related to "Harmonize, Search, and Analyze Loosely Coupled Datasets on AWS" (https://aws.amazon.com/blogs/big-data/harmonize… ☆22 · Jun 4, 2019 · Updated 6 years ago
- Exemplary fullstack Medium.com clone powered by Laravel Inertia Vue Typescript ☆11 · May 23, 2022 · Updated 3 years ago
- The Kafka message scheduling tool. ☆19 · Jan 20, 2025 · Updated last year
- Typed, annotated vectors for well-documented datasets ☆11 · Jan 30, 2026 · Updated last month
- This project demonstrates many of dbt's features when used with the Snowflake Data Cloud ☆31 · Feb 9, 2026 · Updated last month
- duckdb-etl-framework ☆15 · Dec 20, 2024 · Updated last year
- Extract Stats Q/A from Tables With Provenance ☆26 · Dec 27, 2025 · Updated 2 months ago
- merge two sorted lists fast ☆12 · Nov 15, 2023 · Updated 2 years ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆96 · Feb 22, 2025 · Updated last year
- Collection of AWS Lambdas for creating and managing Delta tables ☆57 · Updated this week
- Starter code for deploying a FastAPI app on AWS ECS ☆15 · Apr 10, 2024 · Updated last year
- Basemap.de world vector with a photon geocoder packaged as tauri app for any device ☆12 · Jan 5, 2025 · Updated last year
- Repository for Data Engineering Zoomcamp 2024 ☆14 · Mar 25, 2024 · Updated last year
- capstone project for Dataengineer.io bootcamp Public Repo ☆12 · Feb 20, 2024 · Updated 2 years ago
- FastGeoTable is a PostGIS geospatial api to enable creating/editing geographical tables within a spatial database. ☆13 · Aug 19, 2022 · Updated 3 years ago
- DuckDB Copilot Extension ☆10 · Jan 12, 2026 · Updated 2 months ago
- Build modern UIs in Jupyter with Python ☆12 · Dec 28, 2022 · Updated 3 years ago
- ✨A MCP server that provides intelligent access to the HoloViz ecosystem for humans and AIs. ☆28 · Mar 7, 2026 · Updated last week
- SQL transformation tool for DuckDB written in Rust ☆73 · Mar 13, 2025 · Updated last year
- Distributed data sync using trimerge ☆11 · Mar 26, 2024 · Updated last year
- ☆15 · Jan 21, 2022 · Updated 4 years ago
- For checkpoints/rewind, permission level & lsp in pi ☆39 · Feb 17, 2026 · Updated last month
- ☆11 · Oct 31, 2022 · Updated 3 years ago
- 🔧 Power tools for pi — repo autopsy, tsgo LSP, codex background loops, session reader, and more ☆31 · Updated this week
- ☆16 · Oct 26, 2021 · Updated 4 years ago
- DuckDB API for PHP 🐘 🦆 ☆77 · Feb 22, 2026 · Updated 3 weeks ago
- OpenStreetMap: Find residential areas with too few buildings in them ☆13 · Mar 9, 2023 · Updated 3 years ago
- Performant, highly available distributed storage using SeaweedFS in Docker Swarm ☆15 · Jan 10, 2023 · Updated 3 years ago
- python library for iceberg lake house on your local ☆14 · Jan 8, 2026 · Updated 2 months ago
- REST client for using Fortnox API ☆22 · Aug 12, 2022 · Updated 3 years ago