HotTechStack / simple-dataengineering-ai-stackLinks
β153Updated 2 months ago
Alternatives and similar repositories for simple-dataengineering-ai-stack
Users that are interested in simple-dataengineering-ai-stack are comparing it to the libraries listed below
Sorting:
- The Open-Source Enterprise Data Platform in a single Portalβ264Updated this week
- Contribute to dlt verified sources π₯β104Updated last month
- β179Updated 8 months ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team β¦β131Updated this week
- Demo Project for Open Source MDSβ170Updated 5 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Supersetβ55Updated 3 months ago
- β65Updated 8 months ago
- A write-audit-publish implementation on a data lake without the JVMβ45Updated last year
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface.β23Updated last year
- Python package for querying iceberg data through duckdb.β72Updated last year
- β¨ Build dashboards with end-to-end version control. π CLI w/ batteries included, no infra required. Develop on your laptop for instant rβ¦β92Updated last week
- Pushdown compute from Snowflake to DuckDB running on your infrastructureβ203Updated 3 months ago
- New generation opensource data stackβ76Updated 3 years ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.β41Updated last year
- β393Updated this week
- β41Updated 9 months ago
- A playground for running duckdb as a stateless query engine over a data lakeβ218Updated 2 years ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.β95Updated 11 months ago
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)β116Updated 11 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principleβ¦β124Updated 10 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Supersetβ258Updated last month
- DataKit is a browser-based data analysis platform that processes multi-gigabyte files locally. All processing happens in your browser - nβ¦β277Updated 2 weeks ago
- Python wrapper for the Sling CLI toolβ63Updated last month
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those pluβ¦β59Updated 2 years ago
- Iceberg Playground in a Boxβ67Updated 7 months ago
- β26Updated 3 years ago
- A Rust based data/CSV/Parquet file generatorβ63Updated 11 months ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.devβ39Updated 8 months ago
- Repo for orienting dbt users to the Dagster asset frameworkβ56Updated 3 years ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidenceβ232Updated last month