hachej / multi-engine-data-stack
☆19 Updated 4 months ago
Related projects
Alternatives and complementary repositories for multi-engine-data-stack
- Supporting materials and code examples for my course in data engineering for machine learning. ☆38 Updated 2 years ago
- Nicely modeled data built on the GitHub Archive. ☆58 Updated 8 months ago
- ☆26 Updated last year
- A serverless DuckDB deployment on GCP ☆35 Updated 2 years ago
- A DuckDB-powered command-line interface for Snowflake security, governance, operations, and cost optimization. ☆37 Updated 3 months ago
- An experimental Athena extension for DuckDB 🐤 ☆50 Updated 9 months ago
- ☆66 Updated last month
- Repo for orienting dbt users to the Dagster asset framework ☆50 Updated 2 years ago
- Linear regression in SQL using dbt ☆66 Updated last month
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 Updated 8 months ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S… ☆12 Updated 4 months ago
- Proof-of-concept extension combining the delta extension with Unity Catalog ☆53 Updated this week
- Unity Catalog UI ☆39 Updated 2 months ago
- A dbt-Core package for generating models from an activity stream. ☆39 Updated 7 months ago
- A write-audit-publish implementation on a data lake without the JVM ☆41 Updated 3 months ago
- 📦 Example repository showing how to use dbt inside Visual Studio Code development containers ☆39 Updated last year
- [DEPRECATED] A dbt adapter for Excel. ☆90 Updated last year
- A cool, simple example of functional data engineering ☆33 Updated last year
- ☆52 Updated 4 months ago
- ☆42 Updated 2 weeks ago
- Data-aware orchestration with Dagster, dbt, and Airbyte ☆30 Updated last year
- Department of Education (DOE) for New South Wales (AUS) data stack in a box ☆27 Updated last week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆57 Updated this week
- ☆84 Updated 2 years ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… ☆56 Updated last year
- Fake Pandas / PySpark DataFrame creator ☆42 Updated 8 months ago
- Parse dbt artifacts and search dbt models with Algolia ☆52 Updated 3 years ago
- ☆16 Updated last year
- Analyse your electricity usage data from Belgian smart meters with dbt, DuckDB and Evidence ☆20 Updated last year