fpgmaas / stream-iot
An end-to-end workflow for processing streaming data on Azure.
☆15 · Updated 8 months ago
Alternatives and similar repositories for stream-iot
Users who are interested in stream-iot are comparing it to the repositories listed below.
- build dw with dbt ☆45 · Updated 7 months ago
- Cost Efficient Data Pipelines with DuckDB ☆53 · Updated 3 weeks ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a… ☆35 · Updated last year
- Code for my "Efficient Data Processing in SQL" book. ☆56 · Updated 9 months ago
- New generation opensource data stack ☆68 · Updated 3 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆28 · Updated 2 years ago
- Repo for CDC with debezium blog post ☆28 · Updated 8 months ago
- Delta Lake Documentation ☆49 · Updated 11 months ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev ☆33 · Updated 3 weeks ago
- Repository containing various utils related to Snowflake migration at Faire. ☆12 · Updated 2 years ago
- Template for Data Engineering and Data Pipeline projects ☆112 · Updated 2 years ago
- Code snippets for Data Engineering Design Patterns book ☆116 · Updated 2 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆77 · Updated 11 months ago
- duckdb-etl-framework ☆11 · Updated 5 months ago
- ☆36 · Updated 2 months ago
- ☆18 · Updated 9 months ago
- Full stack data engineering tools and infrastructure set-up ☆53 · Updated 4 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 · Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆83 · Updated last year
- ☆203 · Updated 4 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset ☆42 · Updated 6 months ago
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da… ☆25 · Updated 2 years ago
- Build your feature store with macros right within your dbt repository ☆38 · Updated 2 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆220 · Updated last month
- A declarative PySpark framework for row- and aggregate-level data quality validation. ☆46 · Updated this week
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 · Updated last year
- A guide for leading a data (engineering) team ☆62 · Updated last year
- Demo on how to use Prefect with Docker ☆25 · Updated 2 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆11 · Updated last year
- Demo repository to lambda-fy your dbt runs ☆11 · Updated last year