nydasco / real_time_streaming_pipelineLinks
An example repository showing how to leverage Kafka to stream your data
☆21Updated last year
Alternatives and similar repositories for real_time_streaming_pipeline
Users that are interested in real_time_streaming_pipeline are comparing it to the libraries listed below
Sorting:
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆16Updated 3 weeks ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆251Updated last month
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆153Updated last year
- Example repository showing how to build a data platform with Prefect, dbt and Snowflake☆107Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆54Updated 3 weeks ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆226Updated 3 weeks ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆119Updated 7 months ago
- Contribute to dlt verified sources 🔥☆100Updated 2 weeks ago
- ☆38Updated 7 months ago
- Python wrapper for the Sling CLI tool☆58Updated last week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Dagster Labs' open-source data platform, built with Dagster.☆412Updated this week
- Dagster University courses☆114Updated 2 weeks ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆225Updated last week
- A DataOps framework for building a lakehouse.☆53Updated this week
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆223Updated 6 months ago
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!☆233Updated last week
- ☆80Updated last year
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆26Updated last year
- ☆211Updated 9 months ago
- 🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.☆229Updated last week
- ☆160Updated 5 months ago
- Cost Efficient Data Pipelines with DuckDB☆58Updated 5 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆74Updated 2 weeks ago
- Repo for CDC with debezium blog post☆29Updated last year
- All things awesome related to Dagster!☆129Updated 3 weeks ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆270Updated last month
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆125Updated 9 months ago
- Personal project for setting up an open source data warehouse.☆31Updated 3 months ago
- An example of a Dagster project with a possible folder structure to organize the assets, jobs, repositories, schedules, and ops. Also has…☆100Updated last year