DeepHiveMind / Distributed_DataMesh_2.0_Cloud_Implementation
Distributed Data Mesh 2.0 | DataMesh-as-a-Code on Cloud | Theory to Industrialization
☆35Updated 2 years ago
Alternatives and similar repositories for Distributed_DataMesh_2.0_Cloud_Implementation:
Users who are interested in Distributed_DataMesh_2.0_Cloud_Implementation are comparing it to the repositories listed below
- New generation opensource data stack☆65Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆49Updated 4 years ago
- ☆95Updated last year
- This repository contains recipes for Apache Pinot.☆29Updated 3 months ago
- Docker environment to stream data from Kafka to Iceberg tables☆25Updated 11 months ago
- Discover the simplicity and strength of DuckDB, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆34Updated last year
- lakefs-samples repository☆77Updated 2 weeks ago
- A Flink application that demonstrates reading from and writing to Apache Kafka with Apache Flink☆20Updated last year
- ☆13Updated last year
- Data Mesh Manager (Community Edition)☆32Updated 3 weeks ago
- Delta Lake Documentation☆48Updated 8 months ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆131Updated last month
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated last year
- Generative AI in realtime with Confluent Cloud.☆22Updated 10 months ago
- ☆20Updated 5 years ago
- A Data Mesh demo repository☆13Updated 4 months ago
- Example project using DBT, Databricks and AdventureWorks sample database☆11Updated 2 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆74Updated this week
- real-time data + ML pipeline☆54Updated 3 weeks ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆42Updated 2 years ago
- ☆13Updated last year
- Curated list of resources about Apache Airflow☆19Updated 3 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆62Updated last year
- Data Mesh Architecture☆74Updated 7 months ago
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated 6 months ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆19Updated this week
- Example Set up For DBT Cloud using Github Integrations☆11Updated 4 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆50Updated last year
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆22Updated 3 months ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago