DeepHiveMind / Distributed_DataMesh_2.0_Cloud_Implementation
Distributed Data Mesh 2.0 | DataMesh-as-a-Code on Cloud | Theory to Industrialization
☆37Updated 2 years ago
Alternatives and similar repositories for Distributed_DataMesh_2.0_Cloud_Implementation:
Users that are interested in Distributed_DataMesh_2.0_Cloud_Implementation are comparing it to the libraries listed below
- New generation opensource data stack☆67Updated 2 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆141Updated 3 weeks ago
- ☆96Updated last year
- NiFi Processor for Apache Pulsar☆10Updated 5 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆27Updated last year
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated last week
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆12Updated 4 months ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- Prescriptive guidance for building, deploying, and monitoring machine learning models with Azure Databricks using containers in line with…☆23Updated 8 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆43Updated 2 years ago
- This repository contains recipes for Apache Pinot.☆30Updated last month
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated 2 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- Curated list of resources about Apache Airflow☆19Updated 4 years ago
- Delta Lake Documentation☆49Updated 10 months ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Example project using DBT, Databricks and AdventureWorks sample database☆11Updated 2 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆26Updated 8 months ago
- ☆13Updated last year
- Edit your data contract in the Data Contract Editor☆21Updated 6 months ago
- A Table format agnostic data sharing framework☆38Updated last year
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- A curated list of awesome Databricks resources, including Spark☆17Updated 9 months ago
- Unity Catalog UI☆40Updated 7 months ago
- my personal working directory of milvus projects☆17Updated last year
- Data Mesh Manager (Community Edition)☆35Updated 3 weeks ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆23Updated 5 months ago
- ☆18Updated last year