adidas / m3d-api
Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of data lakes.
☆31Updated last year
Alternatives and similar repositories for m3d-api:
Users that are interested in m3d-api are comparing it to the libraries listed below
- M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes.☆18Updated 3 years ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Unity Catalog UI☆40Updated 7 months ago
- A curated list of awesome Databricks resources, including Spark☆17Updated 9 months ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- Spark app to merge different schemas☆23Updated 4 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated last month
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- ☆96Updated last year
- Curated list of resources about Apache Airflow☆19Updated 4 years ago
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆12Updated 3 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- ☆13Updated last year
- An open specification for data products in Data Mesh☆55Updated 5 months ago
- Utility functions for dbt projects running on Spark☆32Updated last month
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated last week
- Delta Lake Documentation☆49Updated 9 months ago
- Data Tools Subjective List☆83Updated last year
- Example project using DBT, Databricks and AdventureWorks sample database☆11Updated 2 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage dat…☆16Updated 4 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 2 weeks ago
- New generation opensource data stack☆65Updated 2 years ago
- A curated list of dagster code snippets for data engineers☆54Updated last year
- A Table format agnostic data sharing framework☆38Updated last year
- Dremio Container Tools☆160Updated 2 months ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- Rules based grant management for Snowflake☆40Updated 6 years ago