adidas / m3d-apiLinks
Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of data lakes.
β31Updated 2 years ago
Alternatives and similar repositories for m3d-api
Users that are interested in m3d-api are comparing it to the libraries listed below
Sorting:
- M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes.β18Updated 4 years ago
- π Run, schedule, and manage your dbt jobs using Kubernetes.β24Updated 6 years ago
- A python client library for the Stitch Import APIβ42Updated last year
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and repβ¦β20Updated 5 years ago
- Delta reader for the Ray open-source toolkit for building ML applicationsβ46Updated last year
- A curated list of awesome Databricks resources, including Sparkβ20Updated last year
- Fivetran data models for QuickBooks using dbt.β33Updated this week
- β96Updated last year
- Sample configuration to deploy a modern data platform.β88Updated 3 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.β79Updated last week
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested daβ¦β112Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.β126Updated 3 years ago
- Full stack data engineering tools and infrastructure set-upβ53Updated 4 years ago
- Apache Flink/Apache Kafka streaming data analytics demonstration using Streaming Synthetic Sales Data Generatorβ12Updated last year
- Yet Another (Spark) ETL Frameworkβ21Updated last year
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data piβ¦β95Updated last week
- Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes clusterβ38Updated 4 years ago
- Awesome list of dataops products, open source and resourcesβ24Updated 3 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage datβ¦β16Updated 4 years ago
- β11Updated 7 months ago
- Curated list of resources about Apache Airflowβ19Updated 4 years ago
- Codes for website https://manuzhang.github.io/awesome-streaming/β10Updated 7 months ago
- The sane way of building a data layer in Airflowβ24Updated 5 years ago
- A K8s-based infrastructure for analyticsβ24Updated 5 years ago
- DataHub on AWS demonstration resourcesβ10Updated 2 years ago
- A template for an AWS Lambda function that triggers Prefect Flow Runsβ20Updated 3 years ago
- dbt package for monitoring airflow DAGs and tasksβ29Updated 4 months ago
- A python package to create a database on the platform using our moj data warehousing frameworkβ21Updated 2 weeks ago
- dlt-dagster-demoβ11Updated last year
- β11Updated 5 years ago