aws / amazon-mwaa-docker-images
☆46 Updated last week
Alternatives and similar repositories for amazon-mwaa-docker-images:
Users who are interested in amazon-mwaa-docker-images are comparing it to the repositories listed below:
- Delta Lake Documentation ☆49 Updated 10 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆26 Updated 8 months ago
- ☆11 Updated 5 months ago
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs ☆41 Updated 11 months ago
- Unity Catalog UI ☆40 Updated 7 months ago
- Utility functions for dbt projects running on Spark ☆32 Updated 2 months ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario… ☆23 Updated 5 months ago
- A curated list of dagster code snippets for data engineers ☆54 Updated last year
- Code snippets for Data Engineering Design Patterns book ☆80 Updated last month
- Data Tools Subjective List ☆83 Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆61 Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆35 Updated 2 months ago
- Build DataOps platform with Apache Airflow and dbt on AWS ☆55 Updated 3 years ago
- Example Dagster Cloud code for the Hooli Data Engineering organization. ☆1 Updated 3 weeks ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆29 Updated 2 years ago
- Demo for GitHub Universe 2022 ☆12 Updated 2 years ago
- Simplify working with DataHub API endpoints ☆48 Updated 3 weeks ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 Updated 2 years ago
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer ☆147 Updated this week
- A curated list of awesome Databricks resources, including Spark ☆17 Updated 9 months ago
- This repository contains the dbt-glue adapter ☆116 Updated last week
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow. ☆194 Updated last week
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work ☆47 Updated 2 years ago
- A DataOps framework for building a lakehouse. ☆50 Updated last week
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 Updated 6 years ago
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e… ☆106 Updated last month
- dbt (data build tool) adapter for Dremio ☆51 Updated last week
- Makes it simple to store test results and visualise them in a BI dashboard ☆43 Updated last month
- Pytest plugin for dbt core ☆60 Updated 3 months ago
- Example code for running Spark and Hive jobs on EMR Serverless. ☆162 Updated 3 months ago