mozilla / docker-etlLinks
Collection of dockerized ETL jobs managed by data engineering.
☆20Updated last week
Alternatives and similar repositories for docker-etl
Users that are interested in docker-etl are comparing it to the libraries listed below
Sorting:
- Weekly Data Engineering Newsletter☆96Updated last year
- Utility functions for dbt projects running on Spark☆33Updated 7 months ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆113Updated last month
- The go to demo for public and private dbt Learn☆80Updated 5 months ago
- Astronomer Core Docker Images☆107Updated last year
- Fast iterative local development and testing of Apache Airflow workflows☆201Updated 3 weeks ago
- Bigquery ETL☆323Updated this week
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da…☆112Updated last year
- A DBT package to perform DataOps & administrative CI/CD on your data warehouse.☆16Updated 4 years ago
- Airflow configuration for Telemetry☆194Updated 2 weeks ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆74Updated last year
- Automatically discover and tag PII data across BigQuery tables and apply column-level access controls based on confidentiality level.☆59Updated this week
- Visualize dependencies between Airflow DAGs☆49Updated 4 years ago
- A Table format agnostic data sharing framework☆38Updated last year
- rb_status_plugin : Data confidence tool for Airflow☆12Updated 2 years ago
- Read Delta tables without any Spark☆47Updated last year
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆64Updated 3 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- The Picnic Data Vault framework.☆129Updated last year
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Make simple storing test results and visualisation of these in a BI dashboard☆47Updated last week
- PySpark schema generator☆43Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- ☆201Updated last year
- Pylint plugin for static code analysis on Airflow code☆95Updated 4 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- A guide for leading a data (engineering) team☆63Updated last year
- Palm CLI - the tool-belt for data teams☆47Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 2 years ago
- Airflow declarative DAGs via YAML☆133Updated last year