mozilla / docker-etl
Collection of dockerized ETL jobs managed by data engineering.
☆20 Updated this week
Alternatives and similar repositories for docker-etl
Users who are interested in docker-etl are comparing it to the repositories listed below
- Utility functions for dbt projects running on Spark ☆34 Updated 3 weeks ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 Updated 3 years ago
- Read Delta tables without any Spark ☆47 Updated last year
- A DBT package to perform DataOps & administrative CI/CD on your data warehouse. ☆16 Updated 4 years ago
- Weekly Data Engineering Newsletter ☆96 Updated last year
- Fake Pandas / PySpark DataFrame creator ☆48 Updated last year
- Sample configuration to deploy a modern data platform. ☆89 Updated 4 years ago
- Supporting materials/code examples for my course in data engineering for machine learning. ☆39 Updated 3 years ago
- Build your feature store with macros right within your dbt repository ☆39 Updated 3 years ago
- End-to-end DataOps platform deployed by Terraform. ☆69 Updated 9 months ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 Updated 2 years ago
- The Picnic Data Vault framework. ☆130 Updated this week
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows. ☆81 Updated last year
- A table-format-agnostic data sharing framework ☆42 Updated last year
- Makes it simple to store test results and visualise them in a BI dashboard ☆51 Updated 3 weeks ago
- Parse dbt artifacts and search dbt models with Algolia ☆52 Updated 4 years ago
- The go-to demo for public and private dbt Learn ☆80 Updated 9 months ago
- PySpark schema generator ☆43 Updated 2 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated 2 years ago
- Demo of DuckDB with dashboarding tools: Evidence, Streamlit, and Rill ☆21 Updated 2 years ago
- A guide for leading a data (engineering) team ☆63 Updated last year
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da… ☆113 Updated 2 years ago
- Delta Lake examples ☆236 Updated last year
- ☆23 Updated 4 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python. ☆115 Updated last week
- Full stack data engineering tools and infrastructure set-up ☆57 Updated 4 years ago
- A _simple_ starter template for Snowflake Cloud Data Platform ☆39 Updated 3 years ago
- Pylint plugin for static code analysis on Airflow code ☆96 Updated 5 years ago
- Data Catalog Tag Templates ☆30 Updated 8 months ago
- ☆202 Updated 2 years ago