tcmlabs / hexagonal-architecture-python-spark
Hexagonal (ports and adapters) architecture applied to Spark and Python data engineering project
☆33Updated last year
Alternatives and similar repositories for hexagonal-architecture-python-spark:
Users that are interested in hexagonal-architecture-python-spark are comparing it to the libraries listed below
- Code snippets for Data Engineering Design Patterns book☆77Updated 3 weeks ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆217Updated 2 weeks ago
- Demo for GitHub Universe 2022☆12Updated 2 years ago
- ✨ A Pydantic to PySpark schema library☆81Updated this week
- A cool simple example of functional data engineering☆33Updated 2 years ago
- Some example projects for Data Engineers to build, end-to-end.☆28Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆66Updated 6 months ago
- Fake Snowflake Connector for Python. Run, mock and test Snowflake DB locally.☆125Updated this week
- ☆113Updated 8 months ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- ☆16Updated last year
- ☆75Updated 5 months ago
- Example repo to create end to end tests for data pipeline.☆23Updated 9 months ago
- Possibly the fastest DataFrame-agnostic quality check library in town.☆185Updated 2 weeks ago
- Code for dbt tutorial☆155Updated 10 months ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆126Updated 2 years ago
- ☆43Updated 3 years ago
- All things awesome related to Dagster!☆101Updated last month
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆66Updated last year
- A guide for leading a data (engineering) team☆62Updated 11 months ago
- ☆20Updated 3 years ago
- Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate☆113Updated last year
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆117Updated 2 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆108Updated last week
- Example repository showing how to build a data platform with Prefect, dbt and Snowflake☆100Updated 2 years ago
- Simple stream processing pipeline☆100Updated 9 months ago
- Cost Efficient Data Pipelines with DuckDB☆51Updated 8 months ago
- An example of how to deploy Apache Airflow on Amazon ECS Fargate☆41Updated 3 years ago
- Template for Data Engineering and Data Pipeline projects☆109Updated 2 years ago