handreassa / delta-dockerLinks
Template to spin up delta lake locally using docker
☆23Updated 2 years ago
Alternatives and similar repositories for delta-docker
Users that are interested in delta-docker are comparing it to the libraries listed below
Sorting:
- build dw with dbt☆47Updated last year
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆270Updated last month
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆279Updated last year
- ☆139Updated 8 months ago
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆147Updated 2 weeks ago
- Building a Data Pipeline with an Open Source Stack☆54Updated 4 months ago
- Code for "Efficient Data Processing in Spark" Course☆346Updated 3 weeks ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆198Updated last year
- A simple VS Code devcontainer setup for local PySpark development☆55Updated 2 years ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks☆56Updated 4 months ago
- Delta-Lake, ETL, Spark, Airflow☆48Updated 3 years ago
- Complete Azure Data Factory CICD Process Via Azure Pipeline☆26Updated last year
- Notebooks to learn Databricks Lakehouse Platform☆37Updated last week
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆153Updated last year
- Python data repo, jupyter notebook, python scripts and data.☆536Updated 11 months ago
- ☆124Updated last year
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆78Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆46Updated last year
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team …☆128Updated last week
- Template for Data Engineering and Data Pipeline projects☆114Updated 2 years ago
- Code snippets for Data Engineering Design Patterns book☆256Updated 7 months ago
- Local Environment to Practice Data Engineering☆141Updated 10 months ago
- Delta Lake Documentation☆50Updated last year
- ☆42Updated 3 years ago
- Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…☆69Updated last year
- Docker with Airflow and Spark standalone cluster☆261Updated 2 years ago
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process☆47Updated 2 years ago
- Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake☆224Updated 4 months ago
- (Python, PySpark)☆11Updated 4 years ago