Building a Modern Data Lake with MinIO, Spark, and Airflow via Docker.
☆23 · May 11, 2024 · Updated last year
Alternatives and similar repositories for docker-airflow-spark
Users who are interested in docker-airflow-spark are comparing it to the libraries listed below.
- Open episode of the data engineering practice course ☆32 · Jul 2, 2024 · Updated last year
- The simple ETL with docker container ☆66 · May 30, 2025 · Updated 9 months ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks ☆24 · Apr 2, 2022 · Updated 3 years ago
- Spark Standalone & Livy ☆11 · Jul 13, 2021 · Updated 4 years ago
- This repository contains a set of hands-on challenges designed to introduce you to Dapr's most popular APIs and give you a starting point… ☆22 · Dec 11, 2024 · Updated last year
- ☆13 · Dec 30, 2022 · Updated 3 years ago
- Dockerizing and Consuming an Apache Livy environment ☆13 · Jun 29, 2022 · Updated 3 years ago
- ☆16 · Jan 19, 2022 · Updated 4 years ago
- Soname ALerts & MONitoring ☆19 · Jan 21, 2025 · Updated last year
- ☆41 · Jan 24, 2023 · Updated 3 years ago
- ☆37 · Jan 29, 2021 · Updated 5 years ago
- Example of deploying Lightdash on GCP cloud run ☆20 · Jun 16, 2021 · Updated 4 years ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt ☆21 · Sep 20, 2023 · Updated 2 years ago
- Anonymize faces in video stream ☆10 · Oct 24, 2022 · Updated 3 years ago
- ☆16 · Feb 12, 2025 · Updated last year
- Writing PySpark logs in Apache Spark and Databricks ☆17 · Jun 13, 2022 · Updated 3 years ago
- Project with Airflow + Spark + MinIO + Postgres + Python3.8 ☆28 · Sep 9, 2022 · Updated 3 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC) ☆66 · Sep 23, 2023 · Updated 2 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces. ☆81 · Aug 21, 2023 · Updated 2 years ago
- A system for acquiring data on people, vehicles, and companies from various sources ☆15 · Nov 1, 2022 · Updated 3 years ago
- Docker with Airflow and Spark standalone cluster ☆263 · Aug 5, 2023 · Updated 2 years ago
- ☆19 · Feb 25, 2022 · Updated 4 years ago
- ☆16 · Jul 25, 2025 · Updated 8 months ago
- The Modern Data Stack in a (Smaller) Box ☆12 · Jan 28, 2023 · Updated 3 years ago
- A compilation of components to optimize the development of your ecommerce ☆13 · Jun 23, 2025 · Updated 9 months ago
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this … ☆12 · Mar 18, 2026 · Updated last week
- Visualize linear programming at https://lpviz.net ☆33 · Jan 20, 2026 · Updated 2 months ago
- Daily updated fake data for DBT learning and projects ☆35 · Jan 7, 2024 · Updated 2 years ago
- Robust Bond Portfolio Construction via Convex-Concave Saddle Point Optimization ☆13 · May 13, 2024 · Updated last year
- Code to demonstrate data engineering metadata & logging best practices ☆21 · Mar 12, 2024 · Updated 2 years ago
- FSUIPC external application interface tools listening tools written in nodeJS ☆14 · May 15, 2023 · Updated 2 years ago
- ☆11 · Jul 30, 2017 · Updated 8 years ago
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling. ☆10 · Feb 6, 2025 · Updated last year
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack. ☆16 · Jun 19, 2022 · Updated 3 years ago
- Data Analysis and Image Processing Python Course ☆12 · Nov 4, 2014 · Updated 11 years ago
- ☆23 · Dec 30, 2025 · Updated 2 months ago
- ☆18 · Feb 2, 2023 · Updated 3 years ago
- A Postgres data warehouse for processing synthetic data using IAC principles ☆19 · Feb 27, 2023 · Updated 3 years ago
- Writes a CSV file to Postgres, reads the table, and modifies it. Writes more tables to Postgres with Airflow. ☆37 · Sep 1, 2023 · Updated 2 years ago