yTek01 / docker-spark-airflowView external linksLinks
☆41Jan 24, 2023Updated 3 years ago
Alternatives and similar repositories for docker-spark-airflow
Users that are interested in docker-spark-airflow are comparing it to the libraries listed below
Sorting:
- Docker with Airflow and Spark standalone cluster☆262Aug 5, 2023Updated 2 years ago
- ☆12Mar 17, 2022Updated 3 years ago
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆23May 11, 2024Updated last year
- Project with Airflow + Spark + MinIO + Postgres + Python3.8☆28Sep 9, 2022Updated 3 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆51Dec 8, 2022Updated 3 years ago
- Typings for Confluent Kafka Python Client☆27Dec 2, 2025Updated 2 months ago
- Companion repository that goes along with Snowflake's "Advanced Data Engineering with Snowflake" course☆21Apr 23, 2025Updated 9 months ago
- ☆19Feb 25, 2022Updated 3 years ago
- Jupyter notebooks for the teaching of mechanics☆11Oct 8, 2024Updated last year
- A partially implemented ODBC driver for the Trino distributed SQL engine☆18Feb 2, 2026Updated last week
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆98Jun 7, 2024Updated last year
- A script/docker that automatically translates PDFs using the DeepL API☆11Jan 18, 2026Updated 3 weeks ago
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- Base Kafka Producer, consumer, flask api and PySpark Structured streaming Job☆11Oct 20, 2021Updated 4 years ago
- Distributed data sync using trimerge☆11Mar 26, 2024Updated last year
- ☆12May 22, 2023Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆46Dec 13, 2025Updated 2 months ago
- A simple CLI command that initialises a Kedro project from an existing Python package☆11Aug 23, 2024Updated last year
- 쌤(SAM)! 도와주세요! 발표 및 블로그 글에 활용된 레포지토리예요.☆10Oct 28, 2023Updated 2 years ago
- my global CLAUDE.md☆19Aug 16, 2025Updated 5 months ago
- This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order statu…☆10Aug 15, 2025Updated 6 months ago
- ☆10Jan 24, 2023Updated 3 years ago
- End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)☆10May 26, 2023Updated 2 years ago
- ☆10Jan 31, 2021Updated 5 years ago
- This is the HTML-CSS source code to build my personal website.☆10Nov 13, 2025Updated 3 months ago
- Simple UI cli LLaMA Model Finetuning☆10Mar 23, 2023Updated 2 years ago
- Spark Streaming Checkpoint File Manager for MinIO☆11Apr 25, 2023Updated 2 years ago
- Data pipeline to build a data warehouse on Postgres☆14Aug 11, 2024Updated last year
- Basic mTLS example using NGINX and Node JS☆12Nov 30, 2023Updated 2 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 4 years ago
- Docker Apache Airflow 2☆10Jan 27, 2022Updated 4 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆15Jun 19, 2022Updated 3 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- automagically fixes simple flake8 lints☆15Jun 26, 2024Updated last year
- ☆12Apr 21, 2024Updated last year
- This is a very basic PoC for a graphical no-code builder that generates solidity smart contract code from a given blockly block.☆10Oct 22, 2020Updated 5 years ago
- Mongo Aggregation Builder☆43Oct 1, 2014Updated 11 years ago
- API for parse.com in node.js☆44Aug 16, 2016Updated 9 years ago
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!☆12Jul 6, 2023Updated 2 years ago