yTek01 / docker-spark-airflowView external linksLinks
☆41Jan 24, 2023Updated 3 years ago
Alternatives and similar repositories for docker-spark-airflow
Users that are interested in docker-spark-airflow are comparing it to the libraries listed below
Sorting:
- Docker with Airflow and Spark standalone cluster☆262Aug 5, 2023Updated 2 years ago
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab.☆15Sep 3, 2021Updated 4 years ago
- This is a boilerplate which has dependencies for pyspark(3.3.0) mongo(>4.x) connectivity☆10May 3, 2024Updated last year
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆23May 11, 2024Updated last year
- Project with Airflow + Spark + MinIO + Postgres + Python3.8☆28Sep 9, 2022Updated 3 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆51Dec 8, 2022Updated 3 years ago
- ☆19Feb 25, 2022Updated 3 years ago
- A partially implemented ODBC driver for the Trino distributed SQL engine☆18Feb 2, 2026Updated last week
- Jupyter notebooks for the teaching of mechanics☆11Oct 8, 2024Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆98Jun 7, 2024Updated last year
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- ☆15Apr 1, 2025Updated 10 months ago
- ☆13Nov 12, 2022Updated 3 years ago
- ☆12May 22, 2023Updated 2 years ago
- Merge media queries (@media), @supports, and other duplicate At-rules together under a single block.☆10Aug 25, 2025Updated 5 months ago
- Python implementation of binary max-heaps.☆11Mar 22, 2020Updated 5 years ago
- 쌤(SAM)! 도와주세요! 발표 및 블로그 글에 활용된 레포지토리예요.☆10Oct 28, 2023Updated 2 years ago
- Spark Streaming Checkpoint File Manager for MinIO☆11Apr 25, 2023Updated 2 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 4 years ago
- All content about Master in Artificial Intelligence from UPC UB URV☆12Feb 15, 2023Updated 3 years ago
- Python implementation of EIP 1577 content hash☆16Dec 24, 2023Updated 2 years ago
- ☆25Jun 27, 2025Updated 7 months ago
- Demo of using Airflow☆11Jun 24, 2022Updated 3 years ago
- Easier navigation between production and test files in vim.☆11Mar 16, 2017Updated 8 years ago
- Data pipeline to build a data warehouse on Postgres☆14Aug 11, 2024Updated last year
- ☆10Jan 31, 2021Updated 5 years ago
- Data Analysis and Image Processing Python Course☆12Nov 4, 2014Updated 11 years ago
- This is the HTML-CSS source code to build my personal website.☆10Nov 13, 2025Updated 3 months ago
- Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club☆56Mar 10, 2025Updated 11 months ago
- Automatic inline image toggling as the cursor enters and exits them☆17Apr 18, 2023Updated 2 years ago
- Singapore Condo Rental Prices - From Data Acquisition to Prediction☆14Feb 13, 2021Updated 5 years ago
- DNE4py is a python library that aims to run and visualize many different evolutionary algorithms with high performance using mpi4py. It a…☆10Oct 13, 2020Updated 5 years ago
- AutoML 2024: HPOD: Hyperparameter Optimization for Unsupervised Outlier Detection☆12Jul 12, 2024Updated last year
- This is a very basic PoC for a graphical no-code builder that generates solidity smart contract code from a given blockly block.☆10Oct 22, 2020Updated 5 years ago
- Random experiments in C☆15Sep 30, 2020Updated 5 years ago
- Wrapper on top of pino which provides integration with cls-hooked for better context in log messages☆12Feb 11, 2022Updated 4 years ago
- Docker Apache Airflow 2☆10Jan 27, 2022Updated 4 years ago
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!☆12Jul 6, 2023Updated 2 years ago
- ☆13Jun 21, 2021Updated 4 years ago