le-oasis / docker-airflow-sparkView external linksLinks
Building a Modern Data Lake with Minio, Spark, Airflow via Docker.
☆23May 11, 2024Updated last year
Alternatives and similar repositories for docker-airflow-spark
Users that are interested in docker-airflow-spark are comparing it to the libraries listed below
Sorting:
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 3 years ago
- The simple ETL with docker container☆66May 30, 2025Updated 8 months ago
- ☆19Feb 25, 2022Updated 3 years ago
- Daily updated fake data for DBT learning and projects☆35Jan 7, 2024Updated 2 years ago
- A script/docker that automatically translates PDFs using the DeepL API☆11Jan 18, 2026Updated 3 weeks ago
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- ☆41Jan 24, 2023Updated 3 years ago
- Modern games store web application built with React and Spring☆11Dec 15, 2023Updated 2 years ago
- Data pipeline to build a data warehouse on Postgres☆14Aug 11, 2024Updated last year
- This is the HTML-CSS source code to build my personal website.☆10Nov 13, 2025Updated 3 months ago
- 👾 Repositório para armazenar todos os componentes referentes a aplicação Backend do projeto☆10Apr 27, 2024Updated last year
- Code for the paper: Kernel Distributionally Robust Optimization☆13Feb 21, 2021Updated 4 years ago
- node js http server☆10Jan 26, 2018Updated 8 years ago
- Data Analysis and Image Processing Python Course☆12Nov 4, 2014Updated 11 years ago
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- Roadmap for all those who want to get a kick start as Data Scientist.☆11Feb 2, 2022Updated 4 years ago
- the codes and some preliminary progress in the work of robust stochastic portfolio optimization☆11Oct 15, 2020Updated 5 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆15Jun 19, 2022Updated 3 years ago
- granadoespadav32 private server setup☆17Jan 24, 2024Updated 2 years ago
- ☆17Dec 12, 2025Updated 2 months ago
- Dash Mantine Components Theme Builder☆12Aug 9, 2025Updated 6 months ago
- Various utilities and info for King's Raid☆16Mar 30, 2021Updated 4 years ago
- This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, M…☆13Jul 7, 2024Updated last year
- Spark Standalone & Livy☆11Jul 13, 2021Updated 4 years ago
- Python tool for profiling-based anomaly monitoring on ETL data pipelines leveraging ML and Apache Spark.☆16Mar 5, 2024Updated last year
- A fast development template for Admin-dashboard based on Ext JS Classic toolkit☆10Jun 29, 2018Updated 7 years ago
- ☆13Dec 30, 2022Updated 3 years ago
- Accompanying code for our NeurIPS 2019 paper☆12Nov 7, 2019Updated 6 years ago
- ☆18Feb 2, 2023Updated 3 years ago
- 📚 Repositório para armazenar as documentações do projeto☆10Mar 12, 2024Updated last year
- The application provides a RESTful API that allows clients to upload files (pdf, csv, txt), generates a conversational retrieval model us…☆13Jul 8, 2023Updated 2 years ago
- Process manager and website for hosting multiple Streamlit apps☆14Jun 28, 2023Updated 2 years ago
- Codes for the paper "Data-Driven Sample Average Approximation with Covariate Information"☆13Aug 13, 2022Updated 3 years ago
- Template for creating and publishing simple (vanilla js, html and css) bidirectional streamlit component☆18Sep 13, 2022Updated 3 years ago
- ☆11Jun 7, 2023Updated 2 years ago
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3)☆16Sep 20, 2023Updated 2 years ago
- ETL to scrape a real estate website, process house prices and data, and build an ML model of the house prices.☆16Jul 11, 2022Updated 3 years ago
- Source files for our regularization paper!☆16May 11, 2019Updated 6 years ago
- Building a machine learning model to classify failures☆13Mar 20, 2024Updated last year