☆41 · Jan 24, 2023 · Updated 3 years ago
Alternatives and similar repositories for docker-spark-airflow
Users that are interested in docker-spark-airflow are comparing it to the libraries listed below.
- Docker with Airflow and Spark standalone cluster ☆265 · Aug 5, 2023 · Updated 2 years ago
- Doing SQL in notebooks. ☆15 · Aug 14, 2023 · Updated 2 years ago
- Dataproc Scala Examples is an effort to assist in the creation of Spark jobs written in Scala to run on Dataproc. ☆12 · Mar 26, 2026 · Updated 3 months ago
- Project with Airflow + Spark + MinIO + Postgres + Python3.8 ☆28 · Sep 9, 2022 · Updated 3 years ago
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker. ☆23 · May 11, 2024 · Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow ☆50 · Oct 9, 2022 · Updated 3 years ago
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab. ☆15 · Sep 3, 2021 · Updated 4 years ago
- ☆12 · Mar 17, 2022 · Updated 4 years ago
- ☆10 · Jul 27, 2021 · Updated 4 years ago
- Spark Standalone & Livy ☆11 · Jul 13, 2021 · Updated 4 years ago
- Guide on how to set up Apache Airflow containers using Docker and IBM Bluemix ☆11 · Feb 19, 2018 · Updated 8 years ago
- Geospatial Next.js app with DuckDB-Wasm ☆15 · May 24, 2023 · Updated 3 years ago
- Dockerizing and Consuming an Apache Livy environment ☆13 · Jun 29, 2022 · Updated 4 years ago
- SonarQube on Docker ☆14 · Nov 7, 2021 · Updated 4 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow. ☆52 · Dec 8, 2022 · Updated 3 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆38 · Mar 29, 2021 · Updated 5 years ago
- DNE4py is a python library that aims to run and visualize many different evolutionary algorithms with high performance using mpi4py. It a… ☆10 · Oct 13, 2020 · Updated 5 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆108 · May 26, 2026 · Updated last month
- Schedules a bot to send a message every day ☆15 · May 22, 2023 · Updated 3 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,… ☆11 · Jun 27, 2023 · Updated 3 years ago
- Autocomplete / Autofill Text field with Dropdown menu to choose between suggested values from a given list. ☆14 · Feb 23, 2024 · Updated 2 years ago
- Singapore Condo Rental Prices - From Data Acquisition to Prediction ☆14 · Feb 13, 2021 · Updated 5 years ago
- Generate cloud-init ready vm images via packer and deploy these via terraform. ☆16 · Jan 6, 2026 · Updated 5 months ago
- ☆24 · Apr 19, 2025 · Updated last year
- Typings for Confluent Kafka Python Client ☆27 · Apr 11, 2026 · Updated 2 months ago
- An example of running Testcontainer tests in CI pipelines. ☆19 · Apr 13, 2025 · Updated last year
- A script/docker that automatically translates PDFs using the DeepL API ☆13 · Jun 14, 2026 · Updated 2 weeks ago
- A boilerplate for authoring npm modules, with tests and linting. ☆10 · Jun 8, 2017 · Updated 9 years ago
- Example of how to leverage Apache Spark distributed capabilities to call REST-API using a UDF ☆51 · Oct 11, 2022 · Updated 3 years ago
- Data Analysis and Image Processing Python Course ☆12 · Nov 4, 2014 · Updated 11 years ago
- Music App based on Spotify API and Streamlit framework ☆16 · Oct 17, 2025 · Updated 8 months ago
- Wrapper on top of pino which provides integration with cls-hooked for better context in log messages ☆12 · Feb 11, 2022 · Updated 4 years ago
- A partially implemented ODBC driver for the Trino distributed SQL engine ☆20 · Jun 2, 2026 · Updated 3 weeks ago
- Data pipeline to build a data warehouse on Postgres ☆15 · Aug 11, 2024 · Updated last year
- Small configuration for the home server. ☆24 · Dec 27, 2022 · Updated 3 years ago
- Dask on ECS Fargate ☆14 · Sep 23, 2019 · Updated 6 years ago
- Selenium Grid in ECS using Fargate Spot Containers ☆14 · Feb 1, 2023 · Updated 3 years ago
- An MLflow Provider Package for Apache Airflow ☆26 · Oct 22, 2025 · Updated 8 months ago
- ☆13 · Jun 29, 2017 · Updated 8 years ago