Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks
☆24Apr 2, 2022Updated 4 years ago
Alternatives and similar repositories for docker-airflow-spark
Users that are interested in docker-airflow-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Docker with Airflow and Spark standalone cluster☆263Aug 5, 2023Updated 2 years ago
- ☆41Jan 24, 2023Updated 3 years ago
- Demo of using Airflow☆11Jun 24, 2022Updated 3 years ago
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆23May 11, 2024Updated last year
- Spark Standalone & Livy☆11Jul 13, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 2023 edition of #100daysofnetworks☆22Apr 5, 2026Updated last week
- Full Machine Learning Lifecycle using Airflow, MLflow, and AWS S3☆26Mar 28, 2023Updated 3 years ago
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- 65 Articles on SQL: A Comprehensive Guide to Mastering Advanced SQL☆11Jun 7, 2023Updated 2 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆16Mar 18, 2022Updated 4 years ago
- A demo instance of mage for pulling sample data from a public Google pub/sub topic and transforming with dbt.☆12Jan 5, 2024Updated 2 years ago
- AI enhanced automation tool for financial modelling and market analysis.☆12Sep 10, 2019Updated 6 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 5 years ago
- ☆10Feb 19, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆40Nov 4, 2023Updated 2 years ago
- Autocomplete / Autofill Text field with Dropdown menu to choose between suggested values from a given list.☆14Feb 23, 2024Updated 2 years ago
- A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)☆12May 2, 2021Updated 4 years ago
- Project with Airflow + Spark + MinIO + Postgres + Python3.8☆28Sep 9, 2022Updated 3 years ago
- An end-to-end workflow for processing streaming data on Azure.☆17Sep 20, 2024Updated last year
- In this notebook, we will create an AI and time serie driven forecasting engine based on a set of 5 AI models and 5 time series models an…☆14Jun 12, 2021Updated 4 years ago
- ☆18Nov 27, 2020Updated 5 years ago
- ☆16Jan 19, 2022Updated 4 years ago
- A script/docker that automatically translates PDFs using the DeepL API☆12Jan 18, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Python wrapper for the Open Brewery DB API☆16Mar 7, 2024Updated 2 years ago
- Shiny app for IFRS provisioning and estimated loss report☆10Jun 10, 2021Updated 4 years ago
- ☆14Dec 28, 2023Updated 2 years ago
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆16Jun 19, 2022Updated 3 years ago
- Data Analysis and Image Processing Python Course☆12Nov 4, 2014Updated 11 years ago
- A Python function for bootstrapping☆10Nov 5, 2019Updated 6 years ago
- A micro cluster lab to experiment Dask and Spark (Python and Scala) based on Docker☆16Mar 7, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repo for CDC with debezium blog post☆29Sep 15, 2024Updated last year
- НИС "Методологии разработки ПО", ФКН ВШЭ, Старичков Н.Ю., Крахмалёв Д.С.☆13Mar 12, 2022Updated 4 years ago
- Data pipeline to build a data warehouse on Postgres☆15Aug 11, 2024Updated last year
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Sep 4, 2023Updated 2 years ago
- An Excel integration of OpenGamma Strata.☆13Sep 19, 2021Updated 4 years ago
- Rock Solid Python with Type Hints Course Student Materials☆25Jul 8, 2024Updated last year
- The "World Data Report" is a Power BI project that offers a detailed overview of global data, covering weather, geographical, demographic…☆15Nov 30, 2025Updated 4 months ago