☆41Jan 24, 2023Updated 3 years ago
Alternatives and similar repositories for docker-spark-airflow
Users that are interested in docker-spark-airflow are comparing it to the libraries listed below.
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- Docker with Airflow and Spark standalone cluster☆264Aug 5, 2023Updated 2 years ago
- Doing SQL in notebooks.☆15Aug 14, 2023Updated 2 years ago
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆23May 11, 2024Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆49Oct 9, 2022Updated 3 years ago
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab.☆15Sep 3, 2021Updated 4 years ago
- Spark Standalone & Livy☆11Jul 13, 2021Updated 4 years ago
- Python code that will collapse structured columns separating out the attributes into new columns☆10Mar 15, 2022Updated 4 years ago
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- SonarQube on Docker☆14Nov 7, 2021Updated 4 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆52Dec 8, 2022Updated 3 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.☆38Mar 29, 2021Updated 5 years ago
- This is a boilerplate which has dependencies for pyspark(3.3.0) mongo(>4.x) connectivity☆10May 3, 2024Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆104Jun 7, 2024Updated last year
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- ☆10Nov 23, 2020Updated 5 years ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated 2 years ago
- Typings for Confluent Kafka Python Client☆27Apr 11, 2026Updated last month
- ☆19Feb 25, 2022Updated 4 years ago
- Cast Spotify to your Raspberry Pi via the browser!☆17Oct 19, 2014Updated 11 years ago
- ☆25Mar 15, 2024Updated 2 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆18Jun 19, 2022Updated 3 years ago
- Music App based on Spotify API and Streamlit framework☆17Oct 17, 2025Updated 7 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆49Apr 5, 2026Updated last month
- learning logstash and elastic search plugins☆21Jul 15, 2022Updated 3 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆18May 24, 2023Updated 2 years ago
- Data pipeline to build a data warehouse on Postgres☆15Aug 11, 2024Updated last year
- Dask on ECS Fargate☆14Sep 23, 2019Updated 6 years ago
- Selenium Grid in ECS using Fargate Spot Containers☆14Feb 1, 2023Updated 3 years ago
- An MLflow Provider Package for Apache Airflow☆26Oct 22, 2025Updated 6 months ago
- A simple CLI command that initialises a Kedro project from an existing Python package☆11Aug 23, 2024Updated last year
- ☆10Jan 24, 2023Updated 3 years ago
- All content about Master in Artificial Intelligence from UPC UB URV☆12Feb 15, 2023Updated 3 years ago
- The "World Data Report" is a Power BI project that offers a detailed overview of global data, covering weather, geographical, demographic…☆15Nov 30, 2025Updated 5 months ago
- ☆11Apr 9, 2017Updated 9 years ago
- Flights Search Application GRANDstack Experimental Implementation☆23Mar 4, 2023Updated 3 years ago
- This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, M…☆13Jul 7, 2024Updated last year
- A MOOC offered by the University of Helsinki. Course information can be found below☆10Jun 10, 2021Updated 4 years ago
- ☆11Mar 14, 2023Updated 3 years ago