This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications.
☆38Jun 9, 2023Updated 2 years ago
Alternatives and similar repositories for spark-standalone-cluster
Users that are interested in spark-standalone-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆94Feb 4, 2025Updated last year
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆509Nov 7, 2025Updated 4 months ago
- ☆47Jul 4, 2023Updated 2 years ago
- This project contain build end-to-end e-commerce data from data source into data warehouse and visualization.☆13Sep 5, 2024Updated last year
- dbtVault + Greenplum demo☆11Feb 19, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated 10 months ago
- Data Analysis Experiments☆12Nov 2, 2017Updated 8 years ago
- Agent Memory Playground: AI Agent Memory Design & Optimization Techniques☆33Aug 7, 2025Updated 7 months ago
- ☆18Mar 9, 2026Updated 3 weeks ago
- Para entender e aprender um pouco sobre o Apache Kafka.https://www.youtube.com/channel/UC3pevgVzUWKo5CoWdhDsoHw☆13Mar 10, 2026Updated 2 weeks ago
- Generic TCP Server for Erlang applications☆11Apr 7, 2015Updated 10 years ago
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆24Updated this week
- End-to-end data engineer project☆22Aug 17, 2023Updated 2 years ago
- It is a assemble to include all Practice Projects about Big Data Topic, includes Hadoop, Spark, Spark Streaming and Kafka☆11Mar 7, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆22Dec 19, 2025Updated 3 months ago
- A platform that helps developers to better understand CSS through declaration interpretation and may even improve them through suggestion…☆14Jul 3, 2021Updated 4 years ago
- Ferramenta que auxilia o usuário na criação de postagens semi automáticas☆10Jan 24, 2023Updated 3 years ago
- Open episode of the data engineering practice course☆32Jul 2, 2024Updated last year
- ☆41Jul 4, 2022Updated 3 years ago
- 🎭 Natural language web automation with Puppeteer☆14Jun 16, 2024Updated last year
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- ☆31Dec 6, 2023Updated 2 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆11Nov 18, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- 🚀 A simple javascript template for rapid development of GitHub actions.☆17Feb 24, 2023Updated 3 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆172Feb 4, 2021Updated 5 years ago
- Fully reproducible, Dockerized, step-by-step, tutorial on how to mock a "real-time" Kafka data stream from a timestamped csv file. Detai…☆40Nov 15, 2021Updated 4 years ago
- Hadoop-Hive-Spark cluster + Jupyter on Docker☆86Jan 2, 2025Updated last year
- JusChat é um assistente jurídico inteligente baseado em tecnologia GraphRAG (Graph Retrieval Augmented Generation) que utiliza processame…☆21Jul 20, 2025Updated 8 months ago
- Adaptation postgres adapter for Greenplum☆36Mar 7, 2024Updated 2 years ago
- 🗃 Abre-te Código é um hackathon focado na expansão do acesso ao patrimônio cultural por meio do desenvolvimento de tecnologias a partir …☆11Oct 24, 2020Updated 5 years ago
- ☆11Dec 13, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- This is end-to-end-recommender-system repo has full recommender system implementation from collecting data, modeling and deploying machin…☆13May 19, 2021Updated 4 years ago
- Withdraw All Your Linkedin Connect Invitation At Once With No Effort☆22May 13, 2020Updated 5 years ago
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs☆20Jul 31, 2023Updated 2 years ago
- ☆124Mar 9, 2026Updated 2 weeks ago
- ☆11Jan 26, 2023Updated 3 years ago
- This repository has moved into https://github.com/dbt-labs/dbt-adapters☆51Feb 7, 2025Updated last year