This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications.
☆37Jun 9, 2023Updated 2 years ago
Alternatives and similar repositories for spark-standalone-cluster
Users that are interested in spark-standalone-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Kotlin extensions / Interfaces that extends the Java/Scala implementation/implicits of Smile NLP. Basically a simplification for Kotlin (…☆14Mar 31, 2020Updated 6 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆508Nov 7, 2025Updated 6 months ago
- ☆47Jul 4, 2023Updated 2 years ago
- dbtVault + Greenplum demo☆11Feb 19, 2024Updated 2 years ago
- Smart pet feeder using object detection and ESP32-cam☆27May 19, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆21Feb 9, 2022Updated 4 years ago
- ☆23Mar 9, 2026Updated 2 months ago
- Predicting the Stock Market - Can we do it?☆10Jul 24, 2021Updated 4 years ago
- Escrevi este roadmap para ajudar amigos próximos, está aberto a sugestões!☆14Sep 9, 2025Updated 8 months ago
- ☆16Sep 6, 2023Updated 2 years ago
- ☆21Mar 9, 2026Updated 2 months ago
- ☆21Oct 9, 2025Updated 7 months ago
- ☆15Feb 15, 2023Updated 3 years ago
- Para entender e aprender um pouco sobre o Apache Kafka.https://www.youtube.com/channel/UC3pevgVzUWKo5CoWdhDsoHw☆13Mar 10, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆25May 8, 2026Updated 3 weeks ago
- End-to-end data engineer project☆24Aug 17, 2023Updated 2 years ago
- Agent Memory Playground: AI Agent Memory Design & Optimization Techniques☆38Aug 7, 2025Updated 9 months ago
- TypeScript GitHub Clone CLI☆11Jul 6, 2022Updated 3 years ago
- It is a assemble to include all Practice Projects about Big Data Topic, includes Hadoop, Spark, Spark Streaming and Kafka☆11Mar 7, 2019Updated 7 years ago
- A platform that helps developers to better understand CSS through declaration interpretation and may even improve them through suggestion…☆14Jul 3, 2021Updated 4 years ago
- Ferramenta que auxilia o usuário na criação de postagens semi automáticas☆10Jan 24, 2023Updated 3 years ago
- Open episode of the data engineering practice course☆32Jul 2, 2024Updated last year
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- 🚀 A simple javascript template for rapid development of GitHub actions.☆17Feb 24, 2023Updated 3 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆169Feb 4, 2021Updated 5 years ago
- Lazy iterables in erlang☆12Sep 22, 2016Updated 9 years ago
- Fully reproducible, Dockerized, step-by-step, tutorial on how to mock a "real-time" Kafka data stream from a timestamped csv file. Detai…☆39Nov 15, 2021Updated 4 years ago
- Bi-directional SMS gateway with pluggable providers☆14May 24, 2019Updated 7 years ago
- Adaptation postgres adapter for Greenplum☆36Mar 7, 2024Updated 2 years ago
- Rust library to work with global positions and vectors☆16Mar 12, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆11Dec 13, 2022Updated 3 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆76Sep 8, 2021Updated 4 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 5 months ago
- Boilerplate for Famo.us/Angular apps wrapped with Cordova☆13Nov 8, 2014Updated 11 years ago
- This is end-to-end-recommender-system repo has full recommender system implementation from collecting data, modeling and deploying machin…☆13May 19, 2021Updated 5 years ago
- A Fail-over Name Resolver☆51Oct 13, 2020Updated 5 years ago
- Project from the CTU Big Data course which purpose was to compute tf-idf values for the czech wikipedia☆10Jul 8, 2014Updated 11 years ago