This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications.
☆37Jun 9, 2023Updated 3 years ago
Alternatives and similar repositories for spark-standalone-cluster
Users that are interested in spark-standalone-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆94Feb 4, 2025Updated last year
- Kotlin extensions / Interfaces that extends the Java/Scala implementation/implicits of Smile NLP. Basically a simplification for Kotlin (…☆14Mar 31, 2020Updated 6 years ago
- A lightweight, dependency-free setup for git & ZShell.☆20May 28, 2025Updated last year
- ☆12Oct 15, 2023Updated 2 years ago
- This project contain build end-to-end e-commerce data from data source into data warehouse and visualization.☆13Sep 5, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash☆26Nov 12, 2022Updated 3 years ago
- Explore tips and tricks to deploy machine learning models with Docker.☆13Jul 6, 2023Updated 2 years ago
- ☆24Dec 21, 2020Updated 5 years ago
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated last year
- Data Analysis Experiments☆12Nov 2, 2017Updated 8 years ago
- ☆25Mar 9, 2026Updated 3 months ago
- Predicting the Stock Market - Can we do it?☆10Jul 24, 2021Updated 4 years ago
- ☆20Aug 27, 2024Updated last year
- Data Vault 2.0: Code generation, Vertica, Airflow☆13Nov 20, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Para entender e aprender um pouco sobre o Apache Kafka.https://www.youtube.com/channel/UC3pevgVzUWKo5CoWdhDsoHw☆13Mar 10, 2026Updated 3 months ago
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆25Jun 11, 2026Updated last week
- Reading rosbag files in pure Rust☆14May 27, 2024Updated 2 years ago
- Agent Memory Playground: AI Agent Memory Design & Optimization Techniques☆38Aug 7, 2025Updated 10 months ago
- TypeScript GitHub Clone CLI☆11Jul 6, 2022Updated 3 years ago
- It is a assemble to include all Practice Projects about Big Data Topic, includes Hadoop, Spark, Spark Streaming and Kafka☆11Mar 7, 2019Updated 7 years ago
- ☆22Dec 19, 2025Updated 5 months ago
- Ferramenta que auxilia o usuário na criação de postagens semi automáticas☆10Jan 24, 2023Updated 3 years ago
- Open episode of the data engineering practice course☆32Jul 2, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- 🚀 A simple javascript template for rapid development of GitHub actions.☆17Feb 24, 2023Updated 3 years ago
- Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.☆169Feb 4, 2021Updated 5 years ago
- Fully reproducible, Dockerized, step-by-step, tutorial on how to mock a "real-time" Kafka data stream from a timestamped csv file. Detai…☆39Nov 15, 2021Updated 4 years ago
- Adaptation postgres adapter for Greenplum☆36Mar 7, 2024Updated 2 years ago
- Rust library to work with global positions and vectors☆16Mar 12, 2026Updated 3 months ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This is end-to-end-recommender-system repo has full recommender system implementation from collecting data, modeling and deploying machin…☆13May 19, 2021Updated 5 years ago
- An Ansible Role that manages installation and configuration of ClickHouse.☆21Aug 2, 2023Updated 2 years ago
- Withdraw All Your Linkedin Connect Invitation At Once With No Effort☆22May 13, 2020Updated 6 years ago
- coursera☆18Jun 23, 2016Updated 9 years ago
- Singer.io Target for Amazon Redshift - PipelineWise compatible☆12Sep 20, 2024Updated last year
- ☆10Jan 26, 2023Updated 3 years ago
- ☆12Oct 2, 2020Updated 5 years ago