spark on kubernetes
☆104Feb 20, 2023Updated 3 years ago
Alternatives and similar repositories for spark-kubernetes
Users that are interested in spark-kubernetes are comparing it to the libraries listed below
- Deploy your Spark Production Cluster on Kubernetes☆46Sep 13, 2020Updated 5 years ago
- Data validation library for PySpark 3.0.0☆33Nov 11, 2022Updated 3 years ago
- Apache Spark with HDFS cluster within Kubernetes☆11Jul 11, 2023Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆509Nov 7, 2025Updated 3 months ago
- Helm Charts to Deploy Apache Drill on Kubernetes☆17Jan 5, 2024Updated 2 years ago
- 🔌 Flask S3Viewer is a powerful extension that makes it easy to browse S3 in any Flask application. (Python S3 Uploader / Flask S3 Upload…☆14Jan 8, 2025Updated last year
- How to manage Slowly Changing Dimensions with Apache Hive☆55Aug 27, 2019Updated 6 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines☆135Nov 4, 2022Updated 3 years ago
- Image building contents for running Spark standalone on Kubernetes☆16Apr 10, 2020Updated 5 years ago
- Spark on Kubernetes using Helm☆33Jun 9, 2020Updated 5 years ago
- AppDynamics Apache Hadoop Monitoring Extension☆23Oct 3, 2024Updated last year
- A Flink application that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Jul 23, 2023Updated 2 years ago
- Spark on Kubernetes infrastructure Helm charts repo☆202Oct 20, 2022Updated 3 years ago
- This repo contains sample code and sample notebooks to illustrate how to work with Amazon FinSpace☆21Feb 12, 2025Updated last year
- The Internals of Spark on Kubernetes☆73May 9, 2022Updated 3 years ago
- Big Data search with Spark and Lucene☆18Dec 15, 2023Updated 2 years ago
- Tutorial on experiment tracking and reproducibility for Machine Learning projects with DVC☆17Dec 8, 2022Updated 3 years ago
- Following along with the Hive tutorial at StrataConf / HadoopWorld☆22Mar 22, 2019Updated 6 years ago
- The simplest Spark SQL on Kubernetes production deployment solution☆19Jun 12, 2023Updated 2 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆84Mar 16, 2020Updated 5 years ago
- Apache Spark docker image☆2,058Apr 21, 2023Updated 2 years ago
- Predicting Car Prices with FastAPI, Streamlit, MLflow, Kafka, and Debezium: A Practical Demonstration☆23Nov 12, 2024Updated last year
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Jan 5, 2023Updated 3 years ago
- A Pokémon themed progress bar for IntelliJ IDEA.☆27Aug 24, 2023Updated 2 years ago
- Oh you know, just a coupla, two, tree Kafka Streams in Scala☆24Feb 19, 2021Updated 5 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟☆53Jan 8, 2022Updated 4 years ago
- A simple Spark standalone cluster for your testing environment purposes☆568Mar 6, 2024Updated last year
- ☆20Feb 28, 2018Updated 8 years ago
- Document and showcase how you can create Spark Applications which run inside Docker Containers using Apache Mesos.☆28Feb 25, 2016Updated 10 years ago
- Repo for all my code on the articles I post on medium☆106Oct 21, 2022Updated 3 years ago
- Instantly modernize your Java SWT and Eclipse RCP applications with this drop-in library.☆25Updated this week
- Data sets and ML models versioning example from DVC get started☆10Jun 4, 2024Updated last year
- 📦 Starting box for Vagrant. Inside box Ubuntu 20.04 LTS with Git, Docker and Docker compose.☆19May 5, 2022Updated 3 years ago
- Adelic p-adic Dark Matter☆13Feb 15, 2026Updated 2 weeks ago
- AWS LocalStack + Spark Cluster + Zeppelin [Docker]☆10Jul 6, 2022Updated 3 years ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Task pipelining for taskiq☆43Feb 12, 2026Updated 2 weeks ago
- Complete reimplementation of the old (and deprecated) catify core process engine based on akka.io and neo4j.☆30Jun 7, 2014Updated 11 years ago
- Example repo to kickstart integration with mlflow pipelines.☆77Nov 14, 2022Updated 3 years ago