Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apache/spark/
☆610 · Jan 8, 2020 · Updated 6 years ago
Alternatives and similar repositories for spark
Users that are interested in spark are comparing it to the libraries listed below.
- Repository holding configuration files for running an HDFS cluster in Kubernetes · ☆398 · Sep 25, 2024 · Updated last year
- Ansible playbooks for Apache Spark on kube · ☆27 · Jul 20, 2017 · Updated 8 years ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes. · ☆3,125 · May 20, 2026 · Updated last week
- [EOL] Image build contents for Kubernetes applications. · ☆47 · Jan 22, 2018 · Updated 8 years ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes · ☆179 · Apr 23, 2023 · Updated 3 years ago
- Running YARN on Kubernetes with PetSet controller. · ☆165 · Feb 22, 2018 · Updated 8 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere · ☆1,008 · Oct 5, 2022 · Updated 3 years ago
- Web console for a spark cluster management app · ☆28 · Feb 16, 2026 · Updated 3 months ago
- A Kafka Operator for Kubernetes · ☆294 · Oct 28, 2018 · Updated 7 years ago
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC · ☆1,093 · May 22, 2023 · Updated 3 years ago
- ☆73 · Nov 17, 2021 · Updated 4 years ago
- REST job server for Apache Spark · ☆2,843 · Mar 3, 2026 · Updated 2 months ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… · ☆1,034 · Nov 21, 2022 · Updated 3 years ago
- Apache Spark enhanced with native Kubernetes scheduler back-end · ☆15 · Aug 21, 2023 · Updated 2 years ago
- Operator for managing the Spark clusters on Kubernetes and OpenShift. · ☆159 · Nov 18, 2021 · Updated 4 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. · ☆3,852 · Jul 10, 2023 · Updated 2 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes · ☆84 · Mar 16, 2020 · Updated 6 years ago
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes · ☆2,105 · Updated this week
- Machine Learning Toolkit for Kubernetes · ☆15,654 · Updated this week
- Kubernetes spawner for JupyterHub · ☆601 · May 4, 2026 · Updated 3 weeks ago
- [EOL] This is a place for various components in the Kubernetes ecosystem that aren't part of the Kubernetes core. · ☆2,443 · Apr 17, 2019 · Updated 7 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework · ☆26 · Jul 12, 2019 · Updated 6 years ago
- Base classes to use when writing tests with Spark · ☆1,554 · Apr 20, 2026 · Updated last month
- Alluxio, data orchestration for analytics and machine learning in the cloud · ☆7,198 · Apr 29, 2025 · Updated last year
- Apache Spark on Kubernetes · ☆19 · Mar 19, 2017 · Updated 9 years ago
- [EOL] A Firmament-based Kubernetes scheduler · ☆406 · Jul 19, 2021 · Updated 4 years ago
- [EOL] Compute Resource Usage Analysis and Monitoring of Container Clusters · ☆2,639 · Nov 30, 2018 · Updated 7 years ago
- Mirror of Apache Toree (Incubating) · ☆750 · May 22, 2026 · Updated last week
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. · ☆954 · May 19, 2026 · Updated last week
- Virtual Kubelet is an open source Kubernetes kubelet implementation. · ☆4,511 · Apr 27, 2026 · Updated last month
- Apache Spark - A unified analytics engine for large-scale data processing · ☆43,311 · May 21, 2026 · Updated last week
- ☆41 · Jul 27, 2015 · Updated 10 years ago
- Drizzle integration with Apache Spark · ☆120 · Sep 11, 2018 · Updated 7 years ago
- Consume services in Kubernetes using the Open Service Broker API · ☆1,038 · Feb 1, 2022 · Updated 4 years ago
- Apache YuniKorn Core · ☆1,013 · Updated this week
- A Cloud Native Batch System (Project under CNCF) · ☆5,594 · Updated this week
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall · ☆99 · Jul 12, 2020 · Updated 5 years ago
- Serverless proxy for Spark cluster · ☆325 · Apr 13, 2026 · Updated last month
- Fast I/O plugins for Spark · ☆41 · Dec 14, 2020 · Updated 5 years ago