bloomberg / apache-spark-on-k8s
Apache Spark enhanced with native Kubernetes scheduler back-end
☆16 Updated last year
Alternatives and similar repositories for apache-spark-on-k8s:
Users who are interested in apache-spark-on-k8s are comparing it to the repositories listed below.
- DataHub on AWS demonstration resources ☆10 Updated last year
- ☆11 Updated 5 years ago
- event-triggered plugins for airflow ☆21 Updated 5 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 Updated this week
- Code examples for the Introduction to Kubeflow course ☆14 Updated 4 years ago
- Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes cluster ☆38 Updated 4 years ago
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Dremio Flight connector. Access Dremio using Arrow flight ☆40 Updated 4 years ago
- Ansible playbooks for Apache Spark on kube ☆27 Updated 7 years ago
- Examples for High Performance Spark ☆15 Updated 2 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆43 Updated 11 months ago
- Export Airflow metrics (from mysql) in prometheus format ☆29 Updated 2 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub ☆37 Updated 6 years ago
- Skeleton project for Apache Airflow training participants to work on. ☆16 Updated 4 years ago
- Data validation library for PySpark 3.0.0 ☆34 Updated 2 years ago
- A Helm Chart for Apache Airflow ☆14 Updated 6 years ago
- Testing Scala code with scalatest ☆12 Updated 2 years ago
- Example script to deploy DAGs to Google Cloud Composer. ☆15 Updated 2 years ago
- This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS … ☆19 Updated 3 years ago
- Cloud Spanner Connector for Apache Spark ☆17 Updated last week
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor… ☆66 Updated 10 months ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al. ☆20 Updated 2 months ago
- Open Source Secret Provider plugin for the Kafka Connect framework ☆46 Updated 6 months ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆22 Updated last year
- Documentation and resources for deploying JupyterHub on Hadoop ☆18 Updated 5 years ago
- ☆27 Updated 4 months ago
- ☆37 Updated 5 years ago