bloomberg / apache-spark-on-k8s
Apache Spark enhanced with native Kubernetes scheduler back-end
☆15 Updated 2 years ago
Alternatives and similar repositories for apache-spark-on-k8s
Users who are interested in apache-spark-on-k8s are comparing it to the libraries listed below:
- Export Airflow metrics (from mysql) in prometheus format ☆29 Updated 4 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆29 Updated last week
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes cluster ☆38 Updated 4 years ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆60 Updated 2 years ago
- Utility functions for dbt projects running on Spark ☆33 Updated 6 months ago
- ☆10 Updated 3 years ago
- Skeleton project for Apache Airflow training participants to work on. ☆17 Updated 5 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs … ☆158 Updated 2 years ago
- ☆28 Updated 11 months ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 Updated last year
- Data validation library for PySpark 3.0.0 ☆33 Updated 2 years ago
- This repository contains recipes for Apache Pinot. ☆30 Updated 6 months ago
- pysh-db - The Data Science Toolkit (DSK) ☆13 Updated 6 years ago
- Ansible playbooks for Apache Spark on kube ☆27 Updated 8 years ago
- Python package for querying iceberg data through duckdb. ☆70 Updated last year
- Documentation and resources for deploying JupyterHub on Hadoop ☆19 Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code ☆95 Updated 4 years ago
- Helm chart for deploying Apache Airflow in kubernetes ☆19 Updated 6 years ago
- Airflow workflow management platform chef cookbook. ☆71 Updated 6 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ☆28 Updated 2 years ago
- Prometheus Exporter for Airflow ☆161 Updated last year
- Dremio Flight connector. Access Dremio using Arrow flight ☆40 Updated 4 years ago
- Aiven's S3 Sink Connector for Apache Kafka® ☆71 Updated 11 months ago
- Fast iterative local development and testing of Apache Airflow workflows ☆202 Updated 2 weeks ago
- Airflow declarative DAGs via YAML ☆133 Updated last year
- ☆53 Updated 2 weeks ago
- Oozie Workflow to Airflow DAGs migration tool ☆87 Updated 5 months ago
- A Giter8 template for scio ☆31 Updated last week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆76 Updated this week