swan-cern / sparkmonitor
An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks
☆46Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for sparkmonitor
- Jupyter extensions for SWAN☆58Updated 2 weeks ago
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook☆92Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆42Updated 9 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated 3 weeks ago
- A tool and library for easily deploying applications on Apache YARN☆142Updated 7 months ago
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆84Updated 2 weeks ago
- A Delta Lake reader for Dask☆46Updated last month
- Deploy dask on YARN clusters☆69Updated 3 months ago
- Read Delta tables without any Spark☆47Updated 8 months ago
- Monitor Apache Spark from Jupyter Notebook☆172Updated 2 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆52Updated 2 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 2 years ago
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers.☆19Updated last year
- Unity Catalog UI☆39Updated 2 months ago
- pytest plugin to run the tests with support of pyspark☆85Updated 8 months ago
- Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status.☆52Updated 2 years ago
- Spark SQL magic command for Jupyter notebooks☆34Updated 3 years ago
- ☆11Updated 5 years ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆111Updated 2 months ago
- ☆54Updated 10 months ago
- ☆30Updated 2 years ago
- Docker images for dask☆231Updated this week
- A Table format agnostic data sharing framework☆38Updated 9 months ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- MLflow-tracking server example with Minio and H2O☆18Updated 5 years ago
- Dask integration for Snowflake☆30Updated 4 months ago
- Apache (Py)Spark type annotations (stub files).☆115Updated 2 years ago
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 4 years ago