moutai / sparkonda
Minimalistic utility library to manage conda environments for pyspark jobs on yarn clusters
☆10 Updated 3 years ago
Alternatives and similar repositories for sparkonda
Users who are interested in sparkonda are comparing it to the libraries listed below.
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆53 Updated 7 years ago
- Deploy dask-distributed on google container engine using kubernetes ☆40 Updated 6 years ago
- python parallel map on kubernetes ☆33 Updated 8 years ago
- Airflow plugin to transfer arbitrary files between operators ☆78 Updated 7 years ago
- Luigi Plugin for Hubot ☆36 Updated 9 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python ☆137 Updated 4 years ago
- This repo is deprecated. A spawner for JupyterHub ☆23 Updated 8 years ago
- Jupyter Notebook extension for Apache Spark integration ☆191 Updated 5 years ago
- Unified interface for local and distributed ndarrays ☆157 Updated 7 years ago
- Helpers & syntactic sugar for PySpark. ☆62 Updated last month
- CLI tool to launch Spark jobs on AWS EMR ☆67 Updated 2 years ago
- Utilities to work with Scala/Java code with py4j ☆40 Updated 2 years ago
- Functional Airflow DAG definitions. ☆38 Updated 8 years ago
- ☆32 Updated 5 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup. ☆47 Updated 5 years ago
- ☆146 Updated 9 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall ☆98 Updated 5 years ago
- A DockerSwarm Jupyterhub setup, which uses a NFS Server running in a Docker Container for persistent storage ☆20 Updated 7 years ago
- Parquet Command-line Tools ☆19 Updated 9 years ago
- An Ansible module for managing Python packages via Conda ☆55 Updated last year
- Extensible Python Framework for Apache Mesos ☆33 Updated 8 years ago
- Experimental docker-compose setup to bootstrap distributed on a docker-swarm cluster. ☆92 Updated 8 years ago
- Deploy Dask on Marathon ☆10 Updated 8 years ago
- An example project for doing grid search in MLlib ☆13 Updated 11 years ago
- The Scalding tutorial as a standalone SBT project ☆51 Updated 8 years ago
- ☆73 Updated 5 years ago
- An Apache Spark-shell backend for IPython ☆105 Updated 4 years ago
- Apache (Py)Spark type annotations (stub files). ☆118 Updated 3 years ago
- Export Airflow metrics (from mysql) in prometheus format ☆29 Updated 9 months ago
- Collection of dask example notebooks ☆57 Updated 7 years ago