rokroskar / sparkhpc
launching and controlling spark on hpc clusters
☆23Updated 2 years ago
Alternatives and similar repositories for sparkhpc:
Users that are interested in sparkhpc are comparing it to the libraries listed below
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆197Updated 2 months ago
- A plugin to enable Apache Spark to read HDF5 files☆20Updated 8 years ago
- Introduction to Hadoop for those from an HPC simulation background☆31Updated 10 years ago
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 4 years ago
- MPI-oriented extension of the Spark computational model☆24Updated 6 years ago
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 9 years ago
- Tool for running/managing ad hoc spark clusters on a Slurm cluster☆25Updated 2 years ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆30Updated 4 months ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 5 years ago
- Deploy Dask using MPI4Py☆55Updated last month
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated 10 months ago
- Run singularity containers on the Comet Supercomputer at San Diego Supercomputer Center☆18Updated 4 years ago
- General purpose, language-agnostic Continuous Benchmarking (CB) framework☆35Updated 5 years ago
- Framework for deploying Hadoop clusters on traditional HPC from userland☆45Updated 7 years ago
- Collection of tutorials for using Shifter to bring containers to HPC☆26Updated 6 years ago
- A composable framework for fast and scalable data analytics☆57Updated 2 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆102Updated last month
- Analyze graph/hierarchical performance data using pandas dataframes☆114Updated 3 months ago
- XALT: System tracking of users codes on clusters☆44Updated last month
- A simple utility for executing multiple sequential or multi-threaded applications in a single multi-node batch job☆61Updated last year
- Slurm Simulator: Slurm Modification to Enable its Simulation☆33Updated last year
- MPI Testing Tool☆63Updated 4 months ago
- ☆24Updated last year
- Scalable dynamic library and python loading in HPC environments☆100Updated 2 weeks ago
- Slurm Docker Container on CentOS 7☆88Updated last year
- MPI Library Memory Consumption Utilities☆18Updated 2 years ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago