rokroskar / sparkhpc
launching and controlling spark on hpc clusters
☆23Updated 2 years ago
Alternatives and similar repositories for sparkhpc:
Users that are interested in sparkhpc are comparing it to the libraries listed below
- MPI-oriented extension of the Spark computational model☆24Updated 6 years ago
- Framework for deploying Hadoop clusters on traditional HPC from userland☆46Updated 7 years ago
- A plugin to enable Apache Spark to read HDF5 files☆20Updated 8 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- Introduction to Hadoop for those from an HPC simulation background☆31Updated 10 years ago
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆198Updated last month
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 5 years ago
- Deploy Dask using MPI4Py☆53Updated 3 weeks ago
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 4 years ago
- Apache Spark Data Source for ROOT File Format☆29Updated 5 years ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆30Updated 3 months ago
- XALT: System tracking of users codes on clusters☆43Updated last month
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 9 years ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated 9 months ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Updated 2 years ago
- Moved to https://github.com/project-alchemist/Alchemist☆12Updated 7 years ago
- Slurm Docker Container on CentOS 7☆88Updated 11 months ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- A composable framework for fast and scalable data analytics☆57Updated 2 years ago
- A simple utility for executing multiple sequential or multi-threaded applications in a single multi-node batch job☆62Updated last year
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Updated 8 years ago
- Collection of tutorials for using Shifter to bring containers to HPC☆26Updated 6 years ago
- Slurm Simulator: Slurm Modification to Enable its Simulation☆32Updated last year
- Tool for running/managing ad hoc spark clusters on a Slurm cluster☆23Updated 2 years ago
- Run singularity containers on the Comet Supercomputer at San Diego Supercomputer Center☆18Updated 4 years ago
- ☆58Updated 2 years ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆20Updated last month
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆102Updated last week
- ☆92Updated 5 years ago