rokroskar / sparkhpc
launching and controlling spark on hpc clusters
☆23Updated 2 years ago
Alternatives and similar repositories for sparkhpc:
Users that are interested in sparkhpc are comparing it to the libraries listed below
- MPI-oriented extension of the Spark computational model☆24Updated 6 years ago
- A plugin to enable Apache Spark to read HDF5 files☆20Updated 8 years ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆30Updated last month
- XALT: System tracking of users codes on clusters☆43Updated last month
- Slurm Simulator: Slurm Modification to Enable its Simulation☆32Updated 11 months ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- Deploy Dask using MPI4Py☆52Updated 3 months ago
- Collection of tutorials for using Shifter to bring containers to HPC☆26Updated 5 years ago
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆196Updated last year
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- Introduction to Hadoop for those from an HPC simulation background☆31Updated 9 years ago
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 8 years ago
- Deploy Dask on DRMAA clusters☆40Updated 3 years ago
- Run singularity containers on the Comet Supercomputer at San Diego Supercomputer Center☆18Updated 4 years ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 5 years ago
- Framework for deploying Hadoop clusters on traditional HPC from userland☆46Updated 7 years ago
- ☆97Updated 3 months ago
- Tool for running/managing ad hoc spark clusters on a Slurm cluster☆22Updated 2 years ago
- Python Tools for the POP Metrics☆12Updated 2 years ago
- Analyze graph/hierarchical performance data using pandas dataframes☆109Updated 2 months ago
- Apache Spark Data Source for ROOT File Format☆29Updated 5 years ago
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 3 years ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆19Updated last month
- Intel HPC Containers using Singularity☆19Updated 2 years ago
- Moved to https://github.com/project-alchemist/Alchemist☆12Updated 6 years ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated 7 months ago
- cotainr - a user space Apptainer/Singularity container builder.☆21Updated last week
- Scalable dynamic library and python loading in HPC environments☆99Updated last month
- MPI Library Memory Consumption Utilities☆18Updated last year
- Keras tutorial code for the SC18 tutorial on Deep Learning at Scale☆12Updated 6 years ago