glennklockwood / myhadoop
Framework for deploying Hadoop clusters on traditional HPC from userland
☆46Updated 7 years ago
Alternatives and similar repositories for myhadoop:
Users who are interested in myhadoop are comparing it to the repositories listed below
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆198Updated 2 months ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆30Updated 4 months ago
- gather and plot data about Slurm scheduling and job statistics☆51Updated 10 years ago
- Remote Spawner class for JupyterHub to spawn IPython notebooks on a remote server and tunnel the port via SSH☆26Updated 8 years ago
- Documented examples of JupyterHub deployment in HPC settings☆36Updated 10 months ago
- Custom Spawner for JupyterHub to start Slurm jobs when users log in☆24Updated 3 years ago
- Scalable dynamic library and Python loading in HPC environments☆100Updated this week
- Run Singularity containers on the Comet Supercomputer at San Diego Supercomputer Center☆18Updated 4 years ago
- ☆28Updated 5 years ago
- Introduction to Hadoop for those from an HPC simulation background☆31Updated 10 years ago
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 9 years ago
- Default Repo description from terraform module☆5Updated 5 years ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- ☆98Updated 6 months ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 5 years ago
- XALT: System tracking of users codes on clusters☆43Updated 3 weeks ago
- Parallel version of the Bash shell☆88Updated last year
- A tool to alleviate your Python package installation worries.☆20Updated last year
- Custom Spawner for JupyterHub to start servers in batch scheduled systems☆194Updated 2 weeks ago
- ☆59Updated 2 years ago
- Launching and controlling Spark on HPC clusters☆23Updated 2 years ago
- Deploy Dask using MPI4Py☆55Updated 3 weeks ago
- HPCPerfStats (formerly TACC Stats) is an automated resource-usage monitoring and analysis package.☆46Updated 2 weeks ago
- Unidata Science Gateway on the NSF Jetstream2 Cloud☆19Updated last week
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆110Updated 6 years ago
- Harvard Extensions for Lmod deployment☆28Updated last week
- An open framework for collecting and analyzing HPC metrics.☆88Updated this week
- Several utilities to aid in the use, administration, and management of PBS variants (including OpenPBS, PBS Pro, and TORQUE).☆25Updated 2 years ago
- High-level Python wrapper for the Sun Grid Engine (SGE) using DRMAA and ZMQ☆21Updated 11 years ago
- HDF5 Tutorial☆105Updated 10 years ago