glennklockwood / myhadoopLinks
Framework for deploying Hadoop clusters on traditional HPC from userland
☆45Updated 7 years ago
Alternatives and similar repositories for myhadoop
Users that are interested in myhadoop are comparing it to the libraries listed below
Sorting:
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆198Updated 7 months ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated last year
- Python-based viewer for HDF5 and other HDF5-like file formats☆131Updated 8 months ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- HDF5 Tutorial☆106Updated 11 years ago
- Custom Spawner for Jupyterhub to start slurm jobs when users log in☆24Updated 3 years ago
- Custom Spawner for Jupyterhub to start servers in batch scheduled systems☆198Updated last week
- Remote Spawner class for JupyterHub to spawn IPython notebooks and a remote server and tunnel the port via SSH☆26Updated 9 years ago
- Specification and tools for representing HDF5 in JSON☆82Updated last week
- ☆60Updated 3 years ago
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 4 years ago
- The configuration and scripts for using spark within a Sun Grid Engine environment☆24Updated 11 years ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆33Updated 2 months ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 6 years ago
- ☆100Updated last year
- Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX☆221Updated 4 years ago
- HTCondor source repository, formerly the Condor Project☆301Updated this week
- launching and controlling spark on hpc clusters☆23Updated 3 years ago
- Execute scripts in their own temporary environment☆73Updated 6 years ago
- Run singularity containers on the Comet Supercomputer at San Diego Supercomputer Center☆18Updated 5 years ago
- h5py distributed - Python client library for HDF Rest API☆122Updated 3 weeks ago
- Create clusters of VMs on the cloud and configure them with Ansible.☆338Updated 2 years ago
- Rose is a toolkit for writing, editing and running application configurations.☆60Updated last week
- VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains de…☆105Updated 7 years ago
- Mirror upstream conda channels☆72Updated 6 years ago
- Scientific Spark - a NASA AIST14 project☆86Updated 7 years ago
- build and test recipes for conda☆328Updated 3 years ago
- Scalable dynamic library and python loading in HPC environments☆102Updated last week
- [DEPRECATED] Virtual large arrays and lazy evaluation.☆53Updated 8 years ago
- DockerSpawner with image selection☆15Updated 7 years ago