glennklockwood / myhadoopLinks
Framework for deploying Hadoop clusters on traditional HPC from userland
☆45Updated 8 years ago
Alternatives and similar repositories for myhadoop
Users that are interested in myhadoop are comparing it to the libraries listed below
Sorting:
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆196Updated 11 months ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated last year
- Custom Spawner for Jupyterhub to start servers in batch scheduled systems☆204Updated 3 weeks ago
- Reference service implementation of the HDF5 REST API☆170Updated 3 years ago
- Python-based viewer for HDF5 and other HDF5-like file formats☆132Updated 11 months ago
- Remote Spawner class for JupyterHub to spawn IPython notebooks and a remote server and tunnel the port via SSH☆26Updated 9 years ago
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 4 years ago
- Custom Spawner for Jupyterhub to start slurm jobs when users log in☆24Updated 3 years ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 6 years ago
- h5py distributed - Python client library for HDF Rest API☆123Updated 3 weeks ago
- HDF5 Tutorial☆105Updated 11 years ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆33Updated 6 months ago
- launching and controlling spark on hpc clusters☆23Updated 3 years ago
- ☆102Updated last year
- VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains de…☆104Updated 8 years ago
- A Python library to describe abstract workflows for distributed data-intensive applications☆27Updated 6 years ago
- files and instructions for creating and using example containers from the sylabs.io blog☆104Updated 2 years ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 9 years ago
- XALT: System tracking of users codes on clusters☆47Updated last week
- Create clusters of VMs on the cloud and configure them with Ansible.☆340Updated 2 years ago
- Scientific Spark - a NASA AIST14 project☆86Updated 7 years ago
- ☆60Updated 3 years ago
- The configuration and scripts for using spark within a Sun Grid Engine environment☆24Updated 11 years ago
- Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX☆222Updated 5 years ago
- Slurm on Google Cloud Platform☆190Updated last year
- DockerSpawner with image selection☆15Updated 8 years ago
- Run singularity containers on the Comet Supercomputer at San Diego Supercomputer Center☆18Updated 5 years ago
- Specification and tools for representing HDF5 in JSON☆82Updated last week
- StarCluster is an open source cluster-computing toolkit for Amazon's Elastic Compute Cloud (EC2).☆583Updated 3 years ago