glennklockwood / myhadoop
Framework for deploying Hadoop clusters on traditional HPC from userland
☆45Updated 7 years ago
Alternatives and similar repositories for myhadoop
Users who are interested in myhadoop are comparing it to the repositories listed below.
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆197Updated 5 months ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated last year
- Custom Spawner for Jupyterhub to start servers in batch scheduled systems☆197Updated last week
- Default Repo description from terraform module☆5Updated 5 years ago
- Remote Spawner class for JupyterHub to spawn IPython notebooks on a remote server and tunnel the port via SSH☆26Updated 8 years ago
- Create clusters of VMs on the cloud and configure them with Ansible.☆337Updated last year
- Python-based viewer for HDF5 and other HDF5-like file formats☆131Updated 5 months ago
- Launching and controlling Spark on HPC clusters☆23Updated 2 years ago
- Custom Spawner for Jupyterhub to start slurm jobs when users log in☆24Updated 3 years ago
- Several utilities to aid in the use, administration, and management of PBS variants (including OpenPBS, PBS Pro, and TORQUE).☆25Updated 2 years ago
- ☆60Updated 2 years ago
- Reference service implementation of the HDF5 REST API☆169Updated 2 years ago
- h5py distributed - Python client library for HDF Rest API☆121Updated 2 weeks ago
- HDF5 Tutorial☆105Updated 11 years ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 6 years ago
- VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains de…☆104Updated 7 years ago
- Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.☆69Updated 8 years ago
- Pegasus Workflow Management System - Automate, recover, and debug scientific computations.☆196Updated this week
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆32Updated 2 weeks ago
- ☆99Updated 9 months ago
- XALT: System tracking of users' codes on clusters☆45Updated 2 weeks ago
- HTCondor source repository, formerly the Condor Project☆299Updated this week
- [DEPRECATED] Virtual large arrays and lazy evaluation.☆53Updated 7 years ago
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 4 years ago
- StarCluster is an open source cluster-computing toolkit for Amazon's Elastic Compute Cloud (EC2).☆581Updated 3 years ago
- A scalable OpenMPI runtime container for Docker☆91Updated 4 years ago
- Docker Deployment of NERSC Jupyterhub (including auth and spawner modules)☆13Updated 4 years ago
- Introduction to Hadoop for those from an HPC simulation background☆31Updated 10 years ago
- Deploy Dask using MPI4Py☆55Updated 3 weeks ago
- Cloud-native, service based access to HDF data☆148Updated last week