glennklockwood / myhadoopLinks
Framework for deploying Hadoop clusters on traditional HPC from userland
☆45Updated 7 years ago
Alternatives and similar repositories for myhadoop
Users that are interested in myhadoop are comparing it to the libraries listed below
Sorting:
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆198Updated 9 months ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated last year
- Remote Spawner class for JupyterHub to spawn IPython notebooks and a remote server and tunnel the port via SSH☆26Updated 9 years ago
- Custom Spawner for Jupyterhub to start servers in batch scheduled systems☆201Updated 2 weeks ago
- launching and controlling spark on hpc clusters☆23Updated 3 years ago
- ☆60Updated 3 years ago
- Python-based viewer for HDF5 and other HDF5-like file formats☆132Updated 9 months ago
- Reference service implementation of the HDF5 REST API☆170Updated 3 years ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆33Updated 3 months ago
- Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX☆222Updated 5 years ago
- For interacting with nutch via Python☆29Updated last week
- Custom Spawner for Jupyterhub to start slurm jobs when users log in☆24Updated 3 years ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 6 years ago
- Specification and tools for representing HDF5 in JSON☆82Updated 3 weeks ago
- A Python library to describe abstract workflows for distributed data-intensive applications☆26Updated 6 years ago
- HDF5 Tutorial☆106Updated 11 years ago
- ☆100Updated last year
- files and instructions for creating and using example containers from the sylabs.io blog☆104Updated 2 years ago
- VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains de…☆104Updated 8 years ago
- Several utilities to aid in the use, administration, and management of PBS variants (including OpenPBS, PBS Pro, and TORQUE).☆25Updated 3 years ago
- HTCondor source repository, formerly the Condor Project☆305Updated last week
- h5py distributed - Python client library for HDF Rest API☆123Updated 3 weeks ago
- [DEPRECATED] Virtual large arrays and lazy evaluation.☆53Updated 8 years ago
- Create clusters of VMs on the cloud and configure them with Ansible.☆338Updated 2 years ago
- build and test recipes for conda☆328Updated 3 years ago
- Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.☆69Updated 9 years ago
- pelican-bibtex: Manage your academic publications page with Pelican and BibTeX☆52Updated 2 years ago
- Now hosted on GitLab.☆315Updated 6 months ago
- Shifter - Linux Containers for HPC☆362Updated 3 weeks ago
- ParaView Cinema☆11Updated 10 years ago