LLNL / spark-hdf5
A plugin to enable Apache Spark to read HDF5 files
☆20Updated 7 years ago
Related projects: ⓘ
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 3 years ago
- launching and controlling spark on hpc clusters☆23Updated 2 years ago
- MPI-oriented extension of the Spark computational model☆24Updated 6 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Updated 2 years ago
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers.☆19Updated last year
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 6 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Updated 5 years ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated 3 months ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- Heterogeneity-incorporating Workflow ApplicationMaster for YARN☆26Updated 6 years ago
- ☆11Updated this week
- N-dimensional arrays, with Zarr and HDF5 integrations☆16Updated 5 years ago
- A composable framework for fast and scalable data analytics☆57Updated last year
- Jupyter extensions for SWAN☆58Updated this week
- New url: https://github.com/biointec/halvade☆19Updated 7 years ago
- Scientific Spark - a NASA AIST14 project☆83Updated 6 years ago
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆191Updated 9 months ago
- Java read and write example for Apache Arrow☆33Updated 6 years ago
- CWL on Kubernetes☆42Updated 4 months ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark.☆17Updated 7 years ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 5 years ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Updated 4 years ago
- API for converting JVM objects to representations by MIME type, for the Jupyter ecosystem.☆22Updated 4 years ago
- Scala implementation of Histogrammar, with optional front-ends and back-ends as separate Maven projects.☆15Updated 8 months ago
- Introduction to Hadoop for those from an HPC simulation background☆31Updated 9 years ago
- Spark GPU and SIMD Support☆61Updated 4 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Updated 7 years ago
- Deploy Dask on DRMAA clusters☆40Updated 3 years ago
- A Variant Caller, Distributed. Apache 2 licensed.☆71Updated 5 years ago
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 8 years ago