llnl / spark-hdf5Links
A plugin to enable Apache Spark to read HDF5 files
☆20Updated 9 years ago
Alternatives and similar repositories for spark-hdf5
Users that are interested in spark-hdf5 are comparing it to the libraries listed below
Sorting:
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 4 years ago
- MPI-oriented extension of the Spark computational model☆24Updated 7 years ago
- launching and controlling spark on hpc clusters☆23Updated 3 years ago
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆198Updated 10 months ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Updated 3 years ago
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers.☆19Updated 7 months ago
- Heterogeneity-incorporating Workflow ApplicationMaster for YARN☆26Updated 8 years ago
- Scientific Spark - a NASA AIST14 project☆86Updated 7 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 8 years ago
- A scala based DSL and framework for writing and executing bioinformatics pipelines as Directed Acyclic GRaphs☆69Updated 3 years ago
- API for converting JVM objects to representations by MIME type, for the Jupyter ecosystem.☆25Updated 5 years ago
- Some tests / examples for Open MPI's Java MPI bindings☆13Updated 7 years ago
- hanythingondemand provides a set of scripts to easily set up an ad-hoc Hadoop cluster through PBS jobs☆12Updated 6 years ago
- Kira is an astronomy image processing toolkit implemented with Apache Spark.☆15Updated 9 years ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark.☆17Updated 8 years ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated last year
- Spark GPU and SIMD Support☆61Updated 5 years ago
- Mirror upstream conda channels☆72Updated 6 years ago
- Utility for benchmarking changes in Spark using TPC-DS workloads☆16Updated 4 years ago
- Automatic offload of user-written Spark kernels to accelerators☆18Updated 9 years ago
- Jupyter plugin that provides a tab for TACC Lmod (https://github.com/TACC/Lmod)☆33Updated 4 months ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 7 years ago
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 9 years ago
- Bioinformatics for the Scala programming language☆112Updated 3 months ago
- An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in …☆18Updated 4 years ago
- Java read and write example for Apache Arrow☆34Updated 7 years ago
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆41Updated 10 months ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Updated 5 years ago
- N-dimensional arrays, with Zarr and HDF5 integrations☆19Updated 6 years ago