LLNL / spark-hdf5Links
A plugin to enable Apache Spark to read HDF5 files
☆20Updated 9 years ago
Alternatives and similar repositories for spark-hdf5
Users that are interested in spark-hdf5 are comparing it to the libraries listed below
Sorting:
- Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark☆42Updated 4 years ago
- MPI-oriented extension of the Spark computational model☆24Updated 7 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 8 years ago
- launching and controlling spark on hpc clusters☆23Updated 3 years ago
- Scientific Spark - a NASA AIST14 project☆86Updated 7 years ago
- Heterogeneity-incorporating Workflow ApplicationMaster for YARN☆26Updated 8 years ago
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk☆171Updated 7 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Updated 6 years ago
- Spark GPU and SIMD Support☆61Updated 5 years ago
- CWL on Kubernetes☆50Updated 2 months ago
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆198Updated 9 months ago
- A scala based DSL and framework for writing and executing bioinformatics pipelines as Directed Acyclic GRaphs☆69Updated 3 years ago
- Kira is an astronomy image processing toolkit implemented with Apache Spark.☆15Updated 9 years ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Updated 5 years ago
- N-dimensional arrays, with Zarr and HDF5 integrations☆19Updated 6 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Updated 3 years ago
- A composable framework for fast and scalable data analytics☆57Updated 2 years ago
- TeraSort for Spark and Flink which uses a range partitioner based on sampling☆22Updated 9 years ago
- Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …☆32Updated 2 years ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark.☆17Updated 8 years ago
- Java read and write example for Apache Arrow☆33Updated 7 years ago
- Mirror upstream conda channels☆72Updated 6 years ago
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated last year
- High performance HBase / Spark SQL engine☆28Updated 3 years ago
- Scala implementation of Histogrammar, with optional front-ends and back-ends as separate Maven projects.☆15Updated last year
- Spark Example using Phoenix to interact with HBase☆16Updated 9 years ago
- Automatic offload of user-written Spark kernels to accelerators☆18Updated 9 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Java JNI interface to the TileDB storage engine☆26Updated last month
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers.☆19Updated 7 months ago