valiantljk / h5spark
Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark
☆42Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for h5spark
- A plugin to enable Apache Spark to read HDF5 files☆20Updated 8 years ago
- Scientific Spark - a NASA AIST14 project☆83Updated 6 years ago
- MPI-oriented extension of the Spark computational model☆24Updated 6 years ago
- Utilities to work with Scala/Java code with py4j☆40Updated 10 months ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers.☆19Updated last year
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- launching and controlling spark on hpc clusters☆23Updated 2 years ago
- Integration code to enable Hadoop processing of data in NetCDF format☆31Updated 11 years ago
- Automatic offload of user-written Spark kernels to accelerators☆18Updated 8 years ago
- Scala implementation of Histogrammar, with optional front-ends and back-ends as separate Maven projects.☆15Updated 10 months ago
- A composable framework for fast and scalable data analytics☆57Updated last year
- Mirror upstream conda channels☆72Updated 5 years ago
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆196Updated 11 months ago
- Heterogeneity-incorporating Workflow ApplicationMaster for YARN☆26Updated 7 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Updated 8 years ago
- Deploy Dask on DRMAA clusters☆40Updated 3 years ago
- Deploy dask-distributed on google container engine using kubernetes☆40Updated 5 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Updated 5 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Updated 2 years ago
- Spark GPU and SIMD Support☆61Updated 4 years ago
- TileDB☆80Updated last year
- Cascading on Apache Flink®☆54Updated 9 months ago
- Functional, Typesafe, Declarative Data Pipelines☆139Updated 6 years ago
- Bounded-memory serverless distributed N-dimensional array processing☆122Updated this week
- N-dimensional arrays, with Zarr and HDF5 integrations☆16Updated 5 years ago
- ☆93Updated 4 years ago
- analytics tool kit☆43Updated 7 years ago