valiantljk / h5sparkLinks
Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark
☆42Updated 4 years ago
Alternatives and similar repositories for h5spark
Users that are interested in h5spark are comparing it to the libraries listed below
Sorting:
- A plugin to enable Apache Spark to read HDF5 files☆20Updated 8 years ago
- Scientific Spark - a NASA AIST14 project☆85Updated 7 years ago
- MPI-oriented extension of the Spark computational model☆24Updated 7 years ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Updated 9 years ago
- Scala implementation of Histogrammar, with optional front-ends and back-ends as separate Maven projects.☆15Updated last year
- Utilities to work with Scala/Java code with py4j☆40Updated last year
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers.☆19Updated 2 months ago
- Integration code to enable Hadoop processing of data in NetCDF format☆31Updated 12 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- ☆92Updated 5 years ago
- Spark GPU and SIMD Support☆61Updated 4 years ago
- Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …☆30Updated 2 years ago
- Automatic offload of user-written Spark kernels to accelerators☆18Updated 8 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14Updated 9 years ago
- analytics tool kit☆43Updated 8 years ago
- [RETIRED] Jupyter Declarative Widget Extension☆120Updated 7 years ago
- Unified interface for local and distributed ndarrays☆157Updated 6 years ago
- Perform high-speed calculations on columnar data without creating intermediate objects.☆81Updated 6 years ago
- Functional, Typesafe, Declarative Data Pipelines☆139Updated 7 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Scala bindings for Bokeh plotting library☆136Updated last year
- A primal-dual framework for distributed L1-regularized optimization☆36Updated 9 years ago
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆197Updated 4 months ago
- launching and controlling spark on hpc clusters☆23Updated 2 years ago
- Support for operating on images via Apache Spark☆26Updated 2 years ago
- Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk☆171Updated 6 years ago
- ☆21Updated 9 years ago
- Deploy dask on YARN clusters☆69Updated 10 months ago