project-alchemist / Alchemist
An HPC Interface for data analysis platforms
☆23 Updated 5 years ago
Alternatives and similar repositories for Alchemist
Users who are interested in Alchemist are comparing it to the libraries listed below.
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples ☆70 Updated 4 years ago
- Spark Shuffle Optimization with RDMA+AEP ☆30 Updated 2 years ago
- Alchemist: an Apache Spark<->MPI interface ☆26 Updated 7 years ago
- A library for exporting Spark ML models and pipelines to PFA ☆54 Updated 6 years ago
- A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer ☆50 Updated last year
- Fast I/O plugins for Spark ☆41 Updated 4 years ago
- Albis: High-Performance File Format for Big Data Systems ☆21 Updated 7 years ago
- A composable framework for fast and scalable data analytics ☆57 Updated 2 years ago
- Routines and data structures for using isarn-sketches idiomatically in Apache Spark ☆29 Updated last year
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments. ☆24 Updated 9 years ago
- Automatic offload of user-written Spark kernels to accelerators ☆18 Updated 8 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- Stocator is a high-performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics. ☆114 Updated last year
- Cluster computing using Stateful Dataflow Graphs ☆26 Updated 2 years ago
- Parquet file generator ☆22 Updated 7 years ago
- Drizzle integration with Apache Spark ☆120 Updated 6 years ago
- XGBoost GPU accelerated on Spark example applications ☆53 Updated 2 years ago
- Spark Structured Streaming State Tools ☆34 Updated 5 years ago
- Dynamic Distributed Dimensional Data Model ☆43 Updated last year
- Performance Analysis Tool ☆76 Updated last month
- Spark GPU and SIMD Support ☆61 Updated 4 years ago
- Mirror of Apache crail (Incubating) ☆150 Updated 3 years ago
- Query Spark in real time and visualise the result as a graph. ☆24 Updated 7 years ago
- Website for DataSketches. ☆102 Updated last month
- Measuring the performance of popular streaming engines with Yahoo's Streaming Benchmark ☆53 Updated 6 years ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of … ☆13 Updated last year
- Milan is a Scala API and runtime infrastructure for building data-oriented systems, built on top of Apache Flink. ☆39 Updated 2 years ago
- Splittable Gzip codec for Hadoop ☆70 Updated last week
- Spark ML Lib serving library ☆48 Updated 7 years ago
- Spark* shuffle plugin that supports shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis … ☆21 Updated last year