project-alchemist / AlchemistLinks
An HPC Interface for data analysis platforms
☆23Updated 5 years ago
Alternatives and similar repositories for Alchemist
Users that are interested in Alchemist are comparing it to the libraries listed below
Sorting:
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Updated 5 years ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 7 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 7 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30Updated 2 years ago
- Drizzle integration with Apache Spark☆120Updated 7 years ago
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.☆128Updated 5 years ago
- Spark GPU and SIMD Support☆61Updated 5 years ago
- Albis: High-Performance File Format for Big Data Systems☆21Updated 7 years ago
- A tool to get better debug info on spark's memory usage☆42Updated 6 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 9 years ago
- JVM integration for Weld☆16Updated 7 years ago
- XGBoost GPU accelerated on Spark example applications☆52Updated 3 years ago
- An extension of Yahoo's Benchmarks☆109Updated 2 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 2 years ago
- Fast I/O plugins for Spark☆41Updated 5 years ago
- MPI-oriented extension of the Spark computational model☆24Updated 7 years ago
- Routines and data structures for using isarn-sketches idiomatically in Apache Spark☆29Updated last year
- An experimental Graph Streaming API for Apache Flink☆141Updated 5 years ago
- Parquet file generator☆22Updated 7 years ago
- Website for DataSketches.☆108Updated 2 weeks ago
- Real-time query spark and visualise it as graph.☆24Updated 8 years ago
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Updated last year
- A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer☆52Updated 2 years ago
- A composable framework for fast and scalable data analytics☆57Updated 3 years ago
- Spark Structured Streaming State Tools☆34Updated 5 years ago
- Joins for skewed datasets in Spark☆57Updated 8 years ago
- An experiment to inject a customized parser using SparkSessionExtension☆16Updated 8 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- Distributed Temporal Graph Analytics with Apache Flink☆252Updated 3 weeks ago
- Spark Terasort☆121Updated 2 years ago