NVIDIA / spark-rapids-benchmarksLinks
Spark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark
☆41Updated last week
Alternatives and similar repositories for spark-rapids-benchmarks
Users that are interested in spark-rapids-benchmarks are comparing it to the libraries listed below
Sorting:
- User tools for Spark RAPIDS☆64Updated last week
- RAPIDS Accelerator JNI For Apache Spark☆49Updated last week
- A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.☆160Updated 3 weeks ago
- A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer☆50Updated last year
- Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs☆83Updated last week
- RAPIDS GPU-BDB☆108Updated last year
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆929Updated last week
- Spark RAPIDS Container – Docker containers for Spark RAPIDS☆21Updated 7 months ago
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆346Updated 2 months ago
- A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer☆29Updated last year
- ☆21Updated 4 years ago
- Python bindings for UCX☆138Updated last week
- Tracking Ray Enhancement Proposals☆55Updated 5 months ago
- KvikIO - High Performance File IO☆223Updated this week
- ☆15Updated 2 years ago
- ☆39Updated this week
- Exoshuffle-CloudSort☆27Updated 2 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆149Updated last week
- Spark Shuffle Optimization with RDMA+AEP☆30Updated 2 years ago
- ☆24Updated 2 years ago
- GPU library for writing SQL queries☆79Updated last year
- Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.☆99Updated last year
- Distributed SQL Query Engine in Python using Ray☆244Updated 11 months ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆128Updated 8 months ago
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆185Updated this week
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆33Updated 2 years ago
- Lakehouse storage system benchmark☆76Updated 2 years ago
- Magnum IO community repo☆98Updated 3 weeks ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆39Updated last year
- Mirror of Apache crail (Incubating)☆150Updated 3 years ago