oap-project/pmem-shuffle

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/oap-project/pmem-shuffle)

oap-project / pmem-shuffle

Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote persistent memory (for read) to provide extremely high performance and low latency shuffle solutions for Spark*.

☆14

Alternatives and similar repositories for pmem-shuffle

Users that are interested in pmem-shuffle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

oap-project / remote-shuffle
View on GitHub
Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…
☆21Mar 15, 2024Updated 2 years ago
rwang067 / XPGraph
View on GitHub
Source code for XPGraph-MICRO22
☆11Apr 10, 2023Updated 3 years ago
Intel-bigdata / Spark-PMoF
View on GitHub
Spark Shuffle Optimization with RDMA+AEP
☆30May 23, 2023Updated 3 years ago
oap-project / oap-tools
View on GitHub
Tools for building, packaging, and OAP public cloud integrations such as AWS EMR, Google Dataproc and K8S.
☆18Mar 27, 2024Updated 2 years ago
oap-project / sql-ds-cache
View on GitHub
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Jan 3, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
implydata / druid-hadoop-inputformat
View on GitHub
Hadoop InputFormat for http://druid.io/
☆10Oct 26, 2016Updated 9 years ago
saltstack-formulas / keepalived-formula
View on GitHub
☆12Apr 7, 2025Updated last year
astroswego / plotypus
View on GitHub
A Python library and command line utility for manipulating and plotting stellar lightcurves.
☆10Jun 14, 2016Updated 10 years ago
jlmelville / rnndescent
View on GitHub
R package implementing the Nearest Neighbor Descent method for approximate nearest neighbors
☆18Jul 11, 2026Updated 2 weeks ago
chenwbyx / Multithreading-epoll
View on GitHub
基于多线程与epoll的高并发TCP服务器
☆11Aug 4, 2018Updated 7 years ago
wenyuzhao / lxr-pldi-2022-artifact
View on GitHub
☆12Mar 13, 2024Updated 2 years ago
NetEase / spark-alarm
View on GitHub
Alerting and monitoring tool for Apache Spark
☆23May 20, 2022Updated 4 years ago
Intel-bigdata / HPNL
View on GitHub
High Performance Network Library for RDMA
☆28Jan 3, 2023Updated 3 years ago
HKU-BAL / MegaGTA
View on GitHub
HMM-guided metagenomic gene-targeted assembler using iterative de Bruijn graphs
☆18Oct 3, 2016Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Tencent / Firestorm
View on GitHub
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…
☆256Apr 7, 2023Updated 3 years ago
linkedin / spark
View on GitHub
Apache Spark - A unified analytics engine for large-scale data processing
☆16Jul 24, 2023Updated 3 years ago
althonos / gb-io.py
View on GitHub
A Python interface to gb-io, a fast GenBank parser written in Rust.
☆24May 21, 2026Updated 2 months ago
ogrebgr / scram-sasl
View on GitHub
Java implementation of the SCRAM SASL for both server and client plus examples
☆17Apr 18, 2021Updated 5 years ago
AliyunContainerService / benchmark-for-spark
View on GitHub
benchmark-for-spark
☆19May 7, 2025Updated last year
IBM / spark-s3-shuffle
View on GitHub
A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.
☆52Sep 17, 2025Updated 10 months ago
WukLab / pDPM
View on GitHub
Passive Disaggregated Persistent Memory at USENIX ATC 2020.
☆52Nov 2, 2020Updated 5 years ago
ooneko / claude-config
View on GitHub
☆18Dec 30, 2025Updated 6 months ago
fmpr / texttk
View on GitHub
Text Preprocessing in Python
☆19Jan 15, 2017Updated 9 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
pmem / rpma
View on GitHub
Remote Persistent Memory Access Library
☆105Sep 5, 2023Updated 2 years ago
ayaanhossain / ViennaRNA
View on GitHub
ViennaRNA Package consists of a C code library for the prediction and comparison of RNA secondary structures
☆15May 20, 2022Updated 4 years ago
IvS-KULeuven / IvSPythonRepository
View on GitHub
Python Repository of the Institute of Astronomy @ KU Leuven
☆20Nov 5, 2020Updated 5 years ago
uber / RemoteShuffleService
View on GitHub
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
☆335Sep 29, 2023Updated 2 years ago
apache / livy-website
View on GitHub
Mirror of Apache livy (Incubating)
☆13Jul 7, 2026Updated 2 weeks ago
zrlio / parquet-generator
View on GitHub
Parquet file generator
☆22Apr 17, 2018Updated 8 years ago
spcl / naos
View on GitHub
Naos: Serialization-free RDMA networking in Java
☆17Aug 17, 2021Updated 4 years ago
monmohan / logdriver
View on GitHub
Log driver plugin for docker explained. The boilerplate code here can also be used to write your own driver if you are feeling adventurou…
☆13Mar 13, 2019Updated 7 years ago
datafusion-contrib / datafusion-objectstore-hdfs
View on GitHub
HDFS based on Java implementation as a remote ObjectStore for DataFusion
☆10Feb 13, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
micchie / paste
View on GitHub
Latest PASTE (NSDI'18) repository
☆13May 2, 2022Updated 4 years ago
XpressAI / SparkCyclone
View on GitHub
Plugin to accelerate Spark SQL with the NEC Vector Engine.
☆19Aug 15, 2022Updated 3 years ago
henry-nazare / llvm-sra
View on GitHub
Symbolic range analysis for LLVM.
☆12Jan 10, 2016Updated 10 years ago
liecn / cnli.me
View on GitHub
template for https://cnli.me
☆10Feb 27, 2025Updated last year
oracle / spark-oracle
View on GitHub
On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.
☆36Apr 15, 2025Updated last year
COMSYS / SymbolicLivenessAnalysis
View on GitHub
Symbolic Liveness Analysis of real-world software building upon KLEE to detect liveness violations (e.g. infinite loop bugs)
☆12Dec 16, 2021Updated 4 years ago
apache / kyuubi-client
View on GitHub
Client libraries of end users of Apache Kyuubi
☆11May 15, 2026Updated 2 months ago