oap-project/remote-shuffle

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/oap-project/remote-shuffle)

oap-project / remote-shuffle

Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-disks.

☆21

Alternatives and similar repositories for remote-shuffle

Users that are interested in remote-shuffle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

oap-project / pmem-shuffle
View on GitHub
Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…
☆14Sep 18, 2023Updated 2 years ago
implydata / druid-hadoop-inputformat
View on GitHub
Hadoop InputFormat for http://druid.io/
☆10Oct 26, 2016Updated 9 years ago
uber / RemoteShuffleService
View on GitHub
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
☆335Sep 29, 2023Updated 2 years ago
anastasop / thames
View on GitHub
An ambient sound generator using free sounds from BBC Sounds Effects
☆14Dec 3, 2023Updated 2 years ago
intenthq / gander
View on GitHub
Html Content / Article Extractor in Scala
☆18May 23, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
oap-project / sql-ds-cache
View on GitHub
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Jan 3, 2023Updated 3 years ago
saltstack-formulas / keepalived-formula
View on GitHub
☆12Apr 7, 2025Updated last year
aws-samples / amazon-sagemaker-integration-with-snowflake
View on GitHub
☆10Oct 12, 2022Updated 3 years ago
alibaba / SparkCube
View on GitHub
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
☆136Mar 6, 2023Updated 3 years ago
althonos / gb-io.py
View on GitHub
A Python interface to gb-io, a fast GenBank parser written in Rust.
☆24May 21, 2026Updated 2 months ago
ogrebgr / scram-sasl
View on GitHub
Java implementation of the SCRAM SASL for both server and client plus examples
☆17Apr 18, 2021Updated 5 years ago
apache / livy-website
View on GitHub
Mirror of Apache livy (Incubating)
☆13Jul 7, 2026Updated 2 weeks ago
indextables / indextables_spark
View on GitHub
IndexTables is an open-table format for Apache Spark that enables fast retrieval and full-text search across large-scale data. It integra…
☆43Updated this week
linyiqun / open-source-patch
View on GitHub
项目中保留了向开源社区提交过的patch
☆16Oct 22, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
oap-project / gazelle_plugin
View on GitHub
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
☆255Feb 21, 2023Updated 3 years ago
swapagarwal / awesome-dropbox
View on GitHub
A curated list of awesome Dropbox SDKs, open source libraries, and cool tools and services powered by Dropbox.
☆15Apr 6, 2016Updated 10 years ago
sujitpal / ltr-examples
View on GitHub
Supporting code for Learning to Rank (LTR) presentation
☆16Oct 11, 2018Updated 7 years ago
qubole / rubix
View on GitHub
Cache File System optimized for columnar formats and object stores
☆188Aug 11, 2022Updated 3 years ago
XpressAI / SparkCyclone
View on GitHub
Plugin to accelerate Spark SQL with the NEC Vector Engine.
☆19Aug 15, 2022Updated 3 years ago
dianping / hiveweb
View on GitHub
☆15Aug 25, 2014Updated 11 years ago
apache / orc-format
View on GitHub
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
☆16May 15, 2026Updated 2 months ago
appsmithorg / awesome-for-beginners
View on GitHub
A list of awesome beginners-friendly projects.
☆12Oct 5, 2020Updated 5 years ago
gvanhavre / ArcheoBM
View on GitHub
Archaeology Based Modelling
☆12Sep 26, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
noleme / noleme-flow
View on GitHub
A library enabling DAG structuring of data processing programs such as ETLs
☆18Updated this week
n0vad3v / simple-multinode-clickhouse-cluster
View on GitHub
Deploy a simple Multi-Node Clickhouse Cluster with docker-compose in minutes.
☆17Feb 11, 2022Updated 4 years ago
aws-samples / emr-remote-shuffle-service
View on GitHub
☆18May 7, 2026Updated 2 months ago
kimrutherford / EMBOSS
View on GitHub
MIRROR OF: The European Molecular Biology Open Software Suite (from git://anonscm.debian.org/debian-med/emboss.git)
☆32Feb 18, 2022Updated 4 years ago
MammothGrowth / dbt-cli-mcp
View on GitHub
DBT CLI MCP Server
☆18Jun 26, 2025Updated last year
rlazoti / finagle-metrics
View on GitHub
Easy way to send Finagle metrics to Codahale Metrics library
☆43Apr 2, 2020Updated 6 years ago
prestodb / presto-query-predictor
View on GitHub
A query predictor pipeline and service to predict resource usages of Presto queries
☆14May 2, 2023Updated 3 years ago
aakashnand / ranger
View on GitHub
Mirror of Apache Ranger
☆15Apr 5, 2024Updated 2 years ago
syucream / avro-protobuf
View on GitHub
avro-protobuf in Go
☆10May 26, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
gamedev-js / vmath
View on GitHub
Yet another gl-matrix: faster and smaller.
☆17Jan 20, 2018Updated 8 years ago
tripadvisor / hive-query-tool
View on GitHub
A web interface to Hive with flexible, user-friendly query customization
☆24Jun 30, 2013Updated 13 years ago
alibaba-archive / aliyun-oss-hadoop-fs
View on GitHub
Hadoop filesystem implementation for Aliyun OSS
☆13Feb 14, 2016Updated 10 years ago
lightdash / helm-charts
View on GitHub
Lightdash Community helm charts
☆24Updated this week
DataEngineeringLabs / ranged-reader-rs
View on GitHub
A reader that buffers ranged calls
☆12May 17, 2022Updated 4 years ago
code-yeongyu / twitter_video_tools_v2
View on GitHub
An all in one Twitter video downloader
☆12Jan 10, 2024Updated 2 years ago
soulmachine / scala-cheat-sheet
View on GitHub
Scala cheat sheet
☆23Mar 28, 2014Updated 12 years ago