Spark* shuffle plugin that supports shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local disks.
☆21 · Mar 15, 2024 · Updated last year
Alternatives and similar repositories for remote-shuffle
Users interested in remote-shuffle compare it to the libraries listed below.
- Spark* shuffle plugin that supports shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe… ☆14 · Sep 18, 2023 · Updated 2 years ago
- Hadoop InputFormat for http://druid.io/ ☆10 · Oct 26, 2016 · Updated 9 years ago
- Ted is a line-oriented text editor and formatter ☆12 · Jun 29, 2020 · Updated 5 years ago
- An ambient sound generator using free sounds from BBC Sound Effects ☆14 · Dec 3, 2023 · Updated 2 years ago
- HTML Content / Article Extractor in Scala ☆18 · May 23, 2018 · Updated 7 years ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers. ☆335 · Sep 29, 2023 · Updated 2 years ago
- Example application written using Reboot ☆11 · Jan 24, 2026 · Updated last month
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark. ☆136 · Mar 6, 2023 · Updated 2 years ago
- S3-native streaming platform. A Kafka alternative with infinite scalability, 10x lower cost, and Kafka-compatible APIs. Written in Rust. ☆41 · Updated this week
- phData Pulse application log aggregation and monitoring ☆13 · Apr 13, 2020 · Updated 5 years ago
- Package providing a Java implementation of latent Dirichlet allocation (LDA) for topic modelling ☆10 · May 18, 2017 · Updated 8 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer. ☆37 · Jan 3, 2023 · Updated 3 years ago
- A Proof Generator for Entailments, Tautologies, and Semantic Equivalences in First-order Logic ☆44 · Jan 5, 2026 · Updated last month
- sbt plugin for Scala modules. ☆14 · Feb 25, 2026 · Updated last week
- Incan: a modern, Pythonic language that compiles to Rust! Type-safe, async-friendly, with fixtures, testing, and web/inter-op built in. ☆12 · Feb 23, 2026 · Updated last week
- Bringing up Docker Compose environments for system, integration and performance testing, with support for ScalaTest and Gatling ☆11 · Jul 29, 2021 · Updated 4 years ago
- The NVRC project provides a Rust binary that implements a simple init system for microVMs. ☆25 · Feb 24, 2026 · Updated last week
- A curated list of awesome Dropbox SDKs, open source libraries, and cool tools and services powered by Dropbox. ☆15 · Apr 6, 2016 · Updated 9 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink ☆14 · Aug 17, 2015 · Updated 10 years ago
- RLIBM-ALL: A correctly rounded math library and a polynomial generator that produces correct results for multiple floating point represen… ☆17 · Oct 6, 2023 · Updated 2 years ago
- A list of awesome beginner-friendly projects. ☆12 · Oct 5, 2020 · Updated 5 years ago
- A collection of Flink applications for working with Pravega streams ☆12 · Dec 20, 2022 · Updated 3 years ago
- Meet Rustacean GPT, an experimental project transforming OpenAI's GPT into a helpful, autonomous software engineer to support senior deve… ☆14 · May 10, 2023 · Updated 2 years ago
- Easy way to send Finagle metrics to Codahale Metrics library ☆42 · Apr 2, 2020 · Updated 5 years ago
- Cache File System optimized for columnar formats and object stores ☆187 · Aug 11, 2022 · Updated 3 years ago
- A Trino ODBC driver ☆14 · Jan 10, 2024 · Updated 2 years ago
- SERA: IPv6 Segment Routing Aware Firewall ☆11 · Apr 30, 2018 · Updated 7 years ago
- ☆18 · Nov 4, 2024 · Updated last year
- A library enabling DAG structuring of data processing programs such as ETLs ☆17 · Dec 13, 2025 · Updated 2 months ago
- Shapeless generic instances for Scrooge types ☆14 · Feb 16, 2018 · Updated 8 years ago
- TPC-C for YDB ☆12 · Aug 11, 2025 · Updated 6 months ago
- JupyterLab Notebook for Mesosphere DC/OS ☆11 · Aug 6, 2019 · Updated 6 years ago
- ☆12 · Apr 7, 2025 · Updated 10 months ago
- General collection of notes ☆10 · Oct 8, 2018 · Updated 7 years ago
- Docker containers with Apache Accumulo and Apache Spark environment. ☆12 · Jan 22, 2016 · Updated 10 years ago
- Get a nicely formatted summary of authors that contributed to a project between two points in git history ☆10 · Jan 1, 2025 · Updated last year
- Cloud Spanner Connector for Apache Spark ☆17 · Feb 23, 2026 · Updated last week
- Generate random Snellen charts for visual acuity tests ☆13 · Oct 31, 2023 · Updated 2 years ago
- JCublas - Java bindings for CUBLAS ☆13 · Nov 16, 2024 · Updated last year