zrlio / crail-spark-io
Fast I/O plugins for Spark
☆41 · Dec 14, 2020 · Updated 5 years ago
Alternatives and similar repositories for crail-spark-io
Users who are interested in crail-spark-io are comparing it to the libraries listed below.
- Mirror of Apache crail (Incubating) ☆151 · Jul 3, 2022 · Updated 3 years ago
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid… ☆258 · May 13, 2019 · Updated 6 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide) ☆17 · Sep 26, 2017 · Updated 8 years ago
- Albis: High-Performance File Format for Big Data Systems ☆21 · Jul 12, 2018 · Updated 7 years ago
- ☆68 · May 1, 2017 · Updated 8 years ago
- DiSNI: Direct Storage and Networking Interface ☆194 · Mar 9, 2023 · Updated 2 years ago
- A reusable, extensible, and efficient C++ implementation of the Foster B-tree data structure ☆15 · Jun 26, 2019 · Updated 6 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight ☆13 · Mar 2, 2023 · Updated 2 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources ☆18 · Jun 28, 2021 · Updated 4 years ago
- ☆15 · May 13, 2022 · Updated 3 years ago
- Thousand Island Scanner: Scaling Video Analysis on AWS Lambda ☆13 · Oct 25, 2019 · Updated 6 years ago
- NVIDIA GPU direct RDMA using SISCI API ☆17 · Apr 8, 2018 · Updated 7 years ago
- Java API for libaio ☆15 · Jan 10, 2022 · Updated 4 years ago
- Repository for the Spark-Vector connector ☆20 · Jul 7, 2021 · Updated 4 years ago
- A hybrid I/O virtualization framework for RDMA-capable network interfaces ☆35 · Mar 2, 2018 · Updated 7 years ago
- Demonstration of a Hive Input Format for Iceberg ☆26 · Mar 12, 2021 · Updated 4 years ago
- Source code for our OSDI 2016 paper ☆110 · Nov 11, 2018 · Updated 7 years ago
- Real-time query spark and visualise it as graph. ☆24 · Oct 4, 2017 · Updated 8 years ago
- Rust version of fastapprox: approximate versions of functions commonly used in machine learning. ☆24 · Sep 11, 2023 · Updated 2 years ago
- verbs profiling library ☆22 · Sep 22, 2023 · Updated 2 years ago
- Random implementation notes ☆33 · Apr 23, 2013 · Updated 12 years ago
- Quark is a data virtualization engine over analytic databases. ☆100 · Jul 13, 2017 · Updated 8 years ago
- Highly configurable Helm Presto Chart ☆24 · Nov 13, 2019 · Updated 6 years ago
- Bloomfilter support for Facebook Presto (prestodb.io) ☆25 · Jul 7, 2022 · Updated 3 years ago
- DaRPC: Data Center Remote Procedure Call ☆56 · Oct 13, 2020 · Updated 5 years ago
- A NVMf library for Java ☆30 · Aug 15, 2019 · Updated 6 years ago
- High Performance Network Library for RDMA ☆28 · Jan 3, 2023 · Updated 3 years ago
- Embed any webapp/website as Ambari view! ☆25 · Feb 26, 2016 · Updated 9 years ago
- Data sets and Vagrant script to provision a virtual machine for Apache Calcite development ☆31 · Mar 24, 2023 · Updated 2 years ago
- Verbs on DPDK ☆106 · Sep 5, 2022 · Updated 3 years ago
- Fast In-memory Transaction Processing using RDMA and HTM ☆59 · Dec 20, 2015 · Updated 10 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange ☆130 · Dec 19, 2024 · Updated last year
- Fast In-memory Transaction Processing using Hybrid RDMA Primitives ☆66 · Nov 15, 2018 · Updated 7 years ago
- spark-sight: Spark performance at a glance ☆10 · Apr 6, 2023 · Updated 2 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c… ☆27 · Dec 10, 2022 · Updated 3 years ago
- Apache Spark OpenCPU Executor (ROSE) ☆26 · Jun 16, 2018 · Updated 7 years ago
- Apache Yunikorn website - see the master branch for instructions ☆30 · Feb 5, 2026 · Updated last week
- Spark Shuffle Optimization with RDMA+AEP ☆30 · May 23, 2023 · Updated 2 years ago
- Spark Terasort ☆121 · Apr 21, 2023 · Updated 2 years ago