Fast I/O plugins for Spark
☆41Dec 14, 2020Updated 5 years ago
Alternatives and similar repositories for crail-spark-io
Users that are interested in crail-spark-io are comparing it to the libraries listed below
Sorting:
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆74Mar 2, 2018Updated 8 years ago
- Mirror of Apache crail (Incubating)☆151Jul 3, 2022Updated 3 years ago
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆257May 13, 2019Updated 6 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- Albis: High-Performance File Format for Big Data Systems☆21Jul 12, 2018Updated 7 years ago
- A reusable, extensible, and efficient C++ implementation of the Foster B-tree data structure☆15Jun 26, 2019Updated 6 years ago
- Flash cache solution iostash☆11Jun 23, 2016Updated 9 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- Thousand Island Scanner: Scaling Video Analysis on AWS Lambda☆13Oct 25, 2019Updated 6 years ago
- ☆15May 13, 2022Updated 3 years ago
- NVIDIA GPU direct RDMA using SISCI API☆17Apr 8, 2018Updated 7 years ago
- Java API for libaio☆15Jan 10, 2022Updated 4 years ago
- Repository for the Spark-Vector connector☆20Jul 7, 2021Updated 4 years ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Oct 20, 2022Updated 3 years ago
- A hybrid I/O virtualization framework for RDMA-capable network interfaces☆35Mar 2, 2018Updated 8 years ago
- Some code snippets used in blogs☆17Oct 26, 2025Updated 4 months ago
- Demonstration of a Hive Input Format for Iceberg☆26Mar 12, 2021Updated 4 years ago
- Source code for our OSDI 2016 paper☆110Nov 11, 2018Updated 7 years ago
- Real-time query spark and visualise it as graph.☆24Oct 4, 2017Updated 8 years ago
- Open source framework for predictive modeling on Apache Hadoop☆34Aug 23, 2014Updated 11 years ago
- Rust version of fastapprox: approximate versions of functions commonly used in machine learning.☆24Sep 11, 2023Updated 2 years ago
- verbs profiling library☆22Sep 22, 2023Updated 2 years ago
- Use Vagrant and Ambari Blueprint API to install PivotalHD 3.0 (or Hortonworks HDP2.x) Hadoop cluster with HAWQ 1.3 (SQL on Hadoop) and Sp…☆23Jul 20, 2016Updated 9 years ago
- Random implementation notes☆33Apr 23, 2013Updated 12 years ago
- Quark is a data virtualization engine over analytic databases.☆100Jul 13, 2017Updated 8 years ago
- Bloomfilter support for Facebook Presto (prestodb.io)☆25Jul 7, 2022Updated 3 years ago
- Highly configurable Helm Presto Chart☆24Nov 13, 2019Updated 6 years ago
- High Performance Network Library for RDMA☆28Jan 3, 2023Updated 3 years ago
- A NVMf library for Java☆30Aug 15, 2019Updated 6 years ago
- Embed any webapp/website as Ambari view!☆25Feb 26, 2016Updated 10 years ago
- Verbs on DPDK☆106Sep 5, 2022Updated 3 years ago
- Fast In-memory Transaction Processing using RDMA and HTM☆59Dec 20, 2015Updated 10 years ago
- An opinionated Kubernetes deployment system for appops☆31Nov 7, 2016Updated 9 years ago
- ☆31Feb 22, 2024Updated 2 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆130Dec 19, 2024Updated last year
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆27Dec 10, 2022Updated 3 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30May 23, 2023Updated 2 years ago
- Loki log provider for OpenFaaS☆26Jan 12, 2024Updated 2 years ago