zrlio/crail-spark-io

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zrlio/crail-spark-io)

zrlio / crail-spark-io

Fast I/O plugins for Spark

☆42

Alternatives and similar repositories for crail-spark-io

Users that are interested in crail-spark-io are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zrlio / crail
View on GitHub
[Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O
☆75Mar 2, 2018Updated 8 years ago
apache / incubator-crail
View on GitHub
Mirror of Apache crail (Incubating)
☆152Jul 3, 2022Updated 4 years ago
zrlio / albis
View on GitHub
Albis: High-Performance File Format for Big Data Systems
☆21Jul 12, 2018Updated 8 years ago
ibm-research / iostash
View on GitHub
Flash cache solution iostash
☆11Jun 23, 2016Updated 10 years ago
Mellanox / SparkRDMA
View on GitHub
This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…
☆258May 13, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zrlio / jaio
View on GitHub
Java API for libaio
☆15Jan 10, 2022Updated 4 years ago
zrlio / disni
View on GitHub
DiSNI: Direct Storage and Networking Interface
☆194Mar 9, 2023Updated 3 years ago
zaratsian / HDP_Tuning_Unofficial
View on GitHub
Collection of HDP Tuning Tricks & Tips (unofficial guide)
☆17Sep 26, 2017Updated 8 years ago
InMobi / grill
View on GitHub
☆17Jan 29, 2019Updated 7 years ago
efficient / HERD
View on GitHub
☆70May 1, 2017Updated 9 years ago
IBM / LTFS-Data-Management
View on GitHub
☆36Sep 17, 2025Updated 10 months ago
youngwookim / awesome-presto
View on GitHub
A curated list of awesome PrestoDB / Trino software, libraries, tools and resources
☆18Jun 28, 2021Updated 5 years ago
efficient / fasst
View on GitHub
Source code for our OSDI 2016 paper
☆109Nov 11, 2018Updated 7 years ago
zrlio / urdma
View on GitHub
Verbs on DPDK
☆107Sep 5, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zrlio / darpc
View on GitHub
DaRPC: Data Center Remote Procedure Call
☆57Oct 13, 2020Updated 5 years ago
sjyk / sampleclean
View on GitHub
SampleClean+BlinkDB
☆18May 21, 2014Updated 12 years ago
IBM / uDepot
View on GitHub
Key-Value Store for Non-Volatile Memories uDepot
☆46May 23, 2022Updated 4 years ago
gdfm / partial-key-grouping
View on GitHub
An implementation and example of Partial Key Grouping for Apache Storm. Partial Key Grouping is a load balancing strategy for distributed…
☆15Sep 14, 2015Updated 10 years ago
Azure-Samples / hdinsight-spark-scala-kafka
View on GitHub
A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight
☆13Mar 2, 2023Updated 3 years ago
ActianCorp / spark-vector
View on GitHub
Repository for the Spark-Vector connector
☆20Jul 7, 2021Updated 5 years ago
zpysky1125 / Ensembling
View on GitHub
Sklearn implement of multiple ensemble learning methods, including bagging, adaboost, iterative bagging and multiboosting
☆13Jan 9, 2018Updated 8 years ago
elazarl / hadoop_rpc_walktrhough
View on GitHub
What happens on the wire when Hadoop RPC call is issued?
☆13Jul 1, 2022Updated 4 years ago
Intel-bigdata / HPNL
View on GitHub
High Performance Network Library for RDMA
☆28Jan 3, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
caetanosauer / foster-btree
View on GitHub
A reusable, extensible, and efficient C++ implementation of the Foster B-tree data structure
☆15Jun 26, 2019Updated 7 years ago
SJTU-IPADS / cocytus
View on GitHub
Cocytus is an efficient and available in-memory K/V-store through hybrid erasure coding and replication
☆31Mar 7, 2016Updated 10 years ago
tum-db / cachecoherence
View on GitHub
☆13May 11, 2026Updated 2 months ago
MemVerge / splash
View on GitHub
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
☆131Dec 19, 2024Updated last year
patrickpclee / codfs
View on GitHub
CodFS: An Erasure-Coded Clustered Storage System for Efficient Updates and Recovery
☆10Mar 31, 2015Updated 11 years ago
Impetus / ankush
View on GitHub
A big data cluster management tool that creates and manages clusters of different technologies.
☆21Apr 20, 2015Updated 11 years ago
Kasma-Inc / Sagi
View on GitHub
☆20Nov 13, 2025Updated 8 months ago
qianl15 / this
View on GitHub
Thousand Island Scanner: Scaling Video Analysis on AWS Lambda
☆13Oct 25, 2019Updated 6 years ago
ExpediaGroup / hiveberg
View on GitHub
Demonstration of a Hive Input Format for Iceberg
☆26Mar 12, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
djsutherland / skl-groups
View on GitHub
scikit-learn addon to operate on set/"group"-based features
☆41Aug 8, 2016Updated 9 years ago
Azure-Samples / Machine-Learning-Operationalization
View on GitHub
Deploying machine learning models to Azure
☆62Jun 28, 2019Updated 7 years ago
Intel-bigdata / Spark-PMoF
View on GitHub
Spark Shuffle Optimization with RDMA+AEP
☆30May 23, 2023Updated 3 years ago
att / netarbiter
View on GitHub
Multi-site Network Emulation, Kubeadm-installed Kubernetes, NVMe over Fabrics
☆19Feb 8, 2021Updated 5 years ago
vasigavr1 / Odyssey
View on GitHub
☆15May 13, 2022Updated 4 years ago
UrbanOS-Public / kdp
View on GitHub
Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store
☆17Oct 20, 2022Updated 3 years ago
L-Maybe / SemEval-2019-task3-EmoContext
View on GitHub
dawei.li SemEval-2019 task3 EmoContext: Multi-Step Ensemble Neural Network for Sentiment Analysis in Textual Conversation
☆16Jun 6, 2019Updated 7 years ago