fsspec / alluxiofs
Speed up fsspec data access with Alluxio distributed caching.
☆14 Updated last month
Alternatives and similar repositories for alluxiofs:
Users who are interested in alluxiofs are comparing it to the libraries listed below.
- Lightning In-Memory Object Store ☆45 Updated 3 years ago
- Lakehouse storage system benchmark ☆73 Updated 2 years ago
- ☆10 Updated last year
- Grizzly: Efficient Stream Processing Through Adaptive Query Compilation ☆16 Updated 4 years ago
- Mirror of Apache Crail (Incubating) ☆150 Updated 2 years ago
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17 ☆54 Updated 11 months ago
- [SIGMOD '24] CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure ☆65 Updated 9 months ago
- ☆30 Updated 2 years ago
- A caching framework for microservice applications ☆20 Updated last year
- InfiniStore: an elastic serverless cloud storage system (VLDB'23) ☆22 Updated last year
- DS2 is an auto-scaling controller for distributed streaming dataflows ☆89 Updated 2 years ago
- ☆34 Updated 3 years ago
- SnailTrail implementation ☆39 Updated 6 years ago
- A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer ☆25 Updated 10 months ago
- Pythonic file-system interface for TOS (Tinder Object Storage) https://tosfs.readthedocs.io/en/latest/ ☆14 Updated last week
- This repository contains the code for our DaMoN '21 paper. ☆11 Updated 3 years ago
- ☆35 Updated 10 months ago
- HDFS file read access for ClickHouse ☆41 Updated last month
- A modular acceleration toolkit for big data analytic engines ☆68 Updated 11 months ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads ☆11 Updated last week
- ☆14 Updated 2 years ago
- Tools for generating TPC-* datasets ☆29 Updated 10 months ago
- Skeena: Efficient and Consistent Cross-Engine Transactions (SIGMOD 2022, ACM SIGMOD Research Highlights Award 2022) ☆21 Updated last year
- Fast I/O plugins for Spark ☆41 Updated 4 years ago
- Demonstrations of (in)consistency in various streaming systems. ☆23 Updated 4 years ago
- A native storage format for Apache Arrow ☆82 Updated last year
- Pafka originated from the OpenAIOS project to leverage an optimized tiered storage access strategy to improve overall performance for … ☆67 Updated 3 years ago
- A GPT-4-powered tool for detecting bugs in Databend ☆16 Updated 8 months ago
- LazyLog: A New Shared Log Abstraction for Low-Latency Applications ☆22 Updated 5 months ago
- OpenAurora is a cloud-native database system prototype developed at Purdue University. It is an open-source version of Amazon Aurora. It … ☆90 Updated last month