eth-easl / cachew
ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
☆37 Updated 6 months ago
Alternatives and similar repositories for cachew:
Users who are interested in cachew are comparing it to the repositories listed below:
- ☆23 Updated last year
- ☆16 Updated 2 years ago
- ☆53 Updated 4 years ago
- A resilient distributed training framework ☆89 Updated 11 months ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances. ☆49 Updated 2 years ago
- Lightning In-Memory Object Store ☆45 Updated 3 years ago
- ☆43 Updated 3 years ago
- Stateful LLM Serving ☆46 Updated this week
- ☆35 Updated 4 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training ☆31 Updated 2 years ago
- A universal workflow system for exactly-once DAGs ☆23 Updated last year
- Vector search with bounded performance. ☆34 Updated last year
- Virtual Memory Abstraction for Serverless Architectures ☆46 Updated 2 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces. ☆53 Updated 6 months ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph ☆51 Updated 2 years ago
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24] ☆22 Updated 3 months ago
- FTPipe and related pipeline model parallelism research. ☆41 Updated last year
- ☆24 Updated last year
- Artifacts for our ASPLOS'23 paper ElasticFlow ☆52 Updated 10 months ago
- Serverless for all computation ☆42 Updated 2 years ago
- ☆11 Updated 9 months ago
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations. ☆50 Updated last month
- ☆44 Updated 8 months ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances ☆112 Updated last year
- ☆31 Updated 9 months ago
- MemLiner is a remote-memory-friendly runtime system. ☆32 Updated 2 years ago
- A Memory-Disaggregated Managed Runtime. ☆65 Updated 3 years ago
- Microsoft Collective Communication Library ☆61 Updated 3 months ago
- Modyn is a research-platform for training ML models on growing datasets. ☆46 Updated last week
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store" ☆17 Updated this week