eth-easl / cachew
ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
☆36Updated 4 months ago
Alternatives and similar repositories for cachew:
Users that are interested in cachew are comparing it to the libraries listed below
- Stateful LLM Serving☆44Updated 6 months ago
- Lightning In-Memory Object Store☆44Updated 3 years ago
- ☆16Updated 2 years ago
- Vector search with bounded performance.☆34Updated last year
- ☆23Updated last year
- A resilient distributed training framework☆88Updated 9 months ago
- ☆53Updated 4 years ago
- A Memory-Disaggregated Managed Runtime.☆65Updated 3 years ago
- ☆43Updated 3 years ago
- ☆34Updated 3 years ago
- Microsoft Collective Communication Library☆61Updated 2 months ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- ☆24Updated last year
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆109Updated 11 months ago
- Ultra | Ultimate | Unified CCL☆23Updated last week
- An interference-aware scheduler for fine-grained GPU sharing☆121Updated this week
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆31Updated last year
- ☆10Updated 8 months ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆48Updated 2 years ago
- Dorylus: Affordable, Scalable, and Accurate GNN Training☆77Updated 3 years ago
- Virtual Memory Abstraction for Serverless Architectures☆45Updated 2 years ago
- SFS: A Smart OS Scheduler for Serverless Function Workloads (SC'22)☆13Updated 2 years ago
- A universal workflow system for exactly-once DAGs☆23Updated last year
- ☆13Updated 2 years ago
- Releasing the spot availability traces used in "Can't Be Late" paper.☆17Updated 9 months ago
- PetPS: Supporting Huge Embedding Models with Tiered Memory☆30Updated 8 months ago
- MemLiner is a remote-memory-friendly runtime system.☆30Updated 2 years ago
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆22Updated 2 months ago
- An I/O benchmark for deep Learning applications☆73Updated 3 months ago
- An experimental parallel training platform☆54Updated 10 months ago