liangyuRain / ForestColl
☆8Updated last month
Related projects
Alternatives and complementary repositories for ForestColl
- ☆14Updated 5 months ago
- ☆43Updated 3 years ago
- Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23)☆9Updated last year
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆47Updated last year
- ☆13Updated 2 years ago
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆11Updated 5 months ago
- ☆22Updated 2 months ago
- Nu is a new datacenter system that enables developers to build fungible applications that can use datacenter resources wherever they are.☆35Updated 6 months ago
- Deduplication over disaggregated memory for Serverless Computing☆12Updated 2 years ago
- A Rust-based benchmark for BlueField SmartNICs.☆27Updated last year
- ☆34Updated 5 months ago
- Dorylus: Affordable, Scalable, and Accurate GNN Training☆77Updated 3 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆85Updated last year
- ☆24Updated last year
- NetLock: Fast, Centralized Lock Management Using Programmable Switches☆30Updated 4 years ago
- Code for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆16Updated this week
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations.☆49Updated last month
- Vector search with bounded performance.☆33Updated 10 months ago
- Stateful LLM Serving☆38Updated 3 months ago
- ☆23Updated last year
- ☆31Updated 5 months ago
- ☆48Updated last year
- Benchmark Suite for RDMA Performance Isolation☆36Updated last year
- Official repository for "IPDPS '24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆19Updated 9 months ago
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Updated 2 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Updated last year
- Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny model can tell you the verbosity of an LLM (…☆22Updated 5 months ago
- Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling☆13Updated last year
- ☆41Updated last year