minlanyu / cs243-siteLinks
☆65Updated 6 months ago
Alternatives and similar repositories for cs243-site
Users that are interested in cs243-site are comparing it to the libraries listed below
Sorting:
- A resilient distributed training framework☆95Updated last year
- ☆72Updated 3 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆81Updated last year
- Ensō is a high-performance streaming interface for NIC-application communication.☆72Updated this week
- My paper/code reading notes in Chinese☆46Updated 2 weeks ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆50Updated 2 years ago
- ☆37Updated 7 months ago
- ☆32Updated last year
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆25Updated 7 months ago
- Advanced Scalable Systems for X☆34Updated 6 months ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆23Updated last month
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"☆18Updated 3 months ago
- Website for Artifact Evaluation at EuroSys, SOSP, OSDI, ATC☆44Updated 2 weeks ago
- Random collections of my interested research papers / projects☆20Updated 4 years ago
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆39Updated last week
- Systems for GenAI☆142Updated 2 months ago
- LLM serving cluster simulator☆106Updated last year
- system paper reading notes☆246Updated 3 years ago
- ☆39Updated 5 months ago
- ☆30Updated 2 months ago
- Nu is a new datacenter system that enables developers to build fungible applications that can use datacenter resources wherever they are.☆38Updated last year
- Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o…☆107Updated 2 weeks ago
- SOTA Learning-augmented Systems☆36Updated 3 years ago
- Major CS conference publication stats (including accepted and submitted) by year.☆128Updated this week
- ☆53Updated 4 years ago
- Selected Topics in Computer Networks @ Johns Hopkins University☆19Updated 4 years ago
- ☆14Updated 3 years ago
- ☆79Updated 2 years ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆38Updated 9 months ago
- Stateful LLM Serving☆73Updated 3 months ago