☆45Jun 7, 2024Updated last year
Alternatives and similar repositories for VTC-artifact
Users that are interested in VTC-artifact are comparing it to the libraries listed below
Sorting:
- ☆16Apr 15, 2025Updated 10 months ago
- ☆27Jan 28, 2026Updated last month
- ☆167Jul 15, 2025Updated 7 months ago
- ☆131Nov 11, 2024Updated last year
- Managed collective communication service☆23Sep 2, 2024Updated last year
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆24Nov 21, 2024Updated last year
- ☆26Aug 31, 2023Updated 2 years ago
- Compression for Foundation Models☆35Jul 21, 2025Updated 7 months ago
- A low-latency & high-throughput serving engine for LLMs☆482Jan 8, 2026Updated 2 months ago
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆85Jun 16, 2025Updated 8 months ago
- A large-scale simulation framework for LLM inference☆545Jul 25, 2025Updated 7 months ago
- Efficient and easy multi-instance LLM serving☆528Sep 3, 2025Updated 6 months ago
- ☆30May 28, 2024Updated last year
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- ☆87Oct 17, 2025Updated 4 months ago
- Disaggregated serving system for Large Language Models (LLMs).☆778Apr 6, 2025Updated 11 months ago
- ☆150Oct 9, 2024Updated last year
- raytracer☆10Jul 18, 2022Updated 3 years ago
- The ASPLOS 2025 / EuroSys 2025 Contest Track☆40Aug 7, 2025Updated 7 months ago
- ☆51Apr 30, 2025Updated 10 months ago
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆22Feb 9, 2026Updated last month
- OZZ: Identifying Kernel Out-of-Order Concurrency Bugs with In-Vivo Memory Access Reordering☆51Sep 2, 2024Updated last year
- NUST-API集合☆10Oct 29, 2018Updated 7 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated 2 months ago
- This is a command line interface for the Rec Cloud Service (rec.ustc.edu.cn)☆15Oct 24, 2025Updated 4 months ago
- API2Vec: Learning Representations of API Sequences for Malware Detection☆14Mar 10, 2024Updated 2 years ago
- ☆13May 13, 2025Updated 9 months ago
- Repo for transient training paper at ICAC 2019.☆11Oct 5, 2022Updated 3 years ago
- An Observability Framework for AI Training☆64Feb 13, 2026Updated 3 weeks ago
- ☆15Jan 27, 2026Updated last month
- Serverless Paper Reading and Discussion☆38Jan 9, 2023Updated 3 years ago
- GPU-accelerated LLM Training Simulator☆51Jun 26, 2025Updated 8 months ago
- Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching☆41Jul 10, 2024Updated last year
- ☆38Jan 15, 2021Updated 5 years ago
- ☆175Mar 12, 2024Updated last year
- PowerSensor is a low-cost, custom-built device that measures the instantaneous power consumption of GPUs and other devices at a high time…☆10Dec 15, 2025Updated 2 months ago
- ☆11Apr 10, 2025Updated 10 months ago
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week