Ying1123 / VTC-artifactView external linksLinks
☆43Jun 7, 2024Updated last year
Alternatives and similar repositories for VTC-artifact
Users that are interested in VTC-artifact are comparing it to the libraries listed below
Sorting:
- ☆24Jan 28, 2026Updated 2 weeks ago
- ☆164Jul 15, 2025Updated 7 months ago
- ☆131Nov 11, 2024Updated last year
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆135Feb 22, 2024Updated last year
- Managed collective communication service☆23Sep 2, 2024Updated last year
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆25Nov 21, 2024Updated last year
- Compression for Foundation Models☆35Jul 21, 2025Updated 6 months ago
- ☆26Aug 31, 2023Updated 2 years ago
- A low-latency & high-throughput serving engine for LLMs☆474Jan 8, 2026Updated last month
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆84Jun 16, 2025Updated 7 months ago
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- A large-scale simulation framework for LLM inference☆535Jul 25, 2025Updated 6 months ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆29Mar 5, 2024Updated last year
- Efficient and easy multi-instance LLM serving☆527Sep 3, 2025Updated 5 months ago
- ☆29May 28, 2024Updated last year
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last week
- ☆85Oct 17, 2025Updated 3 months ago
- The ASPLOS 2025 / EuroSys 2025 Contest Track☆39Aug 7, 2025Updated 6 months ago
- Disaggregated serving system for Large Language Models (LLMs).☆776Apr 6, 2025Updated 10 months ago
- ☆151Oct 9, 2024Updated last year
- raytracer☆10Jul 18, 2022Updated 3 years ago
- A Framework for Automated Validation of Deep Learning Training Tasks☆61Jan 10, 2026Updated last month
- OZZ: Identifying Kernel Out-of-Order Concurrency Bugs with In-Vivo Memory Access Reordering☆50Sep 2, 2024Updated last year
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- separating music and voice from a song☆10Nov 29, 2018Updated 7 years ago
- ☆15Jan 27, 2026Updated 2 weeks ago
- This is a command line interface for the Rec Cloud Service (rec.ustc.edu.cn)☆15Oct 24, 2025Updated 3 months ago
- NUST-API集合☆10Oct 29, 2018Updated 7 years ago
- GPU-accelerated LLM Training Simulator☆51Jun 26, 2025Updated 7 months ago
- Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching☆41Jul 10, 2024Updated last year
- ☆38Jan 15, 2021Updated 5 years ago
- ☆15Feb 10, 2023Updated 3 years ago
- ☆10Nov 7, 2023Updated 2 years ago
- ☆10Sep 9, 2021Updated 4 years ago
- EdgeRag is a program that runs large language models and vector databases on your local device☆14May 29, 2024Updated last year
- LLM-guided hyperparameter tuning☆10Oct 7, 2023Updated 2 years ago
- An EDM-enabled PHY + a rack-level network simulator☆12Dec 11, 2024Updated last year
- This repo hosts the famfs kernel patch sets as branches☆11Jan 18, 2026Updated 3 weeks ago
- ☆11Mar 13, 2023Updated 2 years ago