[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
☆222 · Sep 21, 2024 · Updated last year
Alternatives and similar repositories for ParrotServe
Users that are interested in ParrotServe are comparing it to the libraries listed below.
- Compiler for Dynamic Neural Networks ☆45 · Nov 13, 2023 · Updated 2 years ago
- Dynamic Memory Management for Serving LLMs without PagedAttention ☆498 · Jun 10, 2026 · Updated 3 weeks ago
- [ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference ☆397 · Jul 10, 2025 · Updated 11 months ago
- Artifact of OSDI '24 paper, “Llumnix: Dynamic Scheduling for Large Language Model Serving” ☆64 · Jun 5, 2024 · Updated 2 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances ☆134 · Feb 22, 2024 · Updated 2 years ago
- A throughput-oriented high-performance serving framework for LLMs ☆962 · Mar 29, 2026 · Updated 3 months ago
- An Attention Superoptimizer ☆22 · Jan 20, 2025 · Updated last year
- A low-latency & high-throughput serving engine for LLMs ☆507 · Jan 8, 2026 · Updated 5 months ago