Easy, Fast, and Scalable Multimodal AI
☆113Mar 4, 2026Updated this week
Alternatives and similar repositories for cornserve
Users who are interested in cornserve are comparing it to the libraries listed below
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- ☆12Apr 24, 2024Updated last year
- Production-ready, Light, and Flexible Webhook Infrastructure | Effortlessly Build Performant Webhook Integrations☆12Sep 8, 2024Updated last year
- A framework for generating realistic LLM serving workloads☆104Oct 9, 2025Updated 5 months ago
- Repo for CS 380D Distributed Systems course at the University of Texas at Austin CS Department☆25Mar 30, 2020Updated 5 years ago
- A unified benchmarking framework for generative styling models in PyTorch☆14Oct 27, 2024Updated last year
- This codebase holds API logic from Sparrow App.☆28Feb 27, 2026Updated last week
- ☆36Dec 16, 2025Updated 2 months ago
- ☆11Sep 12, 2025Updated 5 months ago
- Aequitas enables RPC-level QoS in datacenter networks.☆18Jul 19, 2022Updated 3 years ago
- Training tiny models to prove hard theorems☆41Feb 15, 2026Updated 3 weeks ago
- ☆21May 13, 2022Updated 3 years ago
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆91Feb 7, 2026Updated last month
- A canonical source of GenAI energy benchmark and measurements☆50Nov 29, 2025Updated 3 months ago
- ☆35Jun 22, 2024Updated last year
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"☆19Mar 8, 2025Updated last year
- ☆54Jul 16, 2025Updated 7 months ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆123Dec 25, 2025Updated 2 months ago
- A resilient distributed training framework☆97Apr 11, 2024Updated last year
- python package of rocm-smi-lib☆24Dec 15, 2025Updated 2 months ago
- Google Gemini AI model w/speech recognition and voice.☆26Nov 26, 2025Updated 3 months ago
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆71Jan 13, 2026Updated last month
- ☆76Feb 18, 2026Updated 2 weeks ago
- MIO: A Foundation Model on Multimodal Tokens☆34Dec 13, 2024Updated last year
- RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderb…☆71Feb 18, 2026Updated 2 weeks ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆30Jun 14, 2024Updated last year
- ☆30Updated this week
- ☆28May 2, 2023Updated 2 years ago
- Agentic Research and Evaluation Suite☆77Feb 26, 2026Updated last week
- Official pytorch implementation of "AlphaFlow: Understanding and Improving MeanFlow Models"☆103Oct 24, 2025Updated 4 months ago
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- ☆36Feb 6, 2026Updated last month
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 11 months ago
- Hydra adds resilience and high availability to remote memory solutions.☆33Feb 22, 2022Updated 4 years ago
- A Federated Execution Engine for Fast Distributed Computation Over Slow Networks☆26Apr 26, 2021Updated 4 years ago
- Research prototype of PRISM — a cost-efficient multi-LLM serving system with flexible time- and space-based GPU sharing.☆58Aug 15, 2025Updated 6 months ago
- Measure and optimize the energy consumption of your AI applications!☆338Mar 1, 2026Updated last week
- ☆87Oct 17, 2025Updated 4 months ago
- ☆128Mar 2, 2026Updated last week