iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.
☆39 · Jun 11, 2024 · Updated 2 years ago
Alternatives and similar repositories for igniter
Users that are interested in igniter are comparing it to the libraries listed below.
- DelayStage is a simple yet effective stage delay scheduling strategy to interleave the cluster resources across the parallel stages, so a… ☆14 · Sep 7, 2023 · Updated 2 years ago
- ☆12 · Sep 20, 2023 · Updated 2 years ago
- ebrowser, an energy-efficient and lightweight human interaction framework without degrading the user experience in mobile Web browsers. ☆12 · Sep 7, 2023 · Updated 2 years ago
- Reading paper list for iCloud group ☆14 · May 3, 2026 · Updated 2 months ago
- Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs… ☆23 · Dec 19, 2024 · Updated last year
- ☆38 · Jun 27, 2025 · Updated last year
- ☆53 · Dec 26, 2024 · Updated last year
- Cost-efficient and Instruction-driven AI Conversation in Digital Pathology ☆23 · Nov 5, 2025 · Updated 8 months ago
- ☆54 · Dec 13, 2022 · Updated 3 years ago
- An interference-aware scheduler for fine-grained GPU sharing ☆163 · Nov 26, 2025 · Updated 7 months ago
- ☆23 · Jan 7, 2022 · Updated 4 years ago
- BATCH: Adaptive Batching for Efficient Machine Learning Serving on Serverless Platforms ☆11 · Aug 7, 2021 · Updated 4 years ago
- Source code for Jellyfish, a soft real-time inference serving system ☆15 · Dec 20, 2022 · Updated 3 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling