Preview Code for Continuum Paper
☆82Jun 3, 2026Updated this week
Alternatives and similar repositories for vllm-continuum
Users that are interested in vllm-continuum are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆28Apr 7, 2026Updated 2 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆16Nov 1, 2025Updated 7 months ago
- ☆160Oct 9, 2024Updated last year
- Systematic and comprehensive benchmarks for LLM systems.☆59Jan 28, 2026Updated 4 months ago
- ☆13Apr 9, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs☆10Aug 24, 2023Updated 2 years ago
- Autocomp: Optimize any AI kernel, anywhere.☆135Updated this week
- The first range filter to simultaneously offer dynamicity, fast operations, and a robust false positive rate for any workload.☆13Jul 15, 2025Updated 10 months ago
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences☆32Mar 7, 2024Updated 2 years ago
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆16Dec 11, 2023Updated 2 years ago
- A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.☆225Updated this week
- A basic repository for a Clang-based tool, with CMake integration.☆10Sep 22, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Notes and work-in-progress for BPF-related research projects☆12Jan 10, 2025Updated last year
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆264Mar 19, 2026Updated 2 months ago
- Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.☆64Mar 4, 2026Updated 3 months ago
- VSS: A Storage System for Video Analytics☆13Jul 9, 2021Updated 4 years ago
- The repo of "BugLens"☆41Nov 12, 2025Updated 6 months ago
- ☆18Dec 2, 2025Updated 6 months ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning☆13Nov 1, 2021Updated 4 years ago
- ☆11Mar 9, 2022Updated 4 years ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆192Feb 11, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 7 months ago
- Machine Learning System☆14May 11, 2020Updated 6 years ago
- ☆10Apr 20, 2025Updated last year
- ☆20Feb 2, 2026Updated 4 months ago
- A simple SQL parser based on Apache Calcite.☆14May 8, 2026Updated last month
- A lightweight tool for detecting bugs on Graph Database Management Systems☆15Jan 9, 2024Updated 2 years ago
- Advancing the frontier of efficient AI☆65Updated this week
- ☆16Nov 28, 2023Updated 2 years ago
- A pytorch model profiler with information about macs, energy and e.t.c☆17Feb 24, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A rust-version of NVIDIA BlueField DOCA kit.☆14Jun 11, 2023Updated 2 years ago
- The code of Advancing Expert Specialization for Better MoE (NeurIPS2025 oral)☆32Jan 22, 2026Updated 4 months ago
- ☆19Feb 18, 2025Updated last year
- Datalog Engines OPtimization Tester.☆13Jan 18, 2024Updated 2 years ago
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆30Apr 9, 2026Updated last month
- Nex Venus Communication Library☆75Nov 17, 2025Updated 6 months ago
- Pokémon damage calculator☆14Feb 7, 2024Updated 2 years ago