Preview Code for Continuum Paper
☆84 · Jun 22, 2026 · Updated this week
Alternatives and similar repositories for vllm-continuum
Users that are interested in vllm-continuum are comparing it to the libraries listed below.
- ☆28 · Apr 7, 2026 · Updated 2 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems ☆17 · Nov 1, 2025 · Updated 7 months ago
- A distributed computation platform for running Python and Bash computation tasks on multiple nodes ☆12 · Mar 19, 2025 · Updated last year
- ☆159 · Oct 9, 2024 · Updated last year
- ☆13 · Apr 9, 2025 · Updated last year
- [NeurIPS Spotlight 2025] Angles Don’t Lie: Unlocking Training-Efficient RL Through the Model’s Own Signals. ☆82 · Sep 26, 2025 · Updated 9 months ago
- A repo covering pseudocode from AI research papers ☆18 · Jun 20, 2023 · Updated 3 years ago
- Autocomp: Optimize any AI kernel, anywhere. ☆139 · Updated this week
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs ☆10 · Aug 24, 2023 · Updated 2 years ago
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25] ☆73 · Oct 2, 2025 · Updated 8 months ago
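- Recommendations for the top 5 VPNs ("ladders", "airports") of 2026 and an analysis of free proxy tools, optimized for users in China, combining fast connections, top-tier security, and good value. Global node acceleration with free switching between nodes, for unblocking restricted services such as ChatGPT, Google, YouTube, Netflix, and TikTok; supports Androi… ☆44 · Updated this week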
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model… ☆16 · Dec 11, 2023 · Updated 2 years ago
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput… ☆164 · Feb 27, 2026 · Updated 4 months ago
- A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems ☆273 · Mar 19, 2026 · Updated 3 months ago
- Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc. ☆65 · Mar 4, 2026 · Updated 3 months ago
- VSS: A Storage System for Video Analytics ☆13 · Jul 9, 2021 · Updated 4 years ago
- The repo of "BugLens" ☆41 · Nov 12, 2025 · Updated 7 months ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning ☆13 · Nov 1, 2021 · Updated 4 years ago
- ☆18 · Dec 2, 2025 · Updated 6 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer ☆193 · Feb 11, 2026 · Updated 4 months ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni… ☆15 · Oct 31, 2025 · Updated 7 months ago
- Linux programming environment course in Chinese ☆12 · Nov 19, 2017 · Updated 8 years ago
- ☆10 · Apr 20, 2025 · Updated last year
- ☆20 · Feb 2, 2026 · Updated 4 months ago
- Final project for the class "Deep Learning Systems Algorithms and Implementation" from CMU, where we try to make needle work with Apple M… ☆10 · Jan 8, 2023 · Updated 3 years ago
- A simple SQL parser based on Apache Calcite. ☆14 · May 8, 2026 · Updated last month
- A lightweight tool for detecting bugs on Graph Database Management Systems ☆15 · Jan 9, 2024 · Updated 2 years ago
- ☆16 · Nov 28, 2023 · Updated 2 years ago
- A PyTorch model profiler with information about MACs, energy, etc. ☆17 · Feb 24, 2024 · Updated 2 years ago
- The code of Advancing Expert Specialization for Better MoE (NeurIPS 2025 oral) ☆34 · Jan 22, 2026 · Updated 5 months ago
- ☆19 · Feb 18, 2025 · Updated last year
- Datalog Engines OPtimization Tester. ☆13 · Jan 18, 2024 · Updated 2 years ago
- A collection of GPU experiments and benchmarks for my personal understanding and research. ☆31 · Jun 15, 2026 · Updated 2 weeks ago
- Accepted paper of SIGMOD 2023, DUCATI: A Dual-Cache Training System for Graph Neural Networks on Giant Graphs with the GPU ☆16 · Dec 15, 2023 · Updated 2 years ago
- ☆12 · Sep 18, 2024 · Updated last year
- Nex Venus Communication Library ☆75 · Nov 17, 2025 · Updated 7 months ago
- Pokémon damage calculator ☆14 · Feb 7, 2024 · Updated 2 years ago
- Implementation of TCP connection tracking in eBPF ☆15 · May 9, 2024 · Updated 2 years ago
- ☆13 · Feb 16, 2023 · Updated 3 years ago