Preview Code for Continuum Paper
☆75Apr 13, 2026Updated 3 weeks ago
Alternatives and similar repositories for vllm-continuum
Users that are interested in vllm-continuum are comparing it to the libraries listed below.
- ☆27Apr 7, 2026Updated last month
- A distributed computation platform for running Python and Bash computation tasks on multiple nodes☆12Mar 19, 2025Updated last year
- ☆158Oct 9, 2024Updated last year
- Systematic and comprehensive benchmarks for LLM systems.☆57Jan 28, 2026Updated 3 months ago
- ☆12Apr 9, 2025Updated last year
- [NeurIPS Spotlight 2025] Angles Don’t Lie: Unlocking Training-Efficient RL Through the Model’s Own Signals.☆83Sep 26, 2025Updated 7 months ago
- The first range filter to simultaneously offer dynamicity, fast operations, and a robust false positive rate for any workload.☆12Jul 15, 2025Updated 9 months ago
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences☆32Mar 7, 2024Updated 2 years ago
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]☆68Oct 2, 2025Updated 7 months ago
- A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.☆194May 1, 2026Updated last week
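- Recommendations for the top 5 VPNs (proxy and "airport" services) of 2026 and an analysis of free proxy tools, optimized for users in China, combining fast connections, top-tier security, and good value. Global node acceleration with free switching between nodes lets you easily unlock restricted services such as ChatGPT, Google, YouTube, Netflix, and TikTok; supports Androi…☆32Apr 30, 2026Updated last week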
- The open source implementation of multi grouped query attention from the paper "GQA: Training Generalized Multi-Query Transformer Model…☆16Dec 11, 2023Updated 2 years ago
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…☆147Feb 27, 2026Updated 2 months ago
- Notes and work-in-progress for BPF-related research projects☆12Jan 10, 2025Updated last year
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆256Mar 19, 2026Updated last month
- Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.☆63Mar 4, 2026Updated 2 months ago
- The repo of "BugLens"☆40Nov 12, 2025Updated 5 months ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 6 months ago
- Machine Learning System☆14May 11, 2020Updated 5 years ago
- A simple SQL parser based on Apache Calcite.☆14Updated this week
- A lightweight tool for detecting bugs on Graph Database Management Systems☆15Jan 9, 2024Updated 2 years ago
- Advancing the frontier of efficient AI☆60Updated this week
- A PyTorch model profiler that reports MACs, energy, etc.☆17Feb 24, 2024Updated 2 years ago
- ☆19Feb 18, 2025Updated last year
- Datalog Engines OPtimization Tester.☆13Jan 18, 2024Updated 2 years ago
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆30Apr 9, 2026Updated last month
- Accepted paper of SIGMOD 2023, DUCATI: A Dual-Cache Training System for Graph Neural Networks on Giant Graphs with the GPU☆16Dec 15, 2023Updated 2 years ago
- Nex Venus Communication Library☆74Nov 17, 2025Updated 5 months ago
- ☆12Sep 18, 2024Updated last year
- Pokémon damage calculator☆14Feb 7, 2024Updated 2 years ago
- An RDMA skew-aware key-value store, which implements the Scale-Out ccNUMA design, to exploit skew in order to increase performance of dat…☆19Jul 1, 2021Updated 4 years ago
- The mgmt translator for Puppet manifests☆11Feb 27, 2024Updated 2 years ago
- ☆13Feb 16, 2023Updated 3 years ago
- NVIDIA cuTile learn☆168Dec 9, 2025Updated 5 months ago
- ☆28Jun 22, 2025Updated 10 months ago
- ☆27Apr 19, 2026Updated 3 weeks ago
- A toolkit for hybrid log parsing☆18Aug 23, 2023Updated 2 years ago