Preview Code for Continuum Paper
☆54Mar 20, 2026Updated last week
Alternatives and similar repositories for vllm-continuum
Users that are interested in vllm-continuum are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Dec 25, 2025Updated 3 months ago
- a distributed computation platform for running Python and Bash computation tasks on multiple nodes☆12Mar 19, 2025Updated last year
- ☆12Apr 9, 2025Updated 11 months ago
- [ICML 2025] Improving Planning of Agents for Long-Horizon Tasks☆27Oct 2, 2025Updated 5 months ago
- ☆11Jan 19, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences☆32Mar 7, 2024Updated 2 years ago
- AgentOpt automatically finds the best LLM model combination for each step of your agent — optimizing for accuracy, cost, and latency.☆47Updated this week
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]☆67Oct 2, 2025Updated 5 months ago
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…☆139Feb 27, 2026Updated last month
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆243Mar 19, 2026Updated last week
- A basic repository for a Clang-based tool, with CMake integration.☆10Sep 22, 2023Updated 2 years ago
- Advancing the frontier of efficient AI☆56Mar 20, 2026Updated last week
- VSS: A Storage System for Video Analytics☆13Jul 9, 2021Updated 4 years ago
- Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.☆57Mar 4, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Mar 9, 2022Updated 4 years ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆172Feb 11, 2026Updated last month
- Automated GPU Kernel Generation via Co-Evolving Intrinsic World Model☆91Mar 2, 2026Updated 3 weeks ago
- Machine Learning System☆14May 11, 2020Updated 5 years ago
- ☆19Feb 2, 2026Updated last month
- A lightweight tool for detecting bugs on Graph Database Management Systems☆15Jan 9, 2024Updated 2 years ago
- A pytorch model profiler with information about macs, energy and e.t.c☆17Feb 24, 2024Updated 2 years ago
- A rust-version of NVIDIA BlueField DOCA kit.☆14Jun 11, 2023Updated 2 years ago
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆26Mar 18, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Nex Venus Communication Library☆74Nov 17, 2025Updated 4 months ago
- Pokémon damage calculator☆13Feb 7, 2024Updated 2 years ago
- ☆12Sep 18, 2024Updated last year
- Request access to Optane powered bare metal infrastructure for performance-testing and analysis purposes☆14Jan 23, 2019Updated 7 years ago
- DS SERVE: The Largest Open Vector Store over Pretain Data; A Framework for Efficient and Scalable Neural Retrieval☆48Jan 28, 2026Updated 2 months ago
- ☆29Jun 22, 2025Updated 9 months ago
- ☆24Updated this week
- ☆10Oct 8, 2021Updated 4 years ago
- A system for scheduling serverless edge functions☆11Aug 11, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆18Jul 5, 2024Updated last year
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆103Mar 10, 2026Updated 2 weeks ago
- A comprehensive repository for Compute Express Link (CXL) resources: covering research papers, specifications, simulation/emulation tools…☆23Feb 24, 2026Updated last month
- Experimental Bookie C++ implementation☆15Jun 28, 2016Updated 9 years ago
- A docker image for One Student One Chip's debug exam☆10Sep 22, 2023Updated 2 years ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)☆16Apr 7, 2025Updated 11 months ago
- Systematic and comprehensive benchmarks for LLM systems.☆53Jan 28, 2026Updated 2 months ago