☆291Jun 24, 2026Updated last week
Alternatives and similar repositories for FlexKV
Users that are interested in FlexKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆40Jun 17, 2026Updated 2 weeks ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆93Oct 15, 2025Updated 8 months ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- ☆32Jun 22, 2025Updated last year
- A simple demo for using Sentinel with Spring Cloud Alibaba☆17Nov 8, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 国内首个企业级 IT 运维多 Agent 自动化平台 — 基于大语言模型的智能运维解决方案。ITOps Agent Platform通过可视化工作流编排,将多个AI Agent组合成智能运维自动化流水线,实现服务器管理、告警处理、故障诊断、日志分析、脚本管理、定时运维任务的…☆263Jun 24, 2026Updated last week
- AI Cluster Observability & Troubleshooting Toolkit. Powered by SII & Infrawaves.☆36Apr 29, 2026Updated 2 months ago
- A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.☆111Dec 17, 2025Updated 6 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆194Feb 11, 2026Updated 4 months ago
- Important experiments on memory management, file access, network transfer, job scheduler, and so on.☆15Apr 27, 2022Updated 4 years ago
- perf-script and (Linux, QEMU, SeaBIOS) patches to measure the boot time of a Linux VM with QEMU☆41Apr 3, 2020Updated 6 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆15Aug 31, 2023Updated 2 years ago
- alibaba/Sentinel zuul integration sample☆11Oct 20, 2018Updated 7 years ago
- An ultra-fast, distributed Safetensors loader☆63Jun 22, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆35Nov 18, 2025Updated 7 months ago
- eRPC library for Rust☆14Jan 16, 2020Updated 6 years ago
- SocksDirect code repository☆20May 6, 2026Updated last month
- High Performance KV Cache Store for LLM☆56May 20, 2026Updated last month
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training☆32May 2, 2025Updated last year
- ☆42Dec 9, 2025Updated 6 months ago
- High-Performance Embeddable Vector Database with Document Storage, Hybrid Search, and Filtering☆82Jun 4, 2026Updated 3 weeks ago
- COSCon Workshop on ECharts☆18Oct 18, 2018Updated 7 years ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Heartland Payment Systems Java SDK☆10Feb 27, 2025Updated last year
- Persist and reuse KV Cache to speedup your LLM.☆295Updated this week
- A platform for formalizing OEIS sequences in Lean 4☆20Jun 10, 2026Updated 3 weeks ago
- Integrated Training Platform (ITP) traces used in ElasticFlow paper.☆31Dec 23, 2022Updated 3 years ago
- Gomoku, a HTML5 game working on PC and mobile device☆17May 24, 2013Updated 13 years ago
- CUDA keyring packaging for Debian☆14Apr 14, 2023Updated 3 years ago
- ☆14Jan 10, 2025Updated last year
- eTran: Extensible Kernel Transport with eBPF☆52Apr 28, 2025Updated last year
- Postgres protocol support for finagle☆36Sep 4, 2013Updated 12 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆40May 19, 2026Updated last month
- 技术杂文集☆34Feb 27, 2026Updated 4 months ago
- ☆88Sep 15, 2025Updated 9 months ago
- ☆11Nov 14, 2023Updated 2 years ago
- A size grip QGraphicsItem for interactive resizing.☆33Jun 6, 2022Updated 4 years ago
- IndexFS core☆21Jan 12, 2015Updated 11 years ago
- ansible plugins used by xiaomi☆10Oct 13, 2018Updated 7 years ago