Alibaba Cloud's high-performance KVCache system for LLM inference, with components for global cache management, inference simulation (HiSim), and more.
☆137Apr 14, 2026Updated this week
Alternatives and similar repositories for tair-kvcache
Users that are interested in tair-kvcache are comparing it to the libraries listed below.
- a simple API to use CUPTI☆10Aug 19, 2025Updated 7 months ago
- Agent skills for vLLM☆59Apr 3, 2026Updated last week
- ☆62Feb 5, 2026Updated 2 months ago
- DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.☆95Jan 16, 2026Updated 3 months ago
- An experimental communicating attention kernel based on DeepEP.☆35Jul 29, 2025Updated 8 months ago
- An event-driven C library, originally open-sourced by Taobao and maintained here☆21Mar 15, 2020Updated 6 years ago
- High-performance RDMA-based distributed feature collection component for training GNN models on EXTREMELY large graphs☆55Jul 3, 2022Updated 3 years ago
- Accelerated Computer Vision Lab (ACCV-Lab) is a systematic collection of packages with the common goal to facilitate end-to-end efficient…☆50Mar 24, 2026Updated 3 weeks ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- fork from yuki-xin/picgo-plugin-web-uploader☆16Sep 22, 2019Updated 6 years ago
- ☆26Oct 2, 2023Updated 2 years ago
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆33Aug 13, 2025Updated 8 months ago
- ☆11Jun 9, 2023Updated 2 years ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆177Feb 11, 2026Updated 2 months ago
- Local inference for Llama and Tongyi Qianwen large models, implemented with Apple Metal☆10Apr 26, 2024Updated last year
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆125Dec 25, 2025Updated 3 months ago
- ☆360Jan 28, 2026Updated 2 months ago
- ☆51May 19, 2025Updated 10 months ago
- Online meme maker☆21Aug 4, 2022Updated 3 years ago
- ☆15Nov 14, 2023Updated 2 years ago
- Tools and library to manipulate EFI variables.☆10Mar 26, 2026Updated 3 weeks ago
- ☆13Jun 29, 2024Updated last year
- HPC Game Platform☆11Apr 20, 2023Updated 2 years ago
- GNU Gzip with Kunpeng optimization.☆12Mar 30, 2022Updated 4 years ago
- Mini Moonbit implementation from 摩卡猫猫☆15Dec 4, 2024Updated last year
- Implementation of UltraMem, improved Product Key Memory design, from Bytedance AI labs☆28Nov 4, 2025Updated 5 months ago
- ☆41Feb 26, 2025Updated last year
- Source code analysis of NSQ v1.1.0☆14Aug 9, 2020Updated 5 years ago
- Mirror site speedtest☆12Dec 4, 2023Updated 2 years ago
- clickhouse-copier (obsolete)☆15Mar 17, 2024Updated 2 years ago
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated 2 months ago
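- I, Chen Ping'an, with just one click, can move mountains, overturn seas, subdue demons, suppress devils, command gods, pluck stars, sever rivers, topple cities, and split the heavens!☆22Jun 4, 2022Updated 3 years ago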
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- a high-performance, large-capacity, multi-tenant, data-persistent, strong data consistency based on raft, Redis-compatible elastic KV dat…☆52Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆43Updated this week
- ☆44Oct 15, 2025Updated 6 months ago
- ☆14Oct 8, 2023Updated 2 years ago
- A tool to detect which version of Redis your Redis-Like database is compatible with.☆42Updated this week