Alibaba Cloud's high-performance KVCache system for LLM inference, with components for global cache management, inference simulation (HiSim), and more.
☆181Jun 4, 2026Updated this week
Alternatives and similar repositories for tair-kvcache
Users who are interested in tair-kvcache are comparing it to the libraries listed below.
- a simple API to use CUPTI☆10Aug 19, 2025Updated 9 months ago
- helm charts for deploying models with llm-d☆31Updated this week
- DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.☆99Jan 16, 2026Updated 4 months ago
- ☆66Apr 26, 2025Updated last year
- An experimental communicating attention kernel based on DeepEP.☆34Jul 29, 2025Updated 10 months ago
- An event-driven C library, originally open-sourced by Taobao and now maintained here☆21Mar 15, 2020Updated 6 years ago
- High-performance RDMA-based distributed feature collection component for training GNN models on EXTREMELY large graphs☆55Jul 3, 2022Updated 3 years ago
- ☆17Oct 28, 2020Updated 5 years ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- resp-benchmark is a benchmark tool for testing databases that support the RESP protocol, such as Redis, Valkey, and Tair.☆27Mar 2, 2026Updated 3 months ago
- cluster power control☆49May 29, 2026Updated last week
- Accelerated Computer Vision Lab (ACCV-Lab) is a systematic collection of packages with the common goal to facilitate end-to-end efficient…☆62Updated this week
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆34Aug 13, 2025Updated 9 months ago
- llm-d benchmark scripts and tooling☆61Updated this week
- Local inference for the Llama and Tongyi Qianwen (Qwen) large models, implemented with Apple Metal☆10Apr 26, 2024Updated 2 years ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆126Dec 25, 2025Updated 5 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆192Feb 11, 2026Updated 3 months ago
- ☆367Jan 28, 2026Updated 4 months ago
- ☆52May 19, 2025Updated last year
- View the files a process has open and their page cache usage☆12Nov 20, 2015Updated 10 years ago
- ☆14Jul 28, 2024Updated last year
- Tools and library to manipulate EFI variables.☆10Apr 21, 2026Updated last month
- ☆13Jun 29, 2024Updated last year
- HPC Game Platform☆11Apr 20, 2023Updated 3 years ago
- GNU Gzip with Kunpeng optimization.☆12Mar 30, 2022Updated 4 years ago
- Mini Moonbit implementation from 摩卡猫猫☆15Dec 4, 2024Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆110Jun 28, 2025Updated 11 months ago
- Implementation of UltraMem, improved Product Key Memory design, from Bytedance AI labs☆28Nov 4, 2025Updated 7 months ago
- ☆41Feb 26, 2025Updated last year
- Oracle NoSQL Database. Designed for today’s most demanding applications that require low latency responses, flexible data models, and e…☆56Mar 2, 2026Updated 3 months ago
- Source code analysis of NSQ v1.1.0☆14Aug 9, 2020Updated 5 years ago
- Mirror site speedtest☆12Dec 4, 2023Updated 2 years ago
- clickhouse-copier (obsolete)☆15Mar 17, 2024Updated 2 years ago
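- I, Chen Ping'an, with but a single key, can move mountains, overturn seas, subdue demons, suppress devils, command gods, pluck stars, sever rivers, topple cities, and split open the heavens!☆22Jun 4, 2022Updated 4 years ago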
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- A flexible C++ formatting library designed for i18n, using embedded script to output plural forms, grammatical gender, etc. correctly☆12May 3, 2026Updated last month
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆41Nov 11, 2025Updated 6 months ago
- ☆45Oct 15, 2025Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆44May 28, 2026Updated last week