LMCache on Ascend
☆77Jun 24, 2026Updated this week
Alternatives and similar repositories for LMCache-Ascend
Users that are interested in LMCache-Ascend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Systematic and comprehensive benchmarks for LLM systems.☆61Jan 28, 2026Updated 5 months ago
- Community maintained hardware plugin for vLLM on Ascend☆2,295Updated this week
- ArcticInference: vLLM plugin for high-throughput, low-latency inference☆452Updated this week
- A simple tool for parsing the profile.json file of mxnet☆14Aug 1, 2018Updated 7 years ago
- SGLang kernel library for NPU☆148Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ParaDnn: A systematic performance analysis methodology for deep learning.☆40Mar 30, 2020Updated 6 years ago
- ☆157Mar 5, 2026Updated 3 months ago
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training☆32May 2, 2025Updated last year
- ☆11Feb 5, 2017Updated 9 years ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆36Jun 13, 2025Updated last year
- ☆13Jan 16, 2019Updated 7 years ago
- ☆11Dec 9, 2022Updated 3 years ago
- 📚 学习c++历程中模拟实现关于STL容器、特殊类、智能指针以及一些高阶的数据结构源码☆13Nov 29, 2019Updated 6 years ago
- ☆123May 19, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Oct 24, 2024Updated last year
- The driver for LMCache core to run in vLLM☆67Feb 4, 2025Updated last year
- Word template for a Lancaster University thesis☆10Mar 19, 2022Updated 4 years ago
- [DAC'25] Official implement of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"☆117Dec 15, 2025Updated 6 months ago
- ☆12May 3, 2020Updated 6 years ago
- ☆13Nov 21, 2024Updated last year
- 我的小窝, 装修全纪录☆11Apr 19, 2021Updated 5 years ago
- NVIDIA Inference Xfer Library (NIXL)☆1,106Updated this week
- Data Plane Development Kit☆13Nov 10, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- seq2seq_translation☆27Nov 28, 2021Updated 4 years ago
- graph challenge 2021☆27Jul 9, 2021Updated 4 years ago
- An Alluring, Dark, and Muted Theme For Xcode.☆14Aug 6, 2019Updated 6 years ago
- PCB libraries and templates for rocket-chip based FPGA/ASIC designs☆19Jun 4, 2026Updated 3 weeks ago
- ☆13Jun 20, 2019Updated 7 years ago
- ☆10Dec 27, 2020Updated 5 years ago
- 该储存库现已移动到“https://github.com/HoneyWhiteCloud/enable-hdr-oneplus13-webui”☆10Aug 30, 2025Updated 10 months ago
- Python module to compute the Mann-Kendall test for trend in time series data☆10Apr 18, 2017Updated 9 years ago
- Solution to Kaggle Santa 2021 Challenge☆14Jan 18, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 基于Vela平台的POSIX和Wasm的多语言运行时☆11Apr 25, 2023Updated 3 years ago
- Ampere CentOS kernel☆18Jul 16, 2024Updated last year
- Model Slicing for Analytics with Elastic Inference Cost and Resource Constraints☆12Jul 6, 2023Updated 2 years ago
- efftive-java-3rd 中文版☆13Oct 3, 2018Updated 7 years ago
- ☆12Jun 9, 2020Updated 6 years ago
- LMCache: Supercharge Your LLM with the Fastest KV Cache Layer☆9,944Updated this week
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago