LMCache / LMCache-AscendView external linksLinks
LMCache on Ascend
☆49Feb 6, 2026Updated last week
Alternatives and similar repositories for LMCache-Ascend
Users that are interested in LMCache-Ascend are comparing it to the libraries listed below
Sorting:
- Systematic and comprehensive benchmarks for LLM systems.☆50Jan 28, 2026Updated 2 weeks ago
- Protocol buffers and other common resources.☆13Jan 20, 2026Updated 3 weeks ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆11Dec 31, 2024Updated last year
- ☆117Jan 10, 2026Updated last month
- SGLang kernel library for NPU☆96Feb 5, 2026Updated last week
- A simple tool for parsing the profile.json file of mxnet☆14Aug 1, 2018Updated 7 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- Word template for a Lancaster University thesis☆11Mar 19, 2022Updated 3 years ago
- Ampere CentOS kernel☆18Jul 16, 2024Updated last year
- ☆13Nov 21, 2024Updated last year
- ☆11Apr 21, 2020Updated 5 years ago
- Empowering everyone to create reliable and safety AI coding agent.☆12Sep 2, 2024Updated last year
- Data Plane Development Kit☆12Nov 10, 2025Updated 3 months ago
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- AutoML 2024: HPOD: Hyperparameter Optimization for Unsupervised Outlier Detection☆12Jul 12, 2024Updated last year
- clustering algorithm implementation☆13Nov 3, 2025Updated 3 months ago
- A rust wrapper for HIP☆12Jun 10, 2025Updated 8 months ago
- ☆10Mar 14, 2020Updated 5 years ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆43Jan 30, 2026Updated 2 weeks ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- An ecosystem of Rust libraries for working with large language models☆13Oct 2, 2023Updated 2 years ago
- Yet another coding assistant powered by LLM.☆16Sep 11, 2024Updated last year
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit☆92Jan 26, 2026Updated 2 weeks ago
- Framework to achieve context distillation in LLMs☆15Nov 24, 2023Updated 2 years ago
- ☆18Mar 4, 2025Updated 11 months ago
- 基于 mxnet, 实现 ssd demo for android☆14Oct 17, 2018Updated 7 years ago
- ☆13Jan 16, 2019Updated 7 years ago
- Fione is Enterprise AI Platform☆16Nov 9, 2025Updated 3 months ago
- (MacOS Support) OpenAI compatible http server for Spark-TTS☆15May 1, 2025Updated 9 months ago
- Model Slicing for Analytics with Elastic Inference Cost and Resource Constraints☆12Jul 6, 2023Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆20Jan 2, 2026Updated last month
- The driver for LMCache core to run in vLLM☆60Feb 4, 2025Updated last year
- creditmodel, 模型,评分卡,scorecard, vintage, automatic modeling☆11Aug 10, 2024Updated last year
- Spark library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs☆14Jul 31, 2025Updated 6 months ago
- How to use AI to generate unit tests☆16Updated this week
- Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)☆18Nov 24, 2022Updated 3 years ago
- ArcticInference: vLLM plugin for high-throughput, low-latency inference☆391Updated this week
- ☆13Jun 20, 2019Updated 6 years ago