LMCache on Ascend
☆49Feb 25, 2026Updated last week
Alternatives and similar repositories for LMCache-Ascend
Users that are interested in LMCache-Ascend are comparing it to the libraries listed below
Sorting:
- Systematic and comprehensive benchmarks for LLM systems.☆51Jan 28, 2026Updated last month
- Protocol buffers and other common resources.☆13Updated this week
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆13Dec 31, 2024Updated last year
- ☆123Feb 24, 2026Updated last week
- Word template for a Lancaster University thesis☆11Mar 19, 2022Updated 3 years ago
- [ICML 2025] Efficiently Serving Large Multimodal Models Using EPD Disaggregation☆22May 29, 2025Updated 9 months ago
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆13Jul 12, 2025Updated 7 months ago
- A simple tool for parsing the profile.json file of mxnet☆14Aug 1, 2018Updated 7 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- ☆13Jan 7, 2025Updated last year
- SGLang kernel library for NPU☆101Updated this week
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- Python module to compute the Mann-Kendall test for trend in time series data☆10Apr 18, 2017Updated 8 years ago
- ☆10Feb 16, 2022Updated 4 years ago
- 2023 XFlops Training☆13Jan 23, 2024Updated 2 years ago
- A statistical framework for graph anomaly detection.☆17Sep 23, 2018Updated 7 years ago
- ☆13Nov 21, 2024Updated last year
- AutoML 2024: HPOD: Hyperparameter Optimization for Unsupervised Outlier Detection☆12Jul 12, 2024Updated last year
- A rust wrapper for HIP☆12Jun 10, 2025Updated 8 months ago
- clustering algorithm implementation☆13Nov 3, 2025Updated 4 months ago
- Data Plane Development Kit☆12Nov 10, 2025Updated 3 months ago
- Resources on how to use the HEC at Lancaster University☆14Jan 11, 2022Updated 4 years ago
- ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/vide…☆21May 5, 2024Updated last year
- Empowering everyone to create reliable and safety AI coding agent.☆12Sep 2, 2024Updated last year
- ☆11Apr 21, 2020Updated 5 years ago
- Ampere CentOS kernel☆18Jul 16, 2024Updated last year
- Fione is Enterprise AI Platform☆16Nov 9, 2025Updated 3 months ago
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit☆92Jan 26, 2026Updated last month
- ☆10Mar 14, 2020Updated 5 years ago
- A conversion tool between scala types and protobuf-java types.☆12Dec 21, 2021Updated 4 years ago
- ☆20Oct 24, 2024Updated last year
- Graph model execution API for Candle☆17Jul 27, 2025Updated 7 months ago
- The Open-Source Implementation of Cognition AI's Automated Software Engineer, Devin.☆16Mar 13, 2024Updated last year
- ☆18Mar 4, 2025Updated last year
- Model Slicing for Analytics with Elastic Inference Cost and Resource Constraints☆12Jul 6, 2023Updated 2 years ago
- An ecosystem of Rust libraries for working with large language models☆14Oct 2, 2023Updated 2 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- Yet another coding assistant powered by LLM.☆16Sep 11, 2024Updated last year