This project aims to provide a highly efficient KV cache management framework for LLM inference, improving memory utilization and inference speed.
☆61 · Apr 24, 2026 · Updated last month
Alternatives and similar repositories for nano-kvllm
Users who are interested in nano-kvllm are comparing it to the libraries listed below.
- The official code and implementation of the UnGSL method from the paper **"Uncertainty-aware Graph Structure Learning"**. ☆14 · Feb 28, 2025 · Updated last year
- ☆21 · Feb 24, 2025 · Updated last year
- The website of ZLST ☆11 · Jul 28, 2025 · Updated 10 months ago
- The code of MoG ☆22 · Aug 6, 2024 · Updated last year
- The official code and implementation of the HimGNN model from the paper "HimGNN: a novel hierarchical molecular representations learning framewor… ☆23 · Aug 30, 2023 · Updated 2 years ago
- Official code for EMNLP 2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le… ☆10 · Dec 25, 2025 · Updated 5 months ago
- This is the official training code of OmniSVG ☆40 · Jan 19, 2026 · Updated 4 months ago
- Machine Learning with Graphs (Chinese) http://web.stanford.edu/class/cs224w/ ☆14 · Apr 14, 2020 · Updated 6 years ago
- Code for reproducing experiments in "On the Ability of Graph Neural Networks to Model Interactions Between Vertices" ☆25 · Oct 14, 2023 · Updated 2 years ago
- Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster… ☆11 · Nov 25, 2020 · Updated 5 years ago
- VS Code port of PyCharm's Darcula syntax theme w/ Light & Dark GUI options, MagicPython support, Jinja & Django template support, and o… ☆16 · Jan 11, 2026 · Updated 5 months ago
- This is the official code for ZLST-Project, Generative Recommendation Benchmark ☆66 · May 25, 2026 · Updated 3 weeks ago
- Summary of PingCap tinykv camp. No code presented. ☆22 · May 9, 2023 · Updated 3 years ago
- [NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding. ☆62 · Jan 28, 2026 · Updated 4 months ago
- A tiny C++ reflection library ☆16 · Jul 10, 2023 · Updated 2 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆19 · Feb 9, 2026 · Updated 4 months ago
- Netty source code with Chinese annotations ☆18 · Jun 1, 2023 · Updated 3 years ago
- Popular Trading System Collection. ☆24 · May 10, 2020 · Updated 6 years ago
- A distributed in-memory database featuring flexible deployment, dynamic scalability, and rapid construction. ☆17 · Apr 28, 2024 · Updated 2 years ago
- ☆12 · Jan 10, 2025 · Updated last year
- GEMM ☆10 · Aug 26, 2023 · Updated 2 years ago
- Teamfight Tactics: Mobile Synergy Calculator ☆20 · Apr 16, 2024 · Updated 2 years ago
- A learning project that summarizes common RocketMQ usage patterns ☆18 · May 30, 2026 · Updated 2 weeks ago
- Hands-on ROS 2 tutorial series ☆16 · Aug 26, 2021 · Updated 4 years ago
- A Swift package for performing native SNMP queries. Also includes some ASN.1 decoders. ☆11 · Dec 24, 2022 · Updated 3 years ago
- DataX is a widely used offline data synchronization tool/platform for heterogeneous data. It provides efficient data synchronization between heterogeneous data sources including MySQL, Oracle, SqlServer, Postgre, HDFS, Hive, ADS, HBase, OTS, ODPS, and others. ☆23 · Jan 31, 2018 · Updated 8 years ago
- A tool for video marketing accounts ☆26 · Oct 8, 2023 · Updated 2 years ago
- ☆11 · May 16, 2026 · Updated last month
- Django source code analysis ☆22 · Dec 5, 2020 · Updated 5 years ago
- An open-source tutorial on Harness Engineering, intended to help developers understand and master how to build a robust underlying runtime architecture for complex, long-running AI agents in the era of large models. ☆155 · Apr 25, 2026 · Updated last month
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA. ☆33 · May 26, 2026 · Updated 3 weeks ago
- Train and run transformers directly on Apple's Neural Engine in Swift, bypassing Core ML entirely ☆116 · Apr 18, 2026 · Updated 2 months ago
- The missing layer between idea and code. ☆28 · Feb 5, 2026 · Updated 4 months ago
- 🎓 Automatically update circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours) ☆10 · Jun 8, 2026 · Updated last week
- A curated list of awesome graph structure learning approaches ☆41 · Nov 24, 2024 · Updated last year
- My tests and experiments with some popular DL frameworks. ☆17 · Sep 11, 2025 · Updated 9 months ago
- An intelligent tool, based on the Baidu Translate API, for reducing duplication in articles and papers. It uses multi-language conversion to lower text similarity and AIGC detection rates, and supports multiple reduction modes. ☆44 · May 25, 2025 · Updated last year
- GEMV implementation with CUTLASS ☆21 · Aug 21, 2025 · Updated 9 months ago
- ☆14 · Nov 3, 2025 · Updated 7 months ago