This project aims to provide a high effective KV cache manage framework for llm inference and improve memory utilization and inference speed.
☆52Apr 24, 2026Updated 2 weeks ago
Alternatives and similar repositories for nano-kvllm
Users that are interested in nano-kvllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official codes and implementations of the UnGSL method in the paper: **"Uncertainty-aware Graph Structure Learning"**.☆14Feb 28, 2025Updated last year
- ☆21Feb 24, 2025Updated last year
- The website of ZLST☆11Jul 28, 2025Updated 9 months ago
- the code of MoG☆21Aug 6, 2024Updated last year
- The official codes and implementations of HimGNN model in paper:"HimGNN:a novel hierarchical molecular representations learning framewor…☆23Aug 30, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official Code For EMNLP2025 Findings: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Le…☆10Dec 25, 2025Updated 4 months ago
- This is the official training code of OmniSVG☆40Jan 19, 2026Updated 3 months ago
- Machine Learning with Graphs (Chinese) http://web.stanford.edu/class/cs224w/☆14Apr 14, 2020Updated 6 years ago
- Code for reproducing experiments in "On the Ability of Graph Neural Networks to Model Interactions Between Vertices"☆25Oct 14, 2023Updated 2 years ago
- Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster…☆11Nov 25, 2020Updated 5 years ago
- 本项目是关于Harness Engineering的开源教程,旨在帮助开发者理解和掌握在大模型时代,如何为复杂、长时间运行的 AI 智能体(Agent)构建健壮的底层运行架构。☆101Apr 25, 2026Updated 2 weeks ago
- [NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding.☆53Jan 28, 2026Updated 3 months ago
- VS Code port of PyCharm's Darcula syntax theme w/ Light & Dark GUI options, MagicPython support, Jinja & Django template support, and o…☆16Jan 11, 2026Updated 3 months ago
- Summary of PingCap tinykv camp. No codes presented.☆22May 9, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is the official code for ZLST-Project, Generative Recommendation Benchmark☆65Apr 30, 2026Updated last week
- a tiny c++ reflect library☆15Jul 10, 2023Updated 2 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago
- netty源码中文注释☆18Jun 1, 2023Updated 2 years ago
- Popular Trading System Collection.☆24May 10, 2020Updated 5 years ago
- A distributed in-memory database featuring flexible deployment, dynamic scalability, and rapid construction.☆17Apr 28, 2024Updated 2 years ago
- ☆12Jan 10, 2025Updated last year
- GEMM☆10Aug 26, 2023Updated 2 years ago
- Teamfight Tactics: Mobile Synergy Calculator☆20Apr 16, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 一个归纳了RocketMQ常见使用方法的学习项目☆18Mar 1, 2023Updated 3 years ago
- 动手学ROS2系列教程☆15Aug 26, 2021Updated 4 years ago
- A Swift package for performing native SNMP queries. Also includes some ASN.1 decoders.☆12Dec 24, 2022Updated 3 years ago
- DataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。☆23Jan 31, 2018Updated 8 years ago
- 视频营销号工具☆26Oct 8, 2023Updated 2 years ago
- ☆11Sep 21, 2022Updated 3 years ago
- django 源码剖析☆22Dec 5, 2020Updated 5 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆30Apr 27, 2026Updated last week
- Train and run transformers directly on Apple's Neural Engine in Swift bypass coreml entirely☆104Apr 18, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The missing layer between idea and code.☆28Feb 5, 2026Updated 3 months ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Updated this week
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 7 months ago
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆32May 24, 2025Updated 11 months ago
- A curated list of awesome graph structure learning approaches☆41Nov 24, 2024Updated last year
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 8 months ago
- 一个基于百度翻译API的智能文章论文降重工具,通过多语言转换实现文本论文降重与AIGC降重,支持多种降重模式☆44May 25, 2025Updated 11 months ago