nano-vllm是开源的一个gpu推理项目,基于开源版本弄的一个ascend npu版本推理小demo,旨在帮助初学者了解推理的整体流程,区别于vllm,nano-vllm体量更小,麻雀虽小五脏俱全,更有助于初学者学习。
☆113May 4, 2026Updated last month
Alternatives and similar repositories for nano-vllm-ascend
Users that are interested in nano-vllm-ascend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 1688营销 Skill —— 帮助商家进行招商活动报名、查看商机推荐等营销操作。 核心工具能力:招商活动查询、商品建议价查询、活动报名提交、商机推荐查询。 触发词:报名活动、招商活动、查询活动、提报、报名、活动报名、查看建议价、商机推荐、商机、市场机会、找商机、查商机,不…☆99May 6, 2026Updated last month
- Towards Instance Segmentation with Polygon Detection Transformer.☆110Mar 10, 2026Updated 3 months ago
- ☆61May 25, 2026Updated last month
- Extract Med Data and Construct KG , Provide Q&A☆43Apr 16, 2025Updated last year
- WHU-CS-Courses-Notes☆84Mar 22, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆91May 20, 2026Updated last month
- ☆52May 15, 2026Updated last month
- ☆121Jun 11, 2026Updated 3 weeks ago
- Self-deployed auth for Cloudflare Workers and D1: email/password login, magic links, verification, password reset, secure sessions, CLI s…☆77Jun 25, 2026Updated last week
- Development containers for triton and triton-cpu☆28Jun 23, 2026Updated last week
- ☆32May 14, 2026Updated last month
- Based on your current lab repository, design your experiement panel.☆41Jun 26, 2026Updated last week
- Demonstrate once, execute anywhere — secure remote skills for AI agents.☆216Jun 12, 2026Updated 2 weeks ago
- ☆126Jun 10, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Panorama Module for custom timers in Dota 2☆17Mar 30, 2017Updated 9 years ago
- PRISM-VL studies measurement-grounded VLM learning with RAW-derived Meas.-XYZ inputs, camera-conditioned grounding, and exposure-brackete…☆661May 27, 2026Updated last month
- This project creates a bridge between BWAPI for StarCraft: Brood War and EIS-enabled Multi-Agent Systems like GOAL.☆19Dec 15, 2020Updated 5 years ago
- consolidate known Starcraft2 (SC2) maps for use by developers creating bots, AI agents or other code-based projects.☆15Oct 8, 2018Updated 7 years ago
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆30Jun 23, 2025Updated last year
- Complete ETCLOVG framework for AI Agent workflows - DAG+FSM orchestration, Ebbinghaus memory, discipline routing, skill evolution, trace …☆100May 31, 2026Updated last month
- xKV: Cross-Layer SVD for KV-Cache Compression☆51Jun 21, 2026Updated last week
- ☆54Dec 4, 2025Updated 6 months ago
- Implement some method of LLM KV Cache Sparsity☆41Jun 6, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆83Jun 23, 2025Updated last year
- Omni_Infer is a suite of inference accelerators designed for the Ascend NPU platform, offering native support and an expanding feature se…☆123Updated this week
- 实现国产算力大模型零门槛部署,一键跑通 Qwen、GLM-4.7、Minimax-2.1、DeepSeek-OCR 等模型☆326Jun 11, 2026Updated 3 weeks ago
- An implementation of the StarCraft: Brood War game engine☆33Jun 1, 2017Updated 9 years ago
- ☆91Oct 17, 2025Updated 8 months ago
- Practice for nw☆44Oct 14, 2016Updated 9 years ago
- Tile-Based Runtime for Ultra-Low-Latency LLM Inference☆1,496Jun 8, 2026Updated 3 weeks ago
- awesome SAE papers☆78May 24, 2025Updated last year
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Aug 25, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ZZZKBot is a bot (AI) for Starcraft: Broodwar. It is designed to compete against other bots. It is not designed to compete against humans…☆60Oct 23, 2025Updated 8 months ago
- 这是极客时间苏玲老师的《玩转Git三剑客》笔记☆100Aug 1, 2022Updated 3 years ago
- ☆83Jan 28, 2025Updated last year
- Learning TileLang with 10 puzzles!☆326May 28, 2026Updated last month
- 基于单目视觉原理,研究目标图像的预处理、识别、定位方法与测距模型,设计实现一个目标识别与定位测距原型系统。☆107Apr 18, 2020Updated 6 years ago
- Repository for analysis and experiments in the BigCode project.☆126Mar 20, 2024Updated 2 years ago
- AI for StarCraft: Brood War☆82Dec 8, 2022Updated 3 years ago