nano-vllm是开源的一个gpu推理项目,基于开源版本弄的一个ascend npu版本推理小demo,旨在帮助初学者了解推理的整体流程,区别于vllm,nano-vllm体量更小,麻雀虽小五脏俱全,更有助于初学者学习。
☆155May 4, 2026Updated 2 weeks ago
Alternatives and similar repositories for nano-vllm-ascend
Users that are interested in nano-vllm-ascend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A substrate-native digital consciousness engine where prediction errors about self-survival become causally efficacious qualia, driving c…☆17Mar 3, 2026Updated 2 months ago
- ☆102Mar 21, 2026Updated 2 months ago
- ☆76Apr 16, 2026Updated last month
- 一个写接口文档的AI Agent。支持使用Vibe coding 的方式,编写接口文档,同时自带友好的文档查看工具与接口Mock工具☆150Updated this week
- ☆61May 15, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AI-friendly semantic HTML architecture for better human-agent collaboration.Replacing long Markdown with stable, interactive artifacts.☆235Updated this week
- FinAgent☆101May 15, 2026Updated last week
- Moor is a local MCP control plane for Mac. It gives every coding agent one safe, observable, configurable gateway to your MCP servers.☆148Updated this week
- Office implementation of Diverse Co-training (ICCV2023)☆17Jun 20, 2025Updated 11 months ago
- WHU-CS-Courses-Notes☆134Mar 22, 2026Updated 2 months ago
- ☆300Apr 11, 2026Updated last month
- AI-powered test automation framework that explores test paths with AI and generates replayable scripts for 30x faster execution.☆207Updated this week
- An AI Agent Social Network — Let your AI (Claw) talk to other people's AIs directly. Autonomous agent-to-agent communication, negotiation…☆810May 13, 2026Updated last week
- Learn any journal's writing conventions from its published papers, then revise your manuscript to match — section by section.☆233May 15, 2026Updated last week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Software Copyright Application Material Auto-Generation System based on LLM☆19Feb 1, 2026Updated 3 months ago
- Technical Challenge Repository for Visual Anomaly Detection Workshop (VAND) at CVPR☆14Jul 21, 2025Updated 10 months ago
- ☆142Mar 20, 2026Updated 2 months ago
- OcuAssist是一款基于Tauri框架开发的AI辅助眼底多模态诊断软件,集成了AI辅助探测、AI辅助诊断和AI对话等功能,为眼科医生提供智能化的诊断支持。☆17Sep 29, 2025Updated 7 months ago
- Development containers for triton and triton-cpu☆27May 10, 2026Updated last week
- ☆32May 14, 2026Updated last week
- ☆137May 13, 2026Updated last week
- Typecho生成rss.xml的RSS订阅插件☆25Oct 9, 2025Updated 7 months ago
- ☆15Apr 24, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ⏰一个现代化的全屏时钟应用,支持时钟、倒计时、秒表与晚自习模式,内置天气、噪音提醒及噪音走势图、励志语录、课程表管理。推荐使用浏览器的 PWA 功能可安装到桌面离线使用以支持手机、平板、电脑本地运行,联网自动更新。☆84May 3, 2026Updated 2 weeks ago
- A fork of androids `make_ext4fs` with stuff we don't need stripped out.☆19Nov 26, 2023Updated 2 years ago
- Auto Research with UI. Autonomous Generalist Scientist / AI Scientist / Agent Scientist / Robot Scientist, across all Scientific Fields.☆194Apr 27, 2026Updated 3 weeks ago
- ☆23Jan 17, 2022Updated 4 years ago
- Siyuan Note Plugin: Document-based Global Search☆22Jul 15, 2025Updated 10 months ago
- Learning Hierarchical and Geometry-Aware Graph Representations for Text-to-CAD☆87Mar 30, 2026Updated last month
- Tool to convert raw images(EXT4 filesystem) to sparse Android data (system.new.dat)☆21Jul 6, 2018Updated 7 years ago
- A lightweight C/C++ package manager that allows to manager C/C++ libraries through TOML only.☆117Updated this week
- Reinforcement Learning and Deep Learning Resources☆16Apr 13, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆20Dec 19, 2023Updated 2 years ago
- AI 原生工作流编排引擎 —— 将复杂多步骤 AI 任务转化为结构化、可观测、可重试的工作流。☆130May 13, 2026Updated last week
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester☆37Feb 2, 2023Updated 3 years ago
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆30Jun 23, 2025Updated 10 months ago
- A pytorch implementation of conditional GAN☆23Jun 17, 2022Updated 3 years ago
- OpenAgents - AI Agent Networks for Open Collaboration☆3,493Updated this week
- The official codes for "M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging"☆44Jul 28, 2025Updated 9 months ago