See vLLM official support: https://github.com/vllm-project/vllm-ascend
☆11Feb 5, 2025Updated last year
Alternatives and similar repositories for vllm-ascend
Users that are interested in vllm-ascend are comparing it to the libraries listed below.
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆52Mar 25, 2026Updated last month
- Welcome to the official repository of AC-LORA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs, a mechanism that provides tr…☆21Nov 14, 2025Updated 5 months ago
- ☆39Apr 28, 2026Updated last week
- ☆24Updated this week
- Dockerfiles for Ascend CANN☆50Updated this week
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆70Updated this week
- Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.☆23Jun 17, 2025Updated 10 months ago
- PyTorch: training an EfficientNet model with pseudo-labels☆11Dec 28, 2019Updated 6 years ago
- ☆12Sep 7, 2024Updated last year
- HPSF website☆13Oct 29, 2024Updated last year
- Rust King OS - Linux Distro☆15Sep 9, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Apr 13, 2026Updated 3 weeks ago
- The code for paper 'Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tr…☆21Aug 18, 2023Updated 2 years ago
- SGLang kernel library for NPU☆128Updated this week
- douban SDK in node.js☆45Oct 26, 2016Updated 9 years ago
- Repository for implementation of active learning and semi-supervised learning algorithms and applying them to medical imaging datasets☆16May 17, 2021Updated 4 years ago
- ☆22Jun 1, 2025Updated 11 months ago
- Code for undergraduate thesis "Active Learning for Deep Object Detection".☆14Nov 12, 2023Updated 2 years ago
- ☆33Apr 19, 2025Updated last year
- A lightweight benchmark utility for PySpark☆20Jan 25, 2020Updated 6 years ago
- Zoom in Lesions for Better Diagnosis: Attention Guided Deformation Network for WCE Image Classification☆13Aug 4, 2020Updated 5 years ago
- Misc ceph tools☆33Nov 25, 2015Updated 10 years ago
- A simple LaTeX template for CUHK thesis.☆16Apr 24, 2023Updated 3 years ago
- High-performance LLM operator library built on TileLang.☆118Updated this week
- A NFC card reader for Campus card of NEU ( China )☆12Mar 13, 2021Updated 5 years ago
- ☆20Jun 13, 2025Updated 10 months ago
- This repository has been merged into Rust中文 (Rust Chinese)☆23May 15, 2020Updated 5 years ago
- A basic S3 compatible storage server in Rust☆13Aug 25, 2021Updated 4 years ago
- vLLM adapter for a TGIS-compatible gRPC server.☆55Updated this week
- Nightly Build for LMDeploy☆11Jan 28, 2025Updated last year
- Ctrl-Z for the filesystem☆15Jun 11, 2019Updated 6 years ago
- Community maintained hardware plugin for vLLM on Ascend☆2,035Updated this week
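- Simulates logging in to the Northeastern University (China) Academic Affairs Office website and retrieves all student information; may no longer work as that website has been updated☆11Mar 2, 2019Updated 7 years ago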
- Agent skills for vLLM☆67Apr 3, 2026Updated last month
- RPG^2 is a pure-software system that operates on running C/C++ programs, profiling them, injecting prefetch instructions, and then tuning…☆12May 15, 2024Updated last year
- ☆53Mar 15, 2025Updated last year
- Survey for Distribution Shift☆19Jun 1, 2021Updated 4 years ago
- Experiment with Res2Net. https://arxiv.org/abs/1904.01169☆13Apr 4, 2019Updated 7 years ago
- ☆19Jun 8, 2022Updated 3 years ago