See vLLM official support: https://github.com/vllm-project/vllm-ascend
☆11 · Feb 5, 2025 · Updated last year
Alternatives and similar repositories for vllm-ascend
Users that are interested in vllm-ascend are comparing it to the libraries listed below.
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation ☆53 · Updated this week
- Welcome to the official repository of AC-LORA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs, a mechanism that provides tr… ☆21 · Nov 14, 2025 · Updated 6 months ago
- ☆42 · Updated this week
- ☆24 · May 22, 2026 · Updated last week
- Dockerfiles for Ascend CANN ☆55 · Updated this week
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854) ☆72 · May 13, 2026 · Updated 2 weeks ago
- Get up and running with Llama 3, Mistral, Gemma 2, and other large language models. ☆23 · Jun 17, 2025 · Updated 11 months ago
- PyTorch: training an EfficientNet model with pseudo-labels ☆11 · Dec 28, 2019 · Updated 6 years ago
- ☆13 · Sep 7, 2024 · Updated last year
- HPSF website ☆13 · Oct 29, 2024 · Updated last year
- Rust King OS - Linux Distro ☆15 · Sep 9, 2023 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆13 · May 15, 2026 · Updated 2 weeks ago
- The code for paper 'Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tr… ☆21 · Aug 18, 2023 · Updated 2 years ago
- ☆10 · Apr 29, 2026 · Updated last month
- douban SDK in node.js ☆45 · Oct 26, 2016 · Updated 9 years ago
- SGLang kernel library for NPU ☆137 · May 21, 2026 · Updated last week
- Repository for implementation of active learning and semi-supervised learning algorithms and applying them to medical imaging datasets ☆16 · May 17, 2021 · Updated 5 years ago
- ☆22 · Jun 1, 2025 · Updated 11 months ago
- Code for undergraduate thesis "Active Learning for Deep Object Detection". ☆14 · Nov 12, 2023 · Updated 2 years ago
- ☆33 · Apr 19, 2025 · Updated last year
- A lightweight benchmark utility for PySpark ☆20 · Jan 25, 2020 · Updated 6 years ago
- Zoom in Lesions for Better Diagnosis: Attention Guided Deformation Network for WCE Image Classification ☆13 · Aug 4, 2020 · Updated 5 years ago
- Misc ceph tools ☆33 · Nov 25, 2015 · Updated 10 years ago
- A simple LaTeX template for CUHK thesis. ☆17 · Apr 24, 2023 · Updated 3 years ago
- An NFC card reader for the campus card of NEU (China) ☆12 · Mar 13, 2021 · Updated 5 years ago
- High-performance LLM operator library built on TileLang. ☆125 · Updated this week
- ☆19 · Jun 13, 2025 · Updated 11 months ago
- This repository has been merged into Rust中文 (Rust Chinese) ☆23 · May 15, 2020 · Updated 6 years ago
- A basic S3 compatible storage server in Rust ☆13 · Aug 25, 2021 · Updated 4 years ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆55 · Updated this week
- Nightly Build for LMDeploy ☆11 · Jan 28, 2025 · Updated last year
- Ctrl-Z for the filesystem ☆15 · Jun 11, 2019 · Updated 6 years ago
- Community maintained hardware plugin for vLLM on Ascend ☆2,117 · May 21, 2026 · Updated last week
- Simulates logging in to the Northeastern University (China) academic affairs website and retrieves all student information; may no longer work now that the website has been updated ☆11 · Mar 2, 2019 · Updated 7 years ago
- RPG^2 is a pure-software system that operates on running C/C++ programs, profiling them, injecting prefetch instructions, and then tuning… ☆13 · May 15, 2024 · Updated 2 years ago
- ☆53 · Mar 15, 2025 · Updated last year
- Survey for Distribution Shift ☆19 · Jun 1, 2021 · Updated 4 years ago
- Agent skills for vLLM ☆71 · Apr 3, 2026 · Updated last month
- Experiment with Res2Net. https://arxiv.org/abs/1904.01169 ☆13 · Apr 4, 2019 · Updated 7 years ago