See vLLM official support: https://github.com/vllm-project/vllm-ascend
☆11Feb 5, 2025Updated last year
Alternatives and similar repositories for vllm-ascend
Users that are interested in vllm-ascend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆52Mar 25, 2026Updated 3 weeks ago
- ☆34Updated this week
- Welcome to the official repository of AC-LORA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs, a mechanism that provides tr…☆21Nov 14, 2025Updated 5 months ago
- ☆24Updated this week
- Dockerfiles for Ascend CANN☆46Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆66Apr 9, 2026Updated last week
- Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.☆23Jun 17, 2025Updated 10 months ago
- Pytorch--使用伪标签训练efficientNet模型☆11Dec 28, 2019Updated 6 years ago
- ☆12Sep 7, 2024Updated last year
- HPSF website☆13Oct 29, 2024Updated last year
- Rust King OS - Linux Distro☆15Sep 9, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Updated this week
- The code for paper 'Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tr…☆21Aug 18, 2023Updated 2 years ago
- SGLang kernel library for NPU☆125Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- douban SDK in node.js☆45Oct 26, 2016Updated 9 years ago
- High-performance LLM operator library built on TileLang.☆102Updated this week
- Repository for implementation of active learning and semi-supervised learning algorithms and applying them to medical imaging datasets☆16May 17, 2021Updated 4 years ago
- ☆21Jun 1, 2025Updated 10 months ago
- Code for undergraduate thesis "Active Learning for Deep Object Detection".☆14Nov 12, 2023Updated 2 years ago
- ☆32Apr 19, 2025Updated last year
- A lightweight benchmark utility for PySpark☆20Jan 25, 2020Updated 6 years ago
- Zoom in Lesions for Better Diagnosis: Attention Guided Deformation Network for WCE Image Classification☆13Aug 4, 2020Updated 5 years ago
- Misc ceph tools☆33Nov 25, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A simple LaTeX template for CUHK thesis.☆16Apr 24, 2023Updated 2 years ago
- A NFC card reader for Campus card of NEU ( China )☆12Mar 13, 2021Updated 5 years ago
- ☆20Jun 13, 2025Updated 10 months ago
- 该仓库已经合并到Rust中文☆23May 15, 2020Updated 5 years ago
- A basic S3 compatible storage server in Rust☆13Aug 25, 2021Updated 4 years ago
- vLLM adapter for a TGIS-compatible gRPC server.☆55Apr 12, 2026Updated last week
- Community maintained hardware plugin for vLLM on Ascend☆1,937Updated this week
- Nightly Build for LMDeploy☆11Jan 28, 2025Updated last year
- Ctrl-Z for the filesystem☆15Jun 11, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Agent skills for vLLM☆59Apr 3, 2026Updated 2 weeks ago
- 模拟东北大学教务处网站登录 并获取全部学生信息 目前可能随着教务处网站的更新变得不可用☆11Mar 2, 2019Updated 7 years ago
- RPG^2 is a pure-software system that operates on running C/C++ programs, profiling them, injecting prefetch instructions, and then tuning…☆12May 15, 2024Updated last year
- ☆54Mar 15, 2025Updated last year
- Weka Monitoring via Grafana, Prometheus, etc.☆12Mar 6, 2026Updated last month
- Survey for Distribution Shift☆19Jun 1, 2021Updated 4 years ago
- Experiment with Res2Net. https://arxiv.org/abs/1904.01169☆13Apr 4, 2019Updated 7 years ago