vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
☆403Feb 20, 2026Updated last month
Alternatives and similar repositories for vllm-gfx906
Users that are interested in vllm-gfx906 are comparing it to the libraries listed below.
- triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆45Dec 8, 2025Updated 4 months ago
- ML software (llama.cpp, ComfyUI, vLLM) builds for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆157Updated this week
- Hybrid inference extension plugin for vLLM, supporting multi-NUMA hybrid inference; single-GPU inference of the Qwen3-Next model can reach 1000+ prefill☆32Nov 7, 2025Updated 5 months ago
- ☆11Dec 23, 2022Updated 3 years ago
- A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.☆4,832Updated this week
- Development environment in Docker.☆17Jun 21, 2016Updated 9 years ago
- The High Performance LLM Native Mock Server☆25Updated this week
- Proxy for OpenAI☆16Sep 2, 2025Updated 7 months ago
- ☆65Mar 10, 2026Updated last month
- Smart OpenAI‑compatible proxy for llama.cpp: manages slots, saves/restores KV cache to disk, routes requests by prefix similarity, and pr…☆38Nov 14, 2025Updated 5 months ago
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆15Sep 3, 2025Updated 7 months ago
- ☆86Mar 23, 2026Updated 3 weeks ago
- Role-playing based on the RWKV model; actually a version of RWKV_Role_Playing modified beyond recognition☆17Aug 17, 2023Updated 2 years ago
- Snipaste for Android☆10Aug 1, 2023Updated 2 years ago
- Modifications made to Qt for Snipaste.☆11Dec 5, 2024Updated last year
- Raspberry Pi qwen-omni voice assistant, no TTS/STT required☆16Apr 4, 2025Updated last year
- Extension for Forge-based UIs (Forge, reForge, etc) and ComfyUI to replace CFG with Negative Rejection Steering☆16Feb 14, 2026Updated 2 months ago
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆186Aug 23, 2025Updated 7 months ago
- Fork of ollama for vulkan support☆20Apr 16, 2025Updated last year
- ☆15Apr 1, 2024Updated 2 years ago
- ☆32Apr 19, 2025Updated last year
- Agentic BYOK Browser-Based Website Builder☆42Updated this week
- The real GPT-4 with image access (You probably don't have access)☆12Mar 17, 2023Updated 3 years ago
- Aka bulk job offer search / bulk job applications mailer☆25Aug 3, 2024Updated last year
- ☆83Feb 28, 2025Updated last year
- A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations☆16,953Apr 9, 2026Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Jan 19, 2025Updated last year
- A small guide to help user correctly passthrough their GPUs to an unprivileged LXC container☆28Mar 12, 2025Updated last year
- An MCP server for Tekla that facilitates interaction with Tekla Structures, allowing users to speed-up modeling processes☆27Updated this week
- Get up and running with Llama 3, Mistral, Gemma, and other large language models, with added support for more AMD GPUs.☆1,720Updated this week
- A multi-interface (REST and MCP) server for automatic license plate recognition 🚗☆22Dec 2, 2025Updated 4 months ago
- minimalistic AI library that resembles HF's transformers☆13Dec 31, 2024Updated last year
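- fastllm is a high-performance LLM inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and hybrid-mode inference for MoE models; any GPU with 10 GB or more of VRAM can run the full DeepSeek model. A dual-socket 9004/9005 server plus a single GPU can deploy the original full-size, full-precision DeepSeek model at 20 tps with single concurrency; the INT4-quantized model reaches 30tp… at single concurrency☆4,189Apr 10, 2026Updated last week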
- Cluster API implementation for Incus and LXD☆97Apr 5, 2026Updated 2 weeks ago
- A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-…☆39Jul 5, 2025Updated 9 months ago
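- Implements Home Assistant on the ESP32, integrating with platforms such as Xiaomi, Xiaodu, Tuya, and Tmall Genie; exposes an MCP interface for LLM calls to control all devices in the home (actively maintained, stars welcome)☆33Nov 19, 2025Updated 5 months ago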
- Analyze suspicious files and URLs to detect types of malware☆11May 14, 2020Updated 5 years ago
- Snag web pages like a polite robot with a browser☆27Apr 9, 2026Updated last week
- This repo focuses on supervised and self-supervised bio-sequence representation learning☆22Oct 11, 2023Updated 2 years ago