vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
☆414Feb 20, 2026Updated 3 months ago
Alternatives and similar repositories for vllm-gfx906
Users that are interested in vllm-gfx906 are comparing it to the libraries listed below.
- llama.cpp-gfx906☆130Mar 22, 2026Updated 2 months ago
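- Triton 3.2.0 with added MI25/MI50/MI60 support☆14Apr 26, 2025Updated last year
- vLLM hybrid-inference extension plugin with multi-NUMA hybrid inference support; single-GPU inference of Qwen3-Next models can reach 1000+ prefill☆33Nov 7, 2025Updated 6 months ago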
- llama.cpp fork with additional SOTA quants and improved performance☆2,554May 23, 2026Updated last week
- A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.☆5,052Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆120Updated this week
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆1,036May 22, 2026Updated last week
- Test programs for exploring ESP32☆13Jan 4, 2023Updated 3 years ago
- Intel Ethernet LAN driver for macOS☆13Oct 26, 2022Updated 3 years ago
- Lower Precision Floating Point Operations☆80Feb 22, 2026Updated 3 months ago
- ☆101Mar 23, 2026Updated 2 months ago
- Smart OpenAI‑compatible proxy for llama.cpp: manages slots, saves/restores KV cache to disk, routes requests by prefix similarity, and pr…☆40Nov 14, 2025Updated 6 months ago
- Extension for Forge-based UIs (Forge, reForge, etc) and ComfyUI to replace CFG with Negative Rejection Steering☆16May 16, 2026Updated 2 weeks ago
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆203Aug 23, 2025Updated 9 months ago
- Persistent, per-skill experience memory for Claude Code☆93May 2, 2026Updated 3 weeks ago
- Agentic BYOK Browser-Based Website Builder☆45Updated this week
- Raspberry Pi qwen-omni voice assistant, no TTS/STT required☆18Apr 4, 2025Updated last year
- The real GPT-4 with image access (You probably don't have access)☆12Mar 17, 2023Updated 3 years ago
- Aka bulk job offer search / bulk job applications mailer☆26Aug 3, 2024Updated last year
- A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations☆17,191May 21, 2026Updated last week
- ☆82Feb 28, 2025Updated last year
- ☆51Oct 1, 2025Updated 7 months ago
- ☆35Nov 11, 2025Updated 6 months ago
- Tutorial for building a DELL 7060 MFF Hackintosh☆17Jun 12, 2022Updated 3 years ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models, with added support for more AMD GPUs.☆1,754May 18, 2026Updated last week
- Nix Flake for personal NixOS system containing both NixOS config and home-manager config. For personal use, but made available for sharin…☆25May 17, 2026Updated last week
- ☆24Aug 26, 2025Updated 9 months ago
- Small 3D-printed Raspberry Pi NAS with support for up to 4 2.5" SSDs☆15Apr 22, 2023Updated 3 years ago
- minimalistic AI library that resembles HF's transformers☆13Dec 31, 2024Updated last year
- TPU support for the fastai library☆13Apr 15, 2021Updated 5 years ago
- Sherpa-onnx-tts-stt source for a Home Assistant add-on with Kroko Onnx Streaming STT integration.☆28Dec 18, 2025Updated 5 months ago
- Snag web pages like a polite robot with a browser☆27Apr 9, 2026Updated last month
- fastllm is a high-performance LLM inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and hybrid-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. A dual-socket 9004/9005 server plus a single GPU can deploy the original full-size, full-precision DeepSeek model at 20 tps single-concurrency; with the INT4-quantized model, single-concurrency 30tp…☆4,684May 21, 2026Updated last week
- A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-…☆41Jul 5, 2025Updated 10 months ago
- A polyphonic music transcription Vamp plugin☆10Nov 20, 2019Updated 6 years ago
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆27Feb 3, 2026Updated 3 months ago
- This repository is about implementing The Personality Cores Conversation System originally developed by Aperture Science, Inc. in the Por…☆24May 5, 2024Updated 2 years ago
- ☆14Sep 24, 2024Updated last year
- Encryption and signing for a post quantum world☆17Apr 4, 2023Updated 3 years ago