vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
☆383Feb 20, 2026Updated 2 weeks ago
Alternatives and similar repositories for vllm-gfx906
Users who are interested in vllm-gfx906 are comparing it to the libraries listed below
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler☆32Dec 15, 2025Updated 2 months ago
- llama.cpp-gfx906☆104Feb 14, 2026Updated 3 weeks ago
- FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs☆65May 4, 2025Updated 10 months ago
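- Adds MI25/MI50/MI60 support to Triton 3.2.0☆14Apr 26, 2025Updated 10 months ago
- Hybrid-inference extension plugin for vLLM; supports multi-NUMA hybrid inference, and single-GPU inference of the Qwen3-Next model can reach 1000+ prefill☆31Nov 7, 2025Updated 4 months ago
- Pure C++ cross-platform LLM acceleration library, callable from Python; supports the chatglm-6B, llama, baichuan, and moss base models; x86 / ARM☆12Jan 30, 2026Updated last month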
- ROCm Container 6.2 with PyTorch 2.4 for ComfyUI with RX570/RX580/RX590 aka Polaris AMD GPU Support☆12Feb 8, 2025Updated last year
- LM inference server implementation based on *.cpp.☆296Nov 24, 2025Updated 3 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆1,756Updated this week
- A skin smoothing filter to beautify faces.☆15Jan 18, 2021Updated 5 years ago
- Finetune and Inference Qwen3-0.6B.☆28May 5, 2025Updated 10 months ago
- Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.☆4,573Updated this week
- LLM inference in C/C++☆21Mar 22, 2025Updated 11 months ago
- A small guide to help users correctly pass through their GPUs to an unprivileged LXC container☆28Mar 12, 2025Updated 11 months ago
- Control your computer with a voice interface☆29Nov 12, 2025Updated 3 months ago
- ☆51Oct 1, 2025Updated 5 months ago
- KTransformers one-click deployment script☆58Apr 18, 2025Updated 10 months ago
- Aka bulk job offer search / bulk job applications mailer☆25Aug 3, 2024Updated last year
- ☆74Updated this week
- An open source GPU benchmarking project☆28Dec 2, 2024Updated last year
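- With this tool you can largely do away with MyBatis XML files: it automatically infers SQL from method names (inference, not generation), handles simple CRUD as well as complex join queries, and does not conflict with XML configuration, so individual methods can still be configured via XML at any time☆31Dec 11, 2023Updated 2 years ago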
- Coordinated Agent Team is a prompt-driven multi-agent system for autonomous software delivery. It defines clear agent roles, a determinis…☆31Feb 19, 2026Updated 2 weeks ago
- Acceleration for large-model inference frameworks; make LLMs fly☆24May 10, 2024Updated last year
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆56Feb 24, 2026Updated 2 weeks ago
- ☆31Apr 19, 2025Updated 10 months ago
- Exposes internet search tools for use by LLM-backed Assist in Home Assistant☆75Mar 3, 2026Updated last week
- ☆1,071Updated this week
- Open source tool for transcription and subtitling, alternative to happyscribe.☆34Feb 12, 2025Updated last year
- Presence, Temperature, Humidity, Air Quality multi sensor using DFRobot, SHT35 or BME680 sensors☆39Aug 9, 2023Updated 2 years ago
- A Complete Introduction to Building Generative AI Apps with Dify☆17May 25, 2025Updated 9 months ago
- The High Performance LLM Native Mock Server☆19Jan 8, 2026Updated 2 months ago
- Proxy service connecting the Dify framework to RAGFlow external knowledge bases☆32Feb 22, 2025Updated last year
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc☆2,716Mar 2, 2026Updated last week
- Container image for PiKVM☆38Jul 17, 2024Updated last year
- Workflow automation, but you just describe what you want and it happens.☆27Nov 22, 2025Updated 3 months ago
- Wakeword Installer for Home Assistant☆20Mar 2, 2026Updated last week
- ☆17Feb 4, 2026Updated last month
- Firefox add-on that allows the user to toggle JavaScript☆12Nov 16, 2016Updated 9 years ago