nlzy / vllm-gfx906
vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
☆372 · Dec 29, 2025 · Updated last month
Alternatives and similar repositories for vllm-gfx906
Users who are interested in vllm-gfx906 are comparing it to the libraries listed below
- triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60 ☆40 · Dec 8, 2025 · Updated 2 months ago
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler ☆32 · Dec 15, 2025 · Updated 2 months ago
- FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs ☆65 · May 4, 2025 · Updated 9 months ago
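- Hybrid-inference extension plugin for vllm, supporting multi-NUMA hybrid inference; single-GPU inference of the Qwen3-Next model can reach 1000+ prefill ☆31 · Nov 7, 2025 · Updated 3 months ago
- Pure C++ cross-platform LLM acceleration library, callable from Python; supports chatglm-6B, llama, baichuan, and moss base models, on x86 / ARM ☆12 · Jan 30, 2026 · Updated 2 weeks ago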
- LM inference server implementation based on *.cpp. ☆295 · Nov 24, 2025 · Updated 2 months ago
- Nix Flake for personal NixOS system containing both NixOS config and home-manager config. For personal use, but made available for sharin… ☆25 · Updated this week
- llama.cpp fork with additional SOTA quants and improved performance ☆1,626 · Updated this week
- A skin smoothing filter to beautify faces. ☆16 · Jan 18, 2021 · Updated 5 years ago
- Web version of ShellgetBot ☆12 · May 10, 2021 · Updated 4 years ago
- Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang. ☆4,505 · Updated this week
- A small guide to help user correctly passthrough their GPUs to an unprivileged LXC container ☆28 · Mar 12, 2025 · Updated 11 months ago
- Control your computer with a voice interface ☆28 · Nov 12, 2025 · Updated 3 months ago
- LLM inference in C/C++ ☆21 · Mar 22, 2025 · Updated 10 months ago
- ☆51 · Oct 1, 2025 · Updated 4 months ago
- Aka bulk job offer search / bulk job applications mailer ☆25 · Aug 3, 2024 · Updated last year
- Exposes internet search tools for use by LLM-backed Assist in Home Assistant ☆66 · Feb 9, 2026 · Updated last week
- Acceleration for large-model inference frameworks; make LLMs fly ☆24 · May 10, 2024 · Updated last year
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra… ☆55 · Feb 3, 2026 · Updated 2 weeks ago
- Open source tool for transcription and subtitling, alternative to happyscribe. ☆33 · Feb 12, 2025 · Updated last year
- ☆31 · Apr 19, 2025 · Updated 9 months ago
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc ☆2,374 · Feb 8, 2026 · Updated last week
- A simple WeChat Official Account layout tool based on Dify ☆16 · Jun 27, 2025 · Updated 7 months ago
- Proxy service that connects the dify framework to a ragflow external knowledge base ☆32 · Feb 22, 2025 · Updated 11 months ago
- The High Performance LLM Native Mock Server ☆17 · Jan 8, 2026 · Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆29 · Jan 19, 2025 · Updated last year
- A webapi project built with asp.net core 2.x + mysql + jwt ☆12 · Jun 22, 2020 · Updated 5 years ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di… ☆142 · Updated this week
- MuseScore 3 plugin that expands chord symbols into notes. ☆10 · May 29, 2020 · Updated 5 years ago
- HealthiVert-GAN, a novel deep-learning framework designed to generate pseudo-healthy vertebral images. These images simulate the pre-frac… ☆11 · Nov 3, 2025 · Updated 3 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte… ☆25 · Jan 6, 2026 · Updated last month
- ☆28 · Dec 4, 2025 · Updated 2 months ago
- ☆11 · Aug 29, 2025 · Updated 5 months ago
- firefox addon which allows the user to toggle javascript ☆12 · Nov 16, 2016 · Updated 9 years ago
- 100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation… ☆35 · Oct 22, 2025 · Updated 3 months ago
- Wakeword Installer for Home Assistant ☆19 · Jun 1, 2025 · Updated 8 months ago
- Self-hosted web panel for managing Hysteria 2 proxy servers. Features HTTP authentication, auto node setup via SSH, server groups, load b… ☆22 · Jan 23, 2026 · Updated 3 weeks ago
- Electroneum Classic ☆10 · Oct 30, 2018 · Updated 7 years ago
- Write the database metadata into the dify knowledge ☆12 · Dec 30, 2025 · Updated last month