FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs
☆65May 4, 2025Updated 10 months ago
Alternatives and similar repositories for vllm-rocm
Users that are interested in vllm-rocm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler☆32Dec 15, 2025Updated 3 months ago
- ☆434Apr 4, 2025Updated 11 months ago
- ML software (llama.cpp, ComfyUI, vLLM) builds for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆133Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆117Updated this week
- LLM inference in C/C++☆21Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Sep 4, 2024Updated last year
- Tries to UI development. Clone of https://www.perplexity.ai/☆11Sep 30, 2023Updated 2 years ago
- BC backport for 1.12.1 including world map markers.☆10Oct 8, 2025Updated 5 months ago
- ☆15Jul 21, 2025Updated 8 months ago
- Example of creating a user at runtime for a dynamic Docker image☆13Feb 2, 2023Updated 3 years ago
- offline realtime subtitle for mac☆23Nov 18, 2025Updated 4 months ago
- A Repo dedicated to resources regarding the overclocking of DDR5 Memory☆14Jan 27, 2022Updated 4 years ago
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall…☆46Mar 12, 2026Updated 2 weeks ago
- ☆17Jan 31, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This Elgg plugin lets users preview MS Office files (doc, docx, xls, xlsx, ppt, pptx), Apple iWork pages, Adobe eps, and zip files using …☆12Aug 28, 2015Updated 10 years ago
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in…☆18Nov 11, 2024Updated last year
- super-Django-CC is a simle web interface for commoncrawl.org☆15Dec 8, 2022Updated 3 years ago
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆31Nov 7, 2025Updated 4 months ago
- Modern self-hosting panel.☆15Jan 1, 2025Updated last year
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆64Mar 16, 2026Updated 2 weeks ago
- Rust bindings to the Knot Resolver library (also known as libkres)☆18Apr 2, 2019Updated 6 years ago
- ☆10Aug 13, 2012Updated 13 years ago
- Log4j_dos_CVE-2021-45105☆13Dec 19, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- TBC/WotLK AddOn: Displays all the spells you use (and miss). Made for streamers and moviemakers☆15Apr 29, 2024Updated last year
- Experimental WebGPU plugin for Flutter☆11Jan 11, 2021Updated 5 years ago
- 🌸De-inflect Japanese words☆15Nov 24, 2025Updated 4 months ago
- An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…☆11Jan 23, 2026Updated 2 months ago
- ☆29Updated this week
- Zenpower3 is a Linux kernel driver for reading temperature, voltage(SVI2), current(SVI2) and power(SVI2) for AMD Zen family CPUs, now wit…☆36Dec 20, 2025Updated 3 months ago
- IME for Mac.☆10Mar 19, 2026Updated last week
- A port of the RWKV v7 language model, implemented with the Burn deep learning framework☆14Jun 9, 2025Updated 9 months ago
- ☆10May 5, 2021Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Repository for the course "JavaScript Object Oriented Programming"☆11Jun 30, 2019Updated 6 years ago
- A wayland compositor with an epic tech stack☆11Aug 28, 2020Updated 5 years ago
- ☆13Dec 26, 2022Updated 3 years ago
- Deep learning inference SW framework based on TensorFlow Lite for Aarch64 Linux with GPU and Hexagon delegate☆13Mar 11, 2025Updated last year
- A TUI for Managing and Searching with Meilisearch☆20Aug 26, 2025Updated 7 months ago
- Bare-metal Rust explorations of the Allwinner D1☆17Oct 25, 2022Updated 3 years ago
- hopfield☆30Oct 8, 2021Updated 4 years ago