lemonade-sdk / llamacpp-rocmLinks

Fresh builds of llama.cpp with AMD ROCm™ 7 acceleration

☆129

Alternatives and similar repositories for llamacpp-rocm

Users that are interested in llamacpp-rocm are comparing it to the libraries listed below

Sorting:

SearchSavior / OpenArc
Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.
☆260Updated last week
FastFlowLM / FastFlowLM
Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.
☆488Updated last week
kyuz0 / amd-strix-halo-toolboxes
☆612Updated this week
lhl / strix-halo-testing
☆154Updated last month
iuliaturc / gguf-docs
Docs for GGUF quantization (unofficial)
☆330Updated 4 months ago
nktice / AMD-AI
AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1
☆216Updated last week
k-koehler / gguf-tensor-overrider
☆50Updated last month
crashr / gppm
GPU Power and Performance Manager
☆62Updated last year
ikawrakow / ik_llama.cpp
llama.cpp fork with additional SOTA quants and improved performance
☆1,358Updated this week
TesslateAI / Agent-Builder
☆195Updated 3 months ago
amd / gaia
Run LLM Agents on Ryzen AI PCs in Minutes
☆792Updated this week
nlzy / vllm-gfx906
vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
☆338Updated this week
perk11 / large-model-proxy
Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…
☆84Updated last week
platinum-hill / cobolt
This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support
☆226Updated 3 months ago
rhulha / Speech2Speech
A web application that converts speech to speech 100% private
☆81Updated 6 months ago
Viceman256 / TensorTune
KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning
☆29Updated 6 months ago
sasha0552 / nvidia-pstated
A daemon that automatically manages the performance states of NVIDIA GPUs.
☆100Updated last month
turboderp-org / exllamav3
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
☆588Updated this week
kalavai-net / kalavai-client
A platform to self-host AI on easy mode
☆178Updated this week
atineiatte / deep-research-at-home
☆228Updated 7 months ago
Thireus / GGUF-Tool-Suite
Input your VRAM and RAM and the toolchain will produce a GGUF model tuned to your system within seconds — flexible model sizing and lowes…
☆66Updated this week
savantskie / persistent-ai-memory
A persistent local memory for AI, LLMs, or Copilot in VS Code.
☆175Updated last month
TesslateAI / TFrameX
☆176Updated 3 months ago
theroyallab / YALS
☆87Updated 2 weeks ago
lamikr / rocm_sdk_builder
☆418Updated 8 months ago
mostlygeek / llama-swap
Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc
☆1,977Updated last week
intel / llm-scaler
☆90Updated last week
7ozzam / cohere-toolkit-with-openai
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆29Updated 10 months ago
onnx / turnkeyml
No-code CLI designed for accelerating ONNX workflows
☆219Updated 5 months ago
pwilkin / llama-runner
Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends
☆48Updated 3 months ago