arlo-phoenix / CTranslate2-rocm
Fast inference engine for Transformer models
☆54 · Updated last year
Alternatives and similar repositories for CTranslate2-rocm
Users interested in CTranslate2-rocm are comparing it to the libraries listed below.
- ☆418 · Updated 8 months ago
- AMD (Radeon GPU) ROCm-based setup for popular AI tools on Ubuntu 24.04.1 ☆216 · Updated last week
- Simple monkeypatch to boost AMD Navi 3 GPUs ☆48 · Updated 7 months ago
- AI inferencing at the edge. A simple one-file way to run various GGML models with KoboldAI's UI, with AMD ROCm offloading ☆718 · Updated 2 weeks ago
- Inference engine for Intel devices. Serves LLMs, VLMs, Whisper, Kokoro-TTS, embedding, and rerank models over OpenAI endpoints. ☆260 · Updated last week
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs ☆588 · Updated this week
- ☆235 · Updated 2 years ago
- The official API server for ExLlama. OAI-compatible, lightweight, and fast. ☆1,096 · Updated 2 weeks ago
- Linux-based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000-series GPUs ☆106 · Updated 7 months ago
- Input text from speech in any Linux window, the lean, fast, and accurate way, using whisper.cpp OFFLINE. Speak with local LLMs via llama.c… ☆152 · Updated 4 months ago
- ROCm Docker images with fixes/support for extra architectures, such as gfx803/gfx1010 ☆31 · Updated 2 years ago
- ☆87 · Updated 2 weeks ago
- 8-bit CUDA functions for PyTorch ☆68 · Updated 2 months ago
- An OpenAI API-compatible text-to-speech server using Coqui AI's xtts_v2 and/or Piper TTS as the backend ☆837 · Updated 10 months ago
- Core, junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs ☆61 · Updated last month
- Build scripts for ROCm ☆188 · Updated last year
- ☆496 · Updated this week
- Web UI for ExLlamaV2 ☆514 · Updated 10 months ago
- llama.cpp fork with additional SOTA quants and improved performance ☆1,358 · Updated this week
- A daemon that automatically manages the performance states of NVIDIA GPUs ☆100 · Updated last month
- ☆48 · Updated 2 years ago
- Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama, but purpose-built and deeply optimized for AMD NPUs. ☆488 · Updated last week
- LLM frontend in a single HTML file ☆670 · Updated 3 weeks ago
- Reliable model swapping for any local OpenAI/Anthropic-compatible server (llama.cpp, vLLM, etc.) ☆1,977 · Updated last week
- 8-bit CUDA functions for PyTorch, ROCm-compatible ☆41 · Updated last year
- A complete package that provides you with all the components needed to get started or dive deeper into Machine Learning Workloads on Cons… ☆42 · Updated last month
- DEPRECATED! ☆50 · Updated last year
- Croco.Cpp is a fork of KoboldCPP for inferring GGML/GGUF models on CPU/CUDA with KoboldAI's UI. It's powered partly by IK_LLama.cpp, and compati… ☆153 · Updated this week
- CUDA on AMD GPUs ☆584 · Updated 3 months ago
- AMD APU-compatible Ollama. Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3, and other models. ☆133 · Updated last week