arlo-phoenix / CTranslate2-rocmLinks
Fast inference engine for Transformer models
☆50Updated 11 months ago
Alternatives and similar repositories for CTranslate2-rocm
Users that are interested in CTranslate2-rocm are comparing it to the libraries listed below
Sorting:
- ☆411Updated 6 months ago
- AI Inferencing at the Edge. A simple one-file way to run various GGML models with KoboldAI's UI with AMD ROCm offloading☆702Updated 2 weeks ago
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆212Updated last week
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆226Updated this week
- llama.cpp fork with additional SOTA quants and improved performance☆1,277Updated this week
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,071Updated 2 weeks ago
- Reliable model swapping for any local OpenAI compatible server - llama.cpp, vllm, etc☆1,764Updated this week
- 8-bit CUDA functions for PyTorch☆66Updated last month
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆514Updated this week
- Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆104Updated 6 months ago
- Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.☆378Updated this week
- A complete package that provides you with all the components needed to get started of dive deeper into Machine Learning Workloads on Cons…☆40Updated this week
- ROCm docker images with fixes/support for extra architectures, such as gfx803/gfx1010.☆31Updated 2 years ago
- ☆84Updated 3 weeks ago
- General Site for the GFX803 ROCm Stuff☆120Updated 2 months ago
- Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak with local LLMs via llama.c…☆147Updated 3 months ago
- ☆481Updated last week
- Fresh builds of llama.cpp with AMD ROCm™ 7 acceleration☆79Updated this week
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆1,129Updated this week
- Web UI for ExLlamaV2☆511Updated 8 months ago
- LLM Frontend in a single html file☆654Updated 9 months ago
- AMD APU compatible Ollama. Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆116Updated 2 weeks ago
- ☆234Updated 2 years ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆826Updated 8 months ago
- Fork of ollama for vulkan support☆105Updated 8 months ago
- ☆47Updated 2 years ago
- DEPRECATED!☆50Updated last year
- ROCm docker images with fixes/support for legecy architecture gfx803. eg.Radeon RX 590/RX 580/RX 570/RX 480☆76Updated 5 months ago
- AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs.☆677Updated last week
- build scripts for ROCm☆186Updated last year