arlo-phoenix / CTranslate2-rocm
Fast inference engine for Transformer models
☆17Updated 2 months ago
Alternatives and similar repositories for CTranslate2-rocm:
Users that are interested in CTranslate2-rocm are comparing it to the libraries listed below
- Core, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs☆27Updated last month
- 8-bit CUDA functions for PyTorch Rocm compatible☆39Updated 9 months ago
- Web UI for ExLlamaV2☆460Updated 3 weeks ago
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆16Updated 3 months ago
- ☆164Updated this week
- An OAI compatible exllamav2 API that's both lightweight and fast☆733Updated this week
- Simple monkeypatch to boost AMD Navi 3 GPUs☆31Updated 8 months ago
- ☆37Updated last year
- transparent proxy server for llama.cpp's server to provide automatic model swapping☆135Updated this week
- Croco.Cpp is a 3rd party testground for KoboldCPP, a simple one-file way to run various GGML/GGUF models with KoboldAI's UI. (for Croco.C…☆92Updated this week
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆205Updated this week
- ☆40Updated last year
- ☆90Updated 8 months ago
- My personal fork of koboldcpp where I hack in experimental samplers.☆43Updated 8 months ago
- ☆44Updated 9 months ago
- A utility that uses Whisper to transcribe videos and various translation APIs to translate the transcribed text and save them as SRT (sub…☆67Updated 4 months ago
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆184Updated 2 months ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelization☆51Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆217Updated last month
- llama.cpp fork with additional SOTA quants and improved performance☆126Updated this week
- Easily view and modify JSON datasets for large language models☆68Updated 3 months ago
- ROCm docker images with fixes/support for extra architectures, such as gfx803/gfx1010.☆25Updated last year
- A pipeline parallel training script for LLMs.☆116Updated this week
- A fast batching API to serve LLM models☆177Updated 8 months ago
- LLM Frontend in a single html file☆305Updated this week
- Pillow plugin for JPEG-XL, using Rust for bindings.☆28Updated last week
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆50Updated 2 months ago
- Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp offline. Speak with local LLMs.☆52Updated 2 months ago
- A Gradio UI for XTTSv2 and RVC.☆156Updated 7 months ago
- Python bindings for whisper.cpp☆203Updated 2 weeks ago