nktice / AMD-AILinks
AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1
☆216Updated last month
Alternatives and similar repositories for AMD-AI
Users that are interested in AMD-AI are comparing it to the libraries listed below
Sorting:
- AI Inferencing at the Edge. A simple one-file way to run various GGML models with KoboldAI's UI with AMD ROCm offloading☆725Updated this week
- ☆421Updated 9 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆613Updated this week
- Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.☆572Updated this week
- llama.cpp fork with additional SOTA quants and improved performance☆1,407Updated this week
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,103Updated 2 weeks ago
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆104Updated 2 months ago
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆267Updated this week
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆681Updated this week
- ☆718Updated last week
- ☆236Updated 2 years ago
- Web UI for ExLlamaV2☆514Updated 10 months ago
- Prebuilt Windows ROCm Libs for gfx1031 and gfx1032☆169Updated 9 months ago
- 8-bit CUDA functions for PyTorch☆69Updated 3 months ago
- 8-bit CUDA functions for PyTorch Rocm compatible☆41Updated last year
- Fresh builds of llama.cpp with AMD ROCm™ 7 acceleration☆155Updated this week
- AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs.☆720Updated 2 weeks ago
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc☆2,123Updated this week
- Run LLM Agents on Ryzen AI PCs in Minutes☆834Updated 2 weeks ago
- A manual for helping using tesla p40 gpu☆139Updated last year
- ☆87Updated 3 weeks ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- General Site for the GFX803 ROCm Stuff☆134Updated 4 months ago
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆24Updated last year
- DEPRECATED!☆50Updated last year
- Docker variants of oobabooga's text-generation-webui, including pre-built images.☆443Updated 2 months ago
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆52Updated 2 years ago
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆319Updated this week
- A complete package that provides you with all the components needed to get started of dive deeper into Machine Learning Workloads on Cons…☆44Updated 2 months ago
- Stable Diffusion Docker image preconfigured for usage with AMD Radeon cards☆143Updated last year