kyuz0 / amd-strix-halo-toolboxesLinks
☆415Updated last week
Alternatives and similar repositories for amd-strix-halo-toolboxes
Users that are interested in amd-strix-halo-toolboxes are comparing it to the libraries listed below
Sorting:
- ☆114Updated last week
- Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.☆378Updated this week
- Fresh builds of llama.cpp with AMD ROCm™ 7 acceleration☆79Updated this week
- Reliable model swapping for any local OpenAI compatible server - llama.cpp, vllm, etc☆1,764Updated this week
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆226Updated this week
- Lemonade helps users run local LLMs with the highest performance by configuring state-of-the-art inference engines for their NPUs and GPU…☆1,512Updated this week
- Linux distro for AI computers. Go from bare-metal GPUs to running AI workloads - like vLLM, SGLang, RAG, and Agents - in minutes, fully a…☆307Updated last month
- AMD APU compatible Ollama. Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆116Updated last week
- AI Cluster deployed with Ansible on Random computers with random capabilities☆252Updated last month
- ☆253Updated 4 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆1,277Updated this week
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆212Updated this week
- Run LLM Agents on Ryzen AI PCs in Minutes☆684Updated last week
- This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support☆224Updated 2 months ago
- A beautiful local-first coding agent running in your terminal - built by the community for the community ⚒☆766Updated this week
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆303Updated 2 months ago
- ☆1,177Updated this week
- Manifold is a platform for enabling workflow automation using AI assistants.☆464Updated this week
- ☆409Updated 6 months ago
- Docs for GGUF quantization (unofficial)☆293Updated 3 months ago
- ☆226Updated 5 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆107Updated last week
- InferX: Inference as a Service Platform☆137Updated this week
- OpenAPI Tool Servers☆722Updated last month
- General Tool-calling API Proxy☆52Updated 2 months ago
- ☆180Updated last month
- A platform to self-host AI on easy mode☆171Updated last week
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆96Updated last month
- Web UI and API for managing MCP Orchestrator (mcpo) instances and configurations☆120Updated 5 months ago
- Interactive, locally hosted tool to migrate Open-WebUI SQLite databases to PostgreSQL☆166Updated 3 weeks ago