lhl / strix-halo-testing
☆138 · Updated 3 weeks ago
Alternatives and similar repositories for strix-halo-testing
Users interested in strix-halo-testing are comparing it to the repositories listed below
- ☆524 · Updated this week
- Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama, but purpose-built and deeply optimized for AMD NPUs. ☆451 · Updated this week
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, embedding, and rerank models over OpenAI endpoints. ☆241 · Updated last week
- Linux distro for AI computers. Go from bare-metal GPUs to running AI workloads - like vLLM, SGLang, RAG, and agents - in minutes, fully a… ☆315 · Updated 2 months ago
- Fresh builds of llama.cpp with AMD ROCm™ 7 acceleration ☆103 · Updated this week
- llama.cpp fork with additional SOTA quants and improved performance ☆1,329 · Updated this week
- Reliable model swapping for any local OpenAI-compatible server (llama.cpp, vLLM, etc.) ☆1,899 · Updated this week
- AI cluster deployed with Ansible on random computers with random capabilities ☆273 · Updated 2 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work ☆278 · Updated 3 months ago
- llama.cpp fork with additional SOTA quants and improved performance ☆21 · Updated this week
- reddacted lets you analyze & sanitize your online footprint using LLMs, PII detection & sentiment analysis to identify anything that migh… ☆112 · Updated 3 months ago
- A daemon that automatically manages the performance states of NVIDIA GPUs. ☆97 · Updated 2 weeks ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di… ☆117 · Updated this week
- ☆226 · Updated 6 months ago
- GPU Power and Performance Manager ☆61 · Updated last year
- A cross-platform desktop application that lets you chat with locally hosted LLMs, with features like MCP support ☆225 · Updated 3 months ago
- ☆49 · Updated last month
- Lemonade helps users run local LLMs with the highest performance by configuring state-of-the-art inference engines for their NPUs and GPU… ☆1,622 · Updated this week
- Generate and execute command-line commands using an LLM ☆50 · Updated 9 months ago
- LLM Client, Server API, and UI ☆393 · Updated this week
- Mem0 Integration with OpenWebUI ☆46 · Updated last week
- Docs for GGUF quantization (unofficial) ☆312 · Updated 4 months ago
- ☆257 · Updated 5 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs ☆571 · Updated last week
- A persistent local memory for AI, LLMs, or Copilot in VS Code. ☆170 · Updated 3 weeks ago
- A tool to determine whether your PC can run a given LLM ☆164 · Updated 9 months ago
- No-code CLI designed for accelerating ONNX workflows ☆216 · Updated 5 months ago
- A web application that converts speech to speech, 100% private ☆81 · Updated 5 months ago
- LLM Benchmark for Throughput via Ollama (Local LLMs) ☆311 · Updated 3 months ago
- Interactive, locally hosted tool to migrate Open-WebUI SQLite databases to PostgreSQL ☆175 · Updated last month