kyuz0 / amd-strix-halo-vllm-toolboxes
☆48Updated 2 months ago
Alternatives and similar repositories for amd-strix-halo-vllm-toolboxes
Users interested in amd-strix-halo-vllm-toolboxes are comparing it to the libraries listed below.
- LLM Fine Tuning Toolbox images for Ryzen AI 395+ Strix Halo☆34Updated 2 months ago
- Ampere optimized llama.cpp☆28Updated last month
- How to build an ACP compliant agent that uses MCP as well!☆11Updated 6 months ago
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model☆66Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆161Updated 6 months ago
- The easiest & fastest way to run LLMs in your home lab☆71Updated 3 months ago
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆247Updated 3 weeks ago
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Updated 5 months ago
- InferX: Inference as a Service Platform☆139Updated this week
- GPU Power and Performance Manager☆61Updated last year
- ✅ Iterative Transparent Reasoning System by chonkyDB ✅ combining reasoning, graph and vector for trustworthy, explainable and smart LLMs …☆35Updated 5 months ago
- Use smol agents to do research and then update CSV columns with its findings.☆41Updated 9 months ago
- No-code CLI designed for accelerating ONNX workflows☆216Updated 5 months ago
- ☆79Updated last month
- ☆146Updated 3 weeks ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- Intel® AI Assistant Builder☆128Updated this week
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa…☆47Updated 8 months ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆187Updated last year
- A platform to self-host AI on easy mode☆177Updated last week
- ☆176Updated 3 months ago
- Route LLM requests to the best model for the task at hand.☆133Updated last week
- Sparse Inferencing for transformer based LLMs☆213Updated 3 months ago
- Open Deep Researcher with openai compatible endpoint, now completely local with ollama, local playwright via searxng with citations and p…☆146Updated 8 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆84Updated last month
- Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web …☆65Updated last week
- EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU☆43Updated last year
- ☆223Updated last month
- For individual users, watsonx Code Assistant can access a local IBM Granite model☆37Updated 5 months ago