9cb14c1ec0 / ollama-vulkanLinks
Fork of ollama for vulkan support
☆20Updated 7 months ago
Alternatives and similar repositories for ollama-vulkan
Users that are interested in ollama-vulkan are comparing it to the libraries listed below
Sorting:
- Fork of ollama for vulkan support☆110Updated 9 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆84Updated last week
- Input your VRAM and RAM and the toolchain will produce a GGUF model tuned to your system within seconds — flexible model sizing and lowes…☆66Updated this week
- Golang web client for Ollama, fast and easy to use.☆30Updated 4 months ago
- Add-on for the Web Search extension that provides the web browsing capabilities without the need for Extras API.☆46Updated 4 months ago
- Download models from the Ollama library, without Ollama☆115Updated last year
- ☆18Updated 11 months ago
- Minimalist stable-diffusion desktop application with only one executable file writen with golang ( No python ).☆18Updated 7 months ago
- 🤖 AI-powered CLI for file reorganization. Runs fully locally — no data leaves your machine.☆19Updated 5 months ago
- ☆107Updated this week
- ☆20Updated last year
- Ollama model direct link generator and installer.☆222Updated 10 months ago
- A multi engine TTS & LLM edge computing playground with audio book features and more!☆27Updated last week
- AirLLM 70B inference with single 4GB GPU☆14Updated 5 months ago
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i…☆155Updated 2 months ago
- Croco.Cpp is fork of KoboldCPP infering GGML/GGUF models on CPU/Cuda with KoboldAI's UI. It's powered partly by IK_LLama.cpp, and compati…☆153Updated this week
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools☆112Updated 5 months ago
- Convert downloaded Ollama models back into their GGUF equivalent format☆66Updated 11 months ago
- Userspace KSM helper daemon (CachyOS branding)☆28Updated 10 months ago
- ☆126Updated last year
- Make abliterated models with transformers, easy and fast☆101Updated this week
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆19Updated 3 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51Updated 6 months ago
- Image synthesis using machine learning☆22Updated 7 months ago
- LLM inference in C/C++☆23Updated last year
- ☆94Updated 5 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated 10 months ago
- SVGBench: A challenging LLM benchmark that tests knowledge, coding, physical reasoning capabilities of LLMs.☆57Updated last week
- Benchmarking tool for vLLM inference performance with GPU monitoring☆34Updated 2 weeks ago
- A library and CLI utilities for managing performance states of NVIDIA GPUs.☆31Updated last year