akx / ollama-dlLinks
Download models from the Ollama library, without Ollama
☆121Updated last year
Alternatives and similar repositories for ollama-dl
Users that are interested in ollama-dl are comparing it to the libraries listed below
Sorting:
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆170Updated 9 months ago
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated last year
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆329Updated 2 weeks ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆238Updated 3 weeks ago
- Aggregates compute from spare GPU capacity☆189Updated last week
- ☆209Updated 3 weeks ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated 2 years ago
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆290Updated last week
- Code execution utilities for Open WebUI & Ollama☆318Updated last year
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm…☆146Updated 2 months ago
- ☆109Updated 5 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50Updated 8 months ago
- Extract structured data from local or remote LLM models☆54Updated last year
- LLM inference in C/C++☆104Updated last week
- Link you Ollama models to LM-Studio☆150Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated last week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆119Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- Docs for GGUF quantization (unofficial)☆361Updated 6 months ago
- Something similar to Apple Intelligence?☆60Updated last year
- Web UI for ExLlamaV2☆513Updated 11 months ago
- A simple to use Ollama autocompletion engine with options exposed and streaming functionality☆144Updated 9 months ago
- a curated collection of models ready-to-use with LocalAI☆268Updated last year
- automatically quant GGUF models☆219Updated last month
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆266Updated 10 months ago
- Handy tool to measure the performance and efficiency of LLMs workloads.☆73Updated 9 months ago
- Export and Backup Ollama models into GGUF and ModelFile☆89Updated last year
- ☆51Updated 3 months ago
- Prompt Jinja2 templates for LLMs☆35Updated 6 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆192Updated last month