akx / ollama-dl
Download models from the Ollama library, without Ollama
☆84 · Updated 6 months ago
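A downloader like this works because the Ollama library is served from an OCI-style container registry: a manifest lists the model's layers (weights, template, parameters) as content-addressed blobs that plain HTTP can fetch. The sketch below illustrates that flow under stated assumptions — the `registry.ollama.ai` host and the `/v2/library/...` paths follow the OCI distribution spec's conventions and are not taken from this repository's code.

```python
# Minimal sketch of pulling a model from an OCI-style registry the way
# a tool such as ollama-dl might. Host and URL layout are ASSUMPTIONS
# based on the OCI distribution spec, not code from the repo.
import json
import urllib.request

REGISTRY = "https://registry.ollama.ai"  # assumed public registry host


def manifest_url(name: str, tag: str = "latest") -> str:
    """URL of the manifest that lists a model's layers (blobs)."""
    return f"{REGISTRY}/v2/library/{name}/manifests/{tag}"


def blob_url(name: str, digest: str) -> str:
    """URL of one content-addressed layer, e.g. the GGUF weights."""
    return f"{REGISTRY}/v2/library/{name}/blobs/{digest}"


def list_layers(name: str, tag: str = "latest") -> list:
    """Fetch the manifest and return its layer descriptors."""
    req = urllib.request.Request(
        manifest_url(name, tag),
        headers={"Accept": "application/vnd.docker.distribution.manifest.v2+json"},
    )
    with urllib.request.urlopen(req) as resp:
        manifest = json.load(resp)
    return manifest.get("layers", [])


if __name__ == "__main__":
    # Each layer descriptor carries a mediaType and a sha256 digest;
    # downloading the blobs by digest reconstructs the model on disk.
    for layer in list_layers("llama3"):
        print(layer["mediaType"], blob_url("llama3", layer["digest"]))
```

Because the blobs are addressed by digest, a partial download can be resumed and verified without any Ollama runtime involved — ordinary HTTP range requests and a sha256 check suffice.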
Alternatives and similar repositories for ollama-dl
Users interested in ollama-dl are comparing it to the repositories listed below.
- Tool to download models from Hugging Face Hub and convert them to GGML/GGUF for llama.cpp ☆148 · Updated last month
- A more memory-efficient rewrite of the HF Transformers implementation of Llama for use with quantized weights. ☆63 · Updated last year
- LLM inference in C/C++ ☆77 · Updated 3 weeks ago
- VSCode AI coding assistant powered by a self-hosted llama.cpp endpoint. ☆182 · Updated 4 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have" ☆48 · Updated 7 months ago
- ☆90 · Updated 5 months ago
- A stable, fast, and easy-to-use inference library with a focus on a sync-to-async API ☆45 · Updated 8 months ago
- Self-host LLMs with vLLM and BentoML ☆114 · Updated this week
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and… ☆47 · Updated 2 weeks ago
- Lightweight inference server for OpenVINO ☆176 · Updated last week
- ☆87 · Updated last year
- LM inference server implementation based on *.cpp. ☆203 · Updated this week
- Testing LLM reasoning abilities with lineage-relationship quizzes. ☆27 · Updated 2 months ago
- Extract structured data from local or remote LLM models ☆42 · Updated 11 months ago
- Review/check GGUF files and estimate the memory usage and maximum tokens per second. ☆173 · Updated this week
- A guidance compatibility layer for llama-cpp-python ☆34 · Updated last year
- Distributed inference for MLX LLMs ☆92 · Updated 10 months ago
- ☆71 · Updated last week
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a… ☆43 · Updated 8 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆72 · Updated 8 months ago
- Serving LLMs in the HF Transformers format via a PyFlask API ☆71 · Updated 8 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆123 · Updated last year
- Something similar to Apple Intelligence? ☆60 · Updated 11 months ago
- ☆18 · Updated 3 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline. ☆45 · Updated last year
- A Python package for serving LLMs on OpenAI-compatible API endpoints, with prompt caching, using MLX. ☆84 · Updated 5 months ago
- Pybind11 bindings for whisper.cpp ☆57 · Updated this week
- A browser GUI for nvidia-smi ☆18 · Updated 2 months ago
- Self-hosted LLM chatbot arena, with yourself as the only judge ☆41 · Updated last year
- Dagger functions to import Hugging Face GGUF models into a local Ollama instance and optionally push them to ollama.com. ☆115 · Updated last year