akx / ollama-dlLinks
Download models from the Ollama library, without Ollama
☆89Updated 8 months ago
Alternatives and similar repositories for ollama-dl
Users that are interested in ollama-dl are comparing it to the libraries listed below
Sorting:
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆154Updated 2 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆127Updated 2 years ago
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 5 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆185Updated this week
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆255Updated 2 weeks ago
- Lightweight Inference server for OpenVINO☆188Updated this week
- Local LLM Server with GPU and NPU Acceleration☆206Updated this week
- ☆88Updated last year
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆116Updated last year
- InferX is a Inference Function as a Service Platform☆116Updated 2 weeks ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆134Updated last week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆67Updated 2 weeks ago
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆76Updated this week
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆98Updated this week
- Link you Ollama models to LM-Studio☆140Updated last year
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆48Updated last month
- Prompt Jinja2 templates for LLMs☆32Updated last week
- Extract structured data from local or remote LLM models☆42Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆257Updated 4 months ago
- Pybind11 bindings for Whisper.cpp☆58Updated 2 weeks ago
- Easily access your Ollama models within LMStudio☆113Updated last year
- automatically quant GGUF models☆187Updated this week
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm…☆72Updated 2 weeks ago
- ☆204Updated last month
- ☆95Updated 6 months ago
- Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs.☆257Updated 3 weeks ago
- Something similar to Apple Intelligence?☆61Updated last year
- LLM inference in C/C++☆78Updated 3 weeks ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆78Updated 9 months ago