aidatatools / ollama-benchmarkLinks
LLM Benchmark for Throughput via Ollama (Local LLMs)
☆323Updated this week
Alternatives and similar repositories for ollama-benchmark
Users that are interested in ollama-benchmark are comparing it to the libraries listed below
Sorting:
- Download models from the Ollama library, without Ollama☆119Updated last year
- Code execution utilities for Open WebUI & Ollama☆314Updated last year
- A proxy server for multiple ollama instances with Key security☆565Updated 2 months ago
- Handy tool to measure the performance and efficiency of LLMs workloads.☆74Updated 8 months ago
- A simple to use Ollama autocompletion engine with options exposed and streaming functionality☆140Updated 9 months ago
- Link you Ollama models to LM-Studio☆151Updated last year
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 11 months ago
- A open webui function for better R1 experience☆78Updated 10 months ago
- Nginx proxy server in a Docker container to Authenticate & Proxy requests to Ollama from Public Internet via Cloudflare Tunnel☆158Updated 4 months ago
- Benchmark llm performance☆108Updated last year
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆435Updated last month
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆274Updated this week
- OpenAPI Tool Servers☆805Updated 3 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆127Updated this week
- Aggregates compute from spare GPU capacity☆184Updated 2 weeks ago
- InferX: Inference as a Service Platform☆146Updated this week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆118Updated last year
- ☆109Updated 5 months ago
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm…☆142Updated 2 months ago
- Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models fo…☆244Updated 9 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆279Updated 2 weeks ago
- Parse files (e.g. code repos) and websites to clipboard or a file for ingestions by AI / LLMs☆359Updated last month
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50Updated 8 months ago
- QA-Pilot is an interactive chat project that leverages online/local LLM for rapid understanding and navigation of GitHub code repository.☆317Updated 4 months ago
- beep boop 🤖 (experimental)☆118Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Updated 10 months ago
- Open Deep Researcher with openai compatible endpoint, now completely local with ollama, local playwright via searxng with citations and p…☆152Updated 9 months ago
- plug whisper audio transcription to a local ollama server and ouput tts audio responses☆364Updated 3 months ago
- Create Linux commands from natural language, in the shell.☆121Updated 4 months ago
- ☆180Updated last year