cloudmercato / ollama-benchmark
Handy tool to measure the performance and efficiency of LLM workloads.
☆73 · Updated 8 months ago
Alternatives and similar repositories for ollama-benchmark
Users who are interested in ollama-benchmark are comparing it to the repositories listed below:
- LLM Benchmark for Throughput via Ollama (Local LLMs) ☆319 · Updated last week
- Code execution utilities for Open WebUI & Ollama ☆312 · Updated last year
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint. ☆183 · Updated 11 months ago
- Nginx proxy server in a Docker container to Authenticate & Proxy requests to Ollama from Public Internet via Cloudflare Tunnel ☆154 · Updated 4 months ago
- Generate and execute command line commands using LLM ☆51 · Updated 10 months ago
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm… ☆138 · Updated 2 months ago
- This repository contains custom pipelines developed for the OpenWebUI framework, including advanced workflows such as long-term memory fi… ☆80 · Updated 7 months ago
- ☆28 · Updated last year
- A proxy server for multiple ollama instances with Key security ☆553 · Updated last month
- Download models from the Ollama library, without Ollama ☆118 · Updated last year
- Create Linux commands from natural language, in the shell. ☆118 · Updated 4 months ago
- ☆99 · Updated last week
- Open Deep Researcher with openai compatible endpoint, now completely local with ollama, local playwright via searxng with citations and p… ☆151 · Updated 9 months ago
- This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support ☆226 · Updated 4 months ago
- Web UI and API for managing MCP Orchestrator (mcpo) instances and configurations ☆127 · Updated 7 months ago
- Guide on text completion large language model fine-tuning, including example scripts and training data acquiring. ☆86 · Updated 10 months ago
- Easily access your Ollama models within LMStudio ☆127 · Updated last year
- MCP Server for SearXNG ☆376 · Updated last month
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work ☆282 · Updated 4 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second. ☆222 · Updated 4 months ago
- OpenAPI Tool Servers ☆792 · Updated 3 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper. ☆89 · Updated 11 months ago
- Streamline Coding & Speed Up Dev Process. Your Own Personal Senior Engineer For Free! ☆133 · Updated last year
- beep boop 🤖 (experimental) ☆118 · Updated 11 months ago
- LM inference server implementation based on *.cpp. ☆294 · Updated last month
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di… ☆124 · Updated 2 weeks ago
- A simple to use Ollama autocompletion engine with options exposed and streaming functionality ☆139 · Updated 8 months ago
- The Fastest Way to Fine-Tune LLMs Locally ☆332 · Updated 2 weeks ago
- Notate is a desktop chat application that takes AI conversations to the next level. It combines the simplicity of chat with advanced feat… ☆263 · Updated 10 months ago
- ☆173 · Updated last year