aidatatools / ollama-benchmarkLinks
LLM Benchmark for Throughput via Ollama (Local LLMs)
☆303Updated 2 months ago
Alternatives and similar repositories for ollama-benchmark
Users that are interested in ollama-benchmark are comparing it to the libraries listed below
Sorting:
- A proxy server for multiple ollama instances with Key security☆515Updated 2 weeks ago
- Handy tool to measure the performance and efficiency of LLMs workloads.☆72Updated 6 months ago
- Code execution utilities for Open WebUI & Ollama☆302Updated 11 months ago
- Download models from the Ollama library, without Ollama☆104Updated 11 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆411Updated 5 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆119Updated last year
- Benchmark llm performance☆105Updated last year
- A simple to use Ollama autocompletion engine with options exposed and streaming functionality☆137Updated 6 months ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆159Updated 5 months ago
- Link you Ollama models to LM-Studio☆145Updated last year
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 9 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆160Updated 6 months ago
- beep boop 🤖 (experimental)☆115Updated 9 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆262Updated 7 months ago
- Nginx proxy server in a Docker container to Authenticate & Proxy requests to Ollama from Public Internet via Cloudflare Tunnel☆144Updated last month
- InferX: Inference as a Service Platform☆137Updated this week
- QA-Pilot is an interactive chat project that leverages online/local LLM for rapid understanding and navigation of GitHub code repository.☆313Updated 2 months ago
- OpenAPI Tool Servers☆722Updated last month
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆276Updated 2 months ago
- A platform to self-host AI on easy mode☆171Updated last week
- Open‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. …☆396Updated this week
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆784Updated 2 weeks ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆107Updated last week
- Dolphin System Messages☆353Updated 8 months ago
- Create Linux commands from natural language, in the shell.☆116Updated 2 months ago
- ☆104Updated 2 months ago
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆226Updated this week
- LLM plugin providing access to models running on an Ollama server☆340Updated 2 weeks ago
- Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models fo…☆242Updated 6 months ago
- Web UI and API for managing MCP Orchestrator (mcpo) instances and configurations☆120Updated 5 months ago