aidatatools / ollama-benchmark
LLM Benchmark for Throughput via Ollama (Local LLMs)
☆162 · Updated last week
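The benchmark's core metric, generation throughput in tokens per second, can be derived from the fields Ollama's `/api/generate` endpoint returns (`eval_count` is the number of generated tokens and `eval_duration` is reported in nanoseconds, per the public Ollama REST API). The sketch below is illustrative, not ollama-benchmark's actual code, and the sample values are made up:

```python
def tokens_per_second(resp: dict) -> float:
    """Throughput from an Ollama /api/generate response.

    eval_count  -- tokens generated
    eval_duration -- generation time in nanoseconds
    """
    return resp["eval_count"] / resp["eval_duration"] * 1e9

# In a real run you would POST {"model": ..., "prompt": ..., "stream": False}
# to http://localhost:11434/api/generate and pass the JSON response here.
# Illustrative values: 450 tokens generated in 5 seconds.
sample = {"eval_count": 450, "eval_duration": 5_000_000_000}
print(round(tokens_per_second(sample), 1))  # 90.0
```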
Alternatives and similar repositories for ollama-benchmark:
Users interested in ollama-benchmark are comparing it to the repositories listed below.
- Dagger functions to import Hugging Face GGUF models into a local Ollama instance and optionally push them to ollama.com. ☆114 · Updated 8 months ago
- ☆74 · Updated last month
- A proxy server for multiple Ollama instances with key security ☆318 · Updated 3 weeks ago
- beep boop 🤖 ☆72 · Updated 3 weeks ago
- GPU Power and Performance Manager ☆52 · Updated 3 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆88 · Updated this week
- A fast batching API to serve LLM models ☆180 · Updated 9 months ago
- Code execution utilities for Open WebUI & Ollama ☆237 · Updated 2 months ago
- VSCode AI coding assistant powered by a self-hosted llama.cpp endpoint. ☆176 · Updated 2 weeks ago
- Review/check GGUF files and estimate the memory usage and maximum tokens per second. ☆76 · Updated this week
- Transparent proxy server for llama.cpp's server that provides automatic model swapping ☆152 · Updated last week
- A repository of Open-WebUI tools to use with your favourite LLMs ☆103 · Updated 2 weeks ago
- Distributed inference for MLX LLMs ☆79 · Updated 5 months ago
- A simple-to-use Ollama autocompletion engine with exposed options and streaming functionality ☆111 · Updated 3 months ago
- ☆40 · Updated 6 months ago
- Automatically quantize GGUF models ☆151 · Updated this week
- Integrates AI tools into Microsoft Word ☆107 · Updated last month
- An OpenAI-compatible API for chat with image input and questions about the images (aka multimodal) ☆220 · Updated last month
- AI-powered chatbot with real-time updates ☆44 · Updated 3 months ago
- Easily view and modify JSON datasets for large language models ☆69 · Updated 3 months ago
- This reference can be used with any existing OpenAI-integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste… ☆119 · Updated 11 months ago
- One-click templates for inferencing language models ☆145 · Updated 2 weeks ago
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models' general concepts, deployment techniqu… ☆58 · Updated 5 months ago
- Serving LLMs in the HF Transformers format via a PyFlask API ☆69 · Updated 4 months ago
- Self-host LLMs with vLLM and BentoML ☆79 · Updated 2 weeks ago
- Parse files (e.g. code repos) and websites to the clipboard or a file for ingestion by AI / LLMs ☆134 · Updated last month
- Comparison of the output quality of quantization methods, using Llama 3, Transformers, GGUF, and EXL2. ☆141 · Updated 8 months ago
- Ollama client written in Python ☆155 · Updated last month
- WIP: Open WebUI desktop application, based on Electron. ☆175 · Updated last week