aidatatools / ollama-benchmark
LLM Benchmark for Throughput via Ollama (Local LLMs)
☆210Updated 2 months ago
Alternatives and similar repositories for ollama-benchmark:
Users who are interested in ollama-benchmark are comparing it to the libraries listed below:
- Handy tool to measure the performance and efficiency of LLM workloads.☆54Updated 2 months ago
- Code execution utilities for Open WebUI & Ollama☆270Updated 5 months ago
- A repository of Open WebUI tools to use with your favourite LLMs☆199Updated last month
- OpenAPI Tool Servers☆285Updated this week
- beep boop 🤖 (experimental)☆101Updated 3 months ago
- An Open WebUI function for a better R1 experience☆80Updated last month
- Model swapping for llama.cpp (or any local OpenAPI compatible server)☆544Updated last week
- A proxy server for multiple ollama instances with Key security☆392Updated last week
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆149Updated this week
- Open source LLM UI, compatible with all local LLM providers.☆173Updated 7 months ago
- ☆169Updated this week
- Benchmark LLM performance☆95Updated 8 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆354Updated 3 weeks ago
- API up your Ollama Server.☆147Updated 3 weeks ago
- An OpenAI API-compatible speech-to-text server for audio transcription and translation, a.k.a. Whisper.☆74Updated 2 months ago
- ☆84Updated 4 months ago
- An OpenAI API-compatible API for chat with image input and questions about the images, a.k.a. multimodal.☆251Updated last month
- Lightweight Inference server for OpenVINO☆156Updated this week
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆150Updated 11 months ago
- A simple-to-use Ollama autocompletion engine with exposed options and streaming functionality☆123Updated 2 weeks ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files☆154Updated 2 months ago
- The Fastest Way to Fine-Tune LLMs Locally☆293Updated last month
- Guide to fine-tuning large language models for text completion, including example scripts and training data acquisition.☆72Updated last month
- Efficient visual programming for AI language models☆356Updated 7 months ago
- A fast batching API to serve LLM models☆182Updated 11 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated 11 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆310Updated last month
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆181Updated 2 months ago
- Integrates AI tools into Microsoft Word☆132Updated 4 months ago
- You don’t need to read the code to understand how to build!☆185Updated 3 months ago