MinhNgyuen / llm-benchmark
Benchmark LLM performance
☆104 · Updated last year
Alternatives and similar repositories for llm-benchmark
Users interested in llm-benchmark are comparing it to the libraries listed below.
- LLM Benchmark for Throughput via Ollama (Local LLMs) ☆286 · Updated 3 weeks ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆163 · Updated last year
- A fast batching API for serving LLMs ☆187 · Updated last year
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ☆607 · Updated 10 months ago
- Fully-featured, beautiful web interface for vLLM, built with Next.js. ☆150 · Updated 3 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆587 · Updated 6 months ago
- Code execution utilities for Open WebUI & Ollama ☆297 · Updated 9 months ago
- ☆209 · Updated last month
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆184 · Updated last year
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆361 · Updated this week
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data. ☆403 · Updated 3 months ago
- ☆96 · Updated last week
- Docker Compose setup to run vLLM on Windows ☆98 · Updated last year
- An Open WebUI function for a better R1 experience ☆79 · Updated 5 months ago
- This reference can be used with any existing OpenAI-integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste… ☆127 · Updated last year
- A Python-based web-assisted large language model (LLM) search assistant using llama.cpp ☆359 · Updated 10 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible ☆335 · Updated 6 months ago
- Automatically quantize GGUF models ☆197 · Updated this week
- A proxy server for multiple Ollama instances with key security ☆483 · Updated last month
- This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model) ☆323 · Updated 11 months ago
- Dolphin System Messages ☆346 · Updated 6 months ago
- InferX is an Inference Function-as-a-Service platform ☆129 · Updated last week
- One-click templates for inferencing Language Models ☆213 · Updated 3 weeks ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full Wikipedia datasets, taking in a query and returning full … ☆100 · Updated last week
- A simple experiment on letting two local LLMs have a conversation about anything! ☆110 · Updated last year
- Comparison of Language Model Inference Engines ☆229 · Updated 8 months ago
- An innovative library for efficient LLM inference via low-bit quantization ☆348 · Updated last year
- Web UI for ExLlamaV2 ☆510 · Updated 6 months ago
- Serving LLMs in the HF Transformers format via a PyFlask API ☆71 · Updated 11 months ago
- llama3.cuda is a pure C/CUDA implementation of the Llama 3 model. ☆342 · Updated 4 months ago