MinhNgyuen / llm-benchmarkLinks
Benchmark llm performance
☆104Updated last year
Alternatives and similar repositories for llm-benchmark
Users that are interested in llm-benchmark are comparing it to the libraries listed below
Sorting:
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆295Updated last month
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆163Updated last year
- A open webui function for better R1 experience☆79Updated 6 months ago
- A fast batching API to serve LLM models☆187Updated last year
- This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)☆322Updated last year
- ☆100Updated last month
- Code execution utilities for Open WebUI & Ollama☆296Updated 10 months ago
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆75Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆154Updated 4 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆589Updated 7 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆112Updated last year
- ☆209Updated 2 weeks ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated last year
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆409Updated 4 months ago
- InferX: Inference as a Service Platform☆135Updated this week
- One click templates for inferencing Language Models☆214Updated last month
- function calling-based LLM agents☆287Updated last year
- Use locally running LLMs directly from Siri 🦙🟣☆181Updated 11 months ago
- A Python-based web-assisted large language model (LLM) search assistant using Llama.cpp☆360Updated 11 months ago
- A proxy server for multiple ollama instances with Key security☆490Updated 2 weeks ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆338Updated 6 months ago
- automatically quant GGUF models☆202Updated this week
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆371Updated this week
- ☆225Updated 4 months ago
- A curated list of awesome Large Language Model (LLM) Web User Interfaces.☆525Updated 3 months ago
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆771Updated this week
- Open‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. …☆355Updated this week
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆126Updated 11 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆610Updated 10 months ago
- An innovative library for efficient LLM inference via low-bit quantization☆348Updated last year