MinhNgyuen / llm-benchmarkLinks
Benchmark llm performance
☆106Updated last year
Alternatives and similar repositories for llm-benchmark
Users that are interested in llm-benchmark are comparing it to the libraries listed below
Sorting:
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆606Updated 9 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- A fast batching API to serve LLM models☆188Updated last year
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆311Updated 3 months ago
- ☆208Updated 2 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆621Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆265Updated 8 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆414Updated last week
- function calling-based LLM agents☆289Updated last year
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆165Updated last year
- ☆106Updated 2 months ago
- This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)☆322Updated last year
- A open webui function for better R1 experience☆77Updated 8 months ago
- Web UI for ExLlamaV2☆511Updated 9 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆342Updated 8 months ago
- InferX: Inference as a Service Platform☆138Updated this week
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆128Updated last year
- automatically quant GGUF models☆214Updated 3 weeks ago
- An AI assistant beyond the chat box.☆328Updated last year
- Distributed Inference for mlx LLm☆99Updated last year
- The Fastest Way to Fine-Tune LLMs Locally☆325Updated 8 months ago
- Code execution utilities for Open WebUI & Ollama☆305Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆159Updated 6 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆111Updated last year
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆76Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆98Updated 4 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆190Updated last year
- Self-hosted LLM chatbot arena, with yourself as the only judge☆41Updated last year
- Code for Papeg.ai☆226Updated 10 months ago