MinhNgyuen / llm-benchmark
Benchmark llm performance
☆95Updated 8 months ago
Alternatives and similar repositories for llm-benchmark:
Users that are interested in llm-benchmark are comparing it to the libraries listed below
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆150Updated 11 months ago
- ☆198Updated last week
- LLM Benchmark for Throughput via Ollama (Local LLMs)☆210Updated 2 months ago
- ☆84Updated 4 months ago
- A fast batching API to serve LLM models☆182Updated 11 months ago
- A open webui function for better R1 experience☆80Updated last month
- a Repository of Open-WebUI tools to use with your favourite LLMs☆199Updated last month
- ☆108Updated 5 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆354Updated 3 weeks ago
- LLM Inference on consumer devices☆105Updated last month
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆552Updated 2 months ago
- ☆169Updated this week
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆90Updated 2 weeks ago
- API up your Ollama Server.☆147Updated 3 weeks ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo☆235Updated this week
- ☆29Updated last year
- A command-line personal assistant that integrates with Google Calendar, Gmail, and Tasks to help manage your digital life.☆121Updated 5 months ago
- Manifold is a platform for enabling workflow automation using AI assistants.☆365Updated last week
- Code execution utilities for Open WebUI & Ollama☆270Updated 5 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 2 months ago
- Comparison of Language Model Inference Engines☆214Updated 4 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆149Updated this week
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆74Updated 2 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆310Updated last month
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆251Updated last month
- automatically quant GGUF models☆168Updated this week
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆305Updated this week
- The Fastest Way to Fine-Tune LLMs Locally☆293Updated last month
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆580Updated 5 months ago
- Use locally running LLMs directly from Siri 🦙🟣☆169Updated 6 months ago