A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API
☆22Aug 1, 2024Updated last year
Alternatives and similar repositories for llm-speed-benchmark
Users that are interested in llm-speed-benchmark are comparing it to the libraries listed below
Sorting:
- ☆12May 30, 2025Updated 9 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆14Aug 25, 2024Updated last year
- ☆17Dec 16, 2024Updated last year
- A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.☆36Jul 16, 2025Updated 7 months ago
- Mixture-of-Ollamas☆30Aug 12, 2024Updated last year
- Find better generation parameters for your LLM☆27Jun 9, 2024Updated last year
- Like system requirements lab but for LLMs☆31Jun 10, 2023Updated 2 years ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- A MCP stdio toolpack for local LLMs☆20Oct 6, 2025Updated 4 months ago
- Demonstration of Single Sign On with an OpenId provider.☆12Oct 18, 2020Updated 5 years ago
- A system for managing files and file replicas across many diverse sites☆11Mar 23, 2023Updated 2 years ago
- ☆11Mar 15, 2023Updated 2 years ago
- Folder-structure and some examples for setting up a new ansible-project.☆10Dec 31, 2022Updated 3 years ago
- ☆13May 5, 2015Updated 10 years ago
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆23May 2, 2025Updated 10 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- For the better CI as well as CD using gogs and drone base on kubernetes☆10Jul 31, 2021Updated 4 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆13Mar 30, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆12Jan 19, 2024Updated 2 years ago
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.☆58Jan 5, 2025Updated last year
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆45Jan 28, 2024Updated 2 years ago
- Explore Pusheen Pics!☆12Dec 4, 2014Updated 11 years ago
- Oobabooga "Hello World" API example for node.js with Express☆13Jul 2, 2023Updated 2 years ago
- ☆12Jul 6, 2024Updated last year
- ☆12Mar 12, 2025Updated 11 months ago
- Use LLMs to clean your gmail inbox☆20Dec 23, 2023Updated 2 years ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆28Dec 29, 2025Updated 2 months ago
- ☆11Apr 23, 2023Updated 2 years ago
- npm package template with typescript and tsup☆10Nov 27, 2025Updated 3 months ago
- A Dronekit based APM connector☆11May 18, 2017Updated 8 years ago
- Docker images for various AI tools.☆13Jun 12, 2023Updated 2 years ago
- Agent framework for generating a synthetic dataset. This will be raw CoT and Reflection output to be cleaned up by a later step.☆15Apr 11, 2025Updated 10 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆12Jun 25, 2024Updated last year
- This project is an app that shows a map with Electric Charging Stations and their information. The app supports station markers clusterin…☆12Jan 15, 2024Updated 2 years ago
- Chat client for LLMs.☆15Jul 23, 2024Updated last year