alexziskind1 / llama-throughput-labView external linksLinks
Interactive launcher and benchmarking harness for llama.cpp server throughput, with tests, sweeps, and round‑robin load tools.
☆237Feb 8, 2026Updated last week
Alternatives and similar repositories for llama-throughput-lab
Users that are interested in llama-throughput-lab are comparing it to the libraries listed below
Sorting:
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a …☆10Jan 29, 2026Updated 2 weeks ago
- The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI pipelines for real-time conversations over WebRTC.☆39Dec 21, 2025Updated last month
- A lite version of OpenClaw built on n8n☆75Updated this week
- A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal.☆23Jan 2, 2026Updated last month
- Tools to analyze Interlisp source code, to support VM development, and to eventually bootstrap systems☆16Jan 12, 2025Updated last year
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated 8 months ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆21May 2, 2024Updated last year
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆39Jan 27, 2026Updated 2 weeks ago
- ☆54May 28, 2025Updated 8 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆50Nov 26, 2025Updated 2 months ago
- Run Orpheus 3B Locally With LM Studio☆32Mar 20, 2025Updated 10 months ago
- Flip Board Game for Spatial SharePlay 【Apple Vision Pro】☆14Jan 3, 2026Updated last month
- [ICLR 2025] BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments☆39Feb 17, 2025Updated 11 months ago
- A powerful MCP testing tool with multi-provider LLM support (Ollama, OpenAI, Claude, Gemini). Test, debug, and develop MCP servers with a…☆18Jan 7, 2026Updated last month
- ☆17Feb 4, 2026Updated last week
- Legacy official MegaZeux git repository. Use http://github.com/AliceLR/megazeux instead.☆14Jul 19, 2018Updated 7 years ago
- Easy Implementation of Assistants API with Code Interpreter and File Retrieval☆43Dec 9, 2023Updated 2 years ago
- ☆83Feb 28, 2025Updated 11 months ago
- LangChain + LiteLLM that works☆50Sep 1, 2025Updated 5 months ago
- ☆10Oct 2, 2024Updated last year
- Linear programming model for class schedule generation☆11Oct 11, 2015Updated 10 years ago
- ☆11Jan 7, 2023Updated 3 years ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago
- Docker Compose to install N8N, Openweb UI, Qdrant, Ollama, EvolutionAPI and other systems.☆10Feb 7, 2026Updated last week
- ☆10Sep 29, 2024Updated last year
- ☆11May 8, 2022Updated 3 years ago
- Standalone desktop application for Text-to-Speech (TTS) utilizing the Kokoro-82M AI model for pdf files☆28Updated this week
- Home server set up☆13Oct 5, 2025Updated 4 months ago
- Blazer is a case parser for maps and JSON keys using NIFs.☆11Jun 8, 2022Updated 3 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- Single-axis solar position tracking prototype with a Cortex M0 MCU (Ardunio MKRZero)☆13Apr 13, 2020Updated 5 years ago
- custom backplane for diy nas (4xSATA)☆16May 12, 2024Updated last year
- Simple Akka HTTP project implemented to describe how to build Microservices with Consumer Driven Contracts testing approach☆11Feb 15, 2018Updated 8 years ago
- Virtual Bash interpreter with a virtual file system for multi-tenant environments.☆31Updated this week
- Technical docs to help you make you Halo Strix WORK!☆23Jan 10, 2026Updated last month
- .NET library that provides Forex trading integration with the Oanda V20 REST Api.☆11Nov 20, 2024Updated last year
- ProTIP permet de caractériser la connectivité réelle entre composants d'une architecture PCI Express☆10Nov 9, 2023Updated 2 years ago
- Transform natural language into beautiful, interactive data visualizations using the Model Context Protocol (MCP) with Claude Desktop int…☆15Jun 27, 2025Updated 7 months ago