Interactive launcher and benchmarking harness for llama.cpp server throughput, with tests, sweeps, and round‑robin load tools.
☆344Feb 8, 2026Updated 2 months ago
Alternatives and similar repositories for llama-throughput-lab
Users that are interested in llama-throughput-lab are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jul 10, 2021Updated 4 years ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆21May 2, 2024Updated last year
- Spreadsheet-like programming on all your devices. http://object.network/onex-app.html☆16Jun 17, 2025Updated 9 months ago
- Anthropic MCP go implementation☆19Mar 19, 2026Updated 3 weeks ago
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Jan 13, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a …☆10Jan 29, 2026Updated 2 months ago
- Blazor Hybrid (.NET MAUI) and Blazor Web App Sample of a Todo app☆17Aug 16, 2023Updated 2 years ago
- ☆25Feb 10, 2026Updated 2 months ago
- A self-contained MCP server in docker that combines the Crawl4AI, SearXNG, and Supabase to provide AI agents and coding assistants with c…☆37Aug 8, 2025Updated 8 months ago
- Lua/Terra + Java Native Interface☆21Mar 3, 2017Updated 9 years ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- A platform for building reliable AI agents☆94Apr 3, 2026Updated last week
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control.☆29May 7, 2025Updated 11 months ago
- [unmaintained] Simple wrapper script to use Azure CLI with LocalStack☆12Nov 23, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆14Sep 16, 2024Updated last year
- A smart, open-source grocery list interface to Grocy.☆21Aug 9, 2021Updated 4 years ago
- A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal.☆23Jan 2, 2026Updated 3 months ago
- CLI-based tester for verifying that MCP servers work correctly when called directory and by agents☆34Sep 16, 2025Updated 6 months ago
- ⚠️ This is a mirror template repository for the new Expo default project template. Contributions and bug reports should be made in expo/e…☆30Updated this week
- OpenCode with all telemetry removed. Clean builds, no phone-home.☆77Updated this week
- Learn how to build your first neural network using Keras and Tensorflow to do Deep Learning!☆17Aug 22, 2020Updated 5 years ago
- Easy Implementation of Assistants API with Code Interpreter and File Retrieval☆43Dec 9, 2023Updated 2 years ago
- Transform slides and speaker notes into video☆39Mar 1, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- OpenWrt port for Ruckus R500 wireless access points☆12Mar 2, 2021Updated 5 years ago
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated 10 months ago
- ✅ Iterative Transparent Reasoning System by chonkyDB ✅ combining reasoning, graph and vector for trustworthy, explainable and smart LLMs …☆36Jun 13, 2025Updated 10 months ago
- A browser-based tool that renders `.gguf` language model files as interactive 3D point clouds.☆53Feb 8, 2026Updated 2 months ago
- Bambuser version of ffmpeg/android with custom python bindings☆40Jan 5, 2017Updated 9 years ago
- Flip Board Game for Spatial SharePlay 【Apple Vision Pro】☆14Jan 3, 2026Updated 3 months ago
- Tonepie cat litter on ESPHome☆17Oct 17, 2023Updated 2 years ago
- Second Generation of Large Language Models☆21Jun 30, 2025Updated 9 months ago
- ☆13Oct 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Simple JavaScript library for HEX/RGB/HSB/LAB/XYZ color spaces☆25Jan 10, 2017Updated 9 years ago
- SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profi…☆105Apr 6, 2026Updated last week
- adapt data to and from every format☆28Feb 15, 2026Updated 2 months ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- Linear programming model for class schedule generation☆11Oct 11, 2015Updated 10 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- AI-Driven Decentralized Organization (AIDO) using Supabase and LangChain.js. It includes all necessary files, configurations, tests, and…☆33Nov 15, 2024Updated last year