llama-benchy - llama-bench style benchmarking tool for all backends
☆138Mar 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for llama-benchy
Users that are interested in llama-benchy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Docker configuration for running VLLM on dual DGX Sparks☆648Mar 19, 2026Updated last week
- sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems☆66Updated this week
- A dynamic multi-expert AI architecture running on a single consumer GPU (RTX 3060).☆36Dec 2, 2025Updated 3 months ago
- Loader extension for tabbyAPI in SillyTavern☆26Jun 30, 2025Updated 8 months ago
- Various LLM Benchmarks☆24Feb 20, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Thank you LenAnderson I am yoinking this!☆24Jan 13, 2026Updated 2 months ago
- ☆21Oct 13, 2025Updated 5 months ago
- A repository to store helpful information and emerging insights in regard to LLMs☆21Oct 27, 2023Updated 2 years ago
- Build a simple CMD chat interface with llama.cpp and C++☆14Sep 19, 2025Updated 6 months ago
- QuickClash Revit Add-in for Clash Detection☆11Jun 17, 2022Updated 3 years ago
- ☆28Jan 2, 2026Updated 2 months ago
- Ultimate Persona is an all-in-one persona generator and plot hook creator for SillyTavern. It uses pre-existing character cards to shape …☆30Dec 30, 2025Updated 2 months ago
- ☆53Feb 27, 2026Updated last month
- Serverless RAG application with LlamaIndex and code interperter on Azure Container Apps☆12Jan 30, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 6 months ago
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.☆58Jan 5, 2025Updated last year
- A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively☆69Feb 22, 2026Updated last month
- Professional desktop app for converting text to audiobooks with local TTS☆31Oct 6, 2025Updated 5 months ago
- molequla.ai. live ecology of GPT organisms☆51Mar 18, 2026Updated last week
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆42Jan 27, 2026Updated 2 months ago
- Current Alpha version of the ONTO-TRON-5000☆40Dec 1, 2025Updated 3 months ago
- ☆24Mar 12, 2026Updated 2 weeks ago
- Implements harmful/harmless refusal removal using pure HF Transformers☆22May 8, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Rewritten frontend for SillyTavern☆70Feb 28, 2026Updated 3 weeks ago
- ☆20Mar 20, 2026Updated last week
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- A SillyTavern extension that fixes schizo markdown. Also some HTML/JS stuff.☆40Oct 17, 2025Updated 5 months ago
- Mindwrite, is simple flutter project with clean architecture and Bloc☆13Oct 24, 2024Updated last year
- Cross-GPU KV Cache Marketplace☆22Nov 12, 2025Updated 4 months ago
- A search index specialised for LaTeX equations. Developed for latexsearch.com.☆17Jul 15, 2011Updated 14 years ago
- 152 open-source tools to run LLMs 100% locally – no cloud, no API keys, no censorship☆48Nov 30, 2025Updated 3 months ago
- ☆36Jan 25, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CLI-first server inventory management with YAML as the single source of truth☆54Jan 25, 2026Updated 2 months ago
- ☆15Oct 31, 2023Updated 2 years ago
- ☆32Nov 16, 2025Updated 4 months ago
- Select LLM models for code completion, image recognition and text generation☆20Nov 6, 2024Updated last year
- Mustache templates for Haskell. megaparsec -> parsec; stache -> microstache☆17Jan 15, 2025Updated last year
- ☆12Apr 19, 2024Updated last year
- Racket GObjectIntrospection FFI☆16Oct 13, 2021Updated 4 years ago