llama-benchy - llama-bench style benchmarking tool for all backends
☆282 · Mar 12, 2026 · Updated last month
Alternatives and similar repositories for llama-benchy
Users who are interested in llama-benchy are comparing it to the libraries listed below.
- Docker configuration for running vLLM on dual DGX Sparks ☆1,083 · Updated this week
- sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems ☆124 · Updated this week
- Various LLM Benchmarks ☆25 · Feb 20, 2026 · Updated last month
- Thank you LenAnderson I am yoinking this! ☆25 · Apr 11, 2026 · Updated last week
- Turn any Kiwix ZIM archive (offline Wikipedia, Stack Exchange, DevDocs, etc.) into an instant knowledge source for LLMs with a tiny CLI +… ☆84 · Jun 4, 2025 · Updated 10 months ago
- A repository to store helpful information and emerging insights in regard to LLMs ☆21 · Oct 27, 2023 · Updated 2 years ago
- ☆30 · Jan 2, 2026 · Updated 3 months ago
- Ultimate Persona is an all-in-one persona generator and plot hook creator for SillyTavern. It uses pre-existing character cards to shape … ☆34 · Dec 30, 2025 · Updated 3 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3 ☆11 · Sep 14, 2025 · Updated 7 months ago
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration. ☆56 · Jan 5, 2025 · Updated last year
- Merge LLMs that are split into parts ☆26 · Mar 18, 2026 · Updated last month
- Quantized text-audio foundation model from Boson AI ☆43 · Aug 13, 2025 · Updated 8 months ago
- Docker image for a self-hosted Tailscale DERP server ☆26 · Mar 9, 2025 · Updated last year
- ☆14 · Jun 6, 2024 · Updated last year
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great. ☆45 · Jan 27, 2026 · Updated 2 months ago
- Current Alpha version of the ONTO-TRON-5000 ☆40 · Dec 1, 2025 · Updated 4 months ago
- Implements harmful/harmless refusal removal using pure HF Transformers ☆22 · May 8, 2025 · Updated 11 months ago
- Rewritten frontend for SillyTavern ☆70 · Feb 28, 2026 · Updated last month
- Simple Tool Caller for llama.cpp ☆11 · Aug 12, 2024 · Updated last year
- Karras et al. (2022) diffusion models for PyTorch ☆17 · Oct 5, 2023 · Updated 2 years ago
- Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot. ☆29 · Jul 18, 2024 · Updated last year
- Mindwrite is a simple Flutter project with clean architecture and Bloc ☆13 · Oct 24, 2024 · Updated last year
- ☆33 · Mar 12, 2026 · Updated last month
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ☆15 · Oct 12, 2022 · Updated 3 years ago
- Cross-GPU KV Cache Marketplace ☆22 · Nov 12, 2025 · Updated 5 months ago
- A SillyTavern extension that fixes schizo markdown. Also some HTML/JS stuff. ☆41 · Oct 17, 2025 · Updated 6 months ago
- A search index specialised for LaTeX equations. Developed for latexsearch.com. ☆17 · Jul 15, 2011 · Updated 14 years ago
- A Docker image which includes Bind and Stubby for DNS over TLS ☆13 · Oct 25, 2018 · Updated 7 years ago
- vibebin: code and host inside Incus containers on your own VPS/server. ☆75 · Mar 20, 2026 · Updated last month
- ☆16 · Mar 19, 2026 · Updated last month
- A real-time Grafana dashboard using MISP ZeroMQ message queue and InfluxDB ☆19 · Mar 15, 2024 · Updated 2 years ago
- NeMo: a toolkit for conversational AI ☆16 · Mar 9, 2023 · Updated 3 years ago
- 😎 Awesome things related to Tailwind CSS ☆13 · Sep 29, 2021 · Updated 4 years ago
- 152 open-source tools to run LLMs 100% locally – no cloud, no API keys, no censorship ☆55 · Nov 30, 2025 · Updated 4 months ago
- Check Point Useful Management API Tools contain scripts and tools that were used as solutions for customers. ☆16 · Mar 31, 2026 · Updated 2 weeks ago
- ☆33 · Nov 16, 2025 · Updated 5 months ago
- An open-source AI task stack, developed in Python, enables users to leverage 'humanized queries' for selecting test cases from a diverse … ☆22 · Apr 10, 2024 · Updated 2 years ago
- Deterministic Password Generator ☆10 · Oct 11, 2017 · Updated 8 years ago
- AI Search is a server application leveraging OpenAI's API to perform intelligent search operations on the Booking.com travel site. ☆21 · Mar 28, 2024 · Updated 2 years ago