🏎️ Dead-simple LLM benchmarking CLI - latency, cost, and quality metrics
☆83Apr 5, 2026Updated last month
Alternatives and similar repositories for bench-my-llm
Users that are interested in bench-my-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Iterative self-refinement fails for three structural reasons: prompt bias (models hallucinate flaws when asked to critique), scope creep …☆131May 21, 2026Updated last week
- Battle-tested operating system for shipping product with LLM coding agents. 17-stage wave loop, two-reviewer gate, dual-document agent me…☆93Apr 26, 2026Updated last month
- Sinaptic® DROID+ Community Version☆304May 5, 2026Updated 3 weeks ago
- Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding☆154May 22, 2026Updated last week
- An OpenCV based image analysis web application (Deployed at Hiroshima University)☆279Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Open-source AI-powered Security Operations Center — alert fusion, purple-team drills, agent-assisted triage, MITRE ATT&CK investigation. …☆1,100May 20, 2026Updated last week
- Guide to Stake bonus, Stake drop, Stake codes, and Stake monthly bonus strategies. Learn how rewards work, how to stay eligible for bonus…☆28Apr 5, 2026Updated last month
- Orchestrates multi-step AI agent workflows — defines execution graphs, manages shared state, dispatches work to agents, and tracks progre…☆316May 14, 2026Updated 2 weeks ago
- Self-Improving Agents -- A Progression Four levels of self-improving code agents, from the simplest loop to a full adversarial arena with…☆208Apr 9, 2026Updated last month
- Mission Control for your OpenClaw/Hermes agents☆589Updated this week
- GRRRBLEHHH!☆147May 12, 2026Updated 2 weeks ago
- PRD-driven Context Engineering: A systematic approach to building AI-powered products using progressive documentation and context-aware d…☆217May 22, 2026Updated last week
- Estimate cloud costs from Draw.io diagrams and Pulumi code.☆57May 20, 2026Updated last week
- Persistent memory for Claude Code. Your AI never starts from scratch again.☆59May 7, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- translates postgres schema into typescript and zod☆46May 12, 2026Updated 2 weeks ago
- MCP control plane that's easy to use☆85Apr 27, 2026Updated last month
- A local-first SDLC workflow harness — a concurrent event-sourced process manager with cooperative agents.☆47Updated this week
- The Definitive GPT Image 2 Prompt Vault — Master OpenAI's next-gen model with curated prompts for pixel-perfect text, consistent characte…☆193Apr 23, 2026Updated last month
- Covenant Layer is an open model for outcome-based coordination between users, agents, brokers, providers, verifiers, and settlement servi…☆80Mar 13, 2026Updated 2 months ago
- The unified API for autonomous companies. Ship a real business — website, backend, payments — from one terminal.☆117May 21, 2026Updated last week
- HASP is a local-first broker for managed secrets in agent workflows.☆514May 23, 2026Updated last week
- Lenest Hospital's Custom Billing and Hospital Management System☆11Oct 4, 2023Updated 2 years ago
- One-click Rama cluster deploy on AWS☆19Jan 15, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Shared context, memory, and task coordination across AI coding agents. Single Go binary, local SQLite, hybrid keyword and semantic search…☆271May 13, 2026Updated 2 weeks ago
- Vlad's Playbook — the operator's field manual where every artifact is live, clickable, and forwardable. 39 chapters · 25 interactive widg…☆339May 22, 2026Updated last week
- Autonomous self-evolving agents. Vision-grounded layered memory and self-written skills for LLM agents that operate your computer.☆1,023May 18, 2026Updated last week
- Splatoon Raiders PC — Windows version of the fast-paced ink-shooting action game. Vibrant turf wars, new weapons, maps, and smooth gamep…☆182Apr 22, 2026Updated last month
- B3OS is your automation platform for cross-chain workflows. Build automated strategies, schedule transactions, and execute complex on-cha…☆450May 21, 2026Updated last week
- The Deterministic Contract Layer for AI. Defines strict tool-calling protocols, the Token Box Model for context allocation, and type-safe…☆353Feb 19, 2026Updated 3 months ago
- The source repository for the Unlicense.org website.☆114Jan 24, 2026Updated 4 months ago
- 🤖 Taskade MCP · Official MCP server and OpenAPI to MCP codegen. Build AI agent tools from any OpenAPI API and connect to Claude, Cursor,…☆148May 19, 2026Updated last week
- Mobile Ui Buttons Tutorial from youtube☆28Nov 3, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- High-performance Telegram proxy with DPI evasion☆535May 19, 2026Updated last week
- Resource is a professional suite for advanced 2D/3D drafting. It features optimized toolsets, AI-assisted design, and high-performance li…☆189Apr 24, 2026Updated last month
- 👩💻 Revolt is the #1 Edgenuity hack / Edgenuity script That does ALL your work FOR YOU Automatically Completes Edgenuity assignments, E…☆842May 16, 2026Updated 2 weeks ago
- ArchUnitPython is an architecture testing library. Specify and ensure architecture rules in your Python app. Easy setup and pipeline inte…☆476Updated this week
- Markdown filesystem for agents and teams.☆744Updated this week
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-container☆44Sep 29, 2023Updated 2 years ago
- Desktop pets for AI coding agents. Install pets, connect Claude Code via MCP, and see live coding status on your desktop.☆948Updated this week