🏎️ Dead-simple LLM benchmarking CLI - latency, cost, and quality metrics
☆78Apr 5, 2026Updated 2 months ago
Alternatives and similar repositories for bench-my-llm
Users who are interested in bench-my-llm are comparing it to the libraries listed below.
- Battle-tested operating system for shipping product with LLM coding agents. 17-stage wave loop, two-reviewer gate, dual-document agent me…☆91Apr 26, 2026Updated last month
- A terrarium of creatures, each grove raised by a different AI. Feed them and they grow clever. Raise them too clever, and they turn on th…☆390Updated this week
- Open-source observability dashboard for OpenClaw daemons — cost analytics, live monitoring, and debugging tools, all running on your o…☆487Apr 15, 2026Updated 2 months ago
- Sinaptic® DROID+ Community Version☆207May 5, 2026Updated last month
- Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding☆170Updated this week
- VectorRAG.Net is a .NET-native high-performance vector database library for semantic search and RAG (Retrieval-Augmented Generation). Cor…☆195Jun 3, 2026Updated 2 weeks ago
- Open-source AI-powered Security Operations Center — alert fusion, purple-team drills, agent-assisted triage, MITRE ATT&CK investigation. …☆1,377Updated this week
- Vlad's Playbook — the operator's field manual where every artifact is live, clickable, and forwardable. 39 chapters · 25 interactive widg…☆465Updated this week
- Labor Market AI Agent☆248May 28, 2026Updated 3 weeks ago
- PhenoPixel: A web-based bioimage analysis platform for bacterial single-cell microscopy analytics.☆297Updated this week
- Modern C++23 SDK for the Kalshi prediction-market API — full typed REST coverage + real-time WebSocket streaming, RSA-PSS auth, and an ex…☆122Jun 7, 2026Updated last week
- ☆276Jun 3, 2026Updated 2 weeks ago
- GRRRBLEHHH!☆112May 12, 2026Updated last month
- Guide to Stake bonus, Stake drop, Stake codes, and Stake monthly bonus strategies. Learn how rewards work, how to stay eligible for bonus…☆28Apr 5, 2026Updated 2 months ago
- Orchestrates multi-step AI agent workflows — defines execution graphs, manages shared state, dispatches work to agents, and tracks progre…☆282Jun 6, 2026Updated last week
- Mission Control for your Hermes agents☆577Jun 5, 2026Updated 2 weeks ago
- Persistent memory for Claude Code. Your AI never starts from scratch again.☆56May 7, 2026Updated last month
- PRD-driven Context Engineering: A systematic approach to building AI-powered products using progressive documentation and context-aware d…☆212Updated this week
- CLI, SDK, and IDE plugins for Duel Agents☆962Jun 5, 2026Updated 2 weeks ago
- Self-Improving Agents -- A Progression Four levels of self-improving code agents, from the simplest loop to a full adversarial arena with…☆267Apr 9, 2026Updated 2 months ago
- The Definitive GPT Image 2 Prompt Vault — Master OpenAI's next-gen model with curated prompts for pixel-perfect text, consistent characte…☆70Apr 23, 2026Updated last month
- A local-first SDLC workflow engine for AI agents☆45Updated this week
- translates postgres schema into typescript and zod☆45May 12, 2026Updated last month
- Estimate cloud costs for Pulumi.☆75Jun 7, 2026Updated last week
- Covenant Layer is an open model for outcome-based coordination between users, agents, brokers, providers, verifiers, and settlement servi…☆68Mar 13, 2026Updated 3 months ago
- AINL helps turn AI from "a smart conversation" into "a structured worker." It is designed for teams building AI workflows that need mult…☆836Jun 11, 2026Updated last week
- MCP control plane that's easy to use☆79Apr 27, 2026Updated last month
- Turn any MCP server, OpenAPI spec, or GraphQL endpoint into a CLI at runtime.☆610Jun 3, 2026Updated 2 weeks ago
- Lenest Hospital's Custom Billing and Hospital Management System☆11Oct 4, 2023Updated 2 years ago
- One-click Rama cluster deploy on AWS☆19Jan 15, 2026Updated 5 months ago
- The unified API for autonomous companies. Ship a real business — website, backend, payments — from one terminal.☆105Jun 6, 2026Updated last week
- Point it at any GitHub repo, get DORA metrics, vulnerability scan, and a health score. CLI + dashboard.☆305Updated this week
- HASP is a local-first broker for managed secrets in agent workflows.☆433Jun 10, 2026Updated last week
- Shared context, memory, and task coordination across AI coding agents. Single Go binary, local SQLite, hybrid keyword and semantic search…☆311Jun 12, 2026Updated last week
- Resource is a professional suite for advanced 2D/3D drafting. It features optimized toolsets, AI-assisted design, and high-performance li…☆75Apr 24, 2026Updated last month
- Autonomous self-evolving agents. Vision-grounded layered memory and self-written skills for LLM agents that operate your computer.☆860May 18, 2026Updated last month
- B3OS is your automation platform for cross-chain workflows. Build automated strategies, schedule transactions, and execute complex on-cha…☆565May 21, 2026Updated 3 weeks ago
- The source repository for the Unlicense.org website.☆89Jan 24, 2026Updated 4 months ago
- The Deterministic Contract Layer for AI. Defines strict tool-calling protocols, the Token Box Model for context allocation, and type-safe…☆206Feb 19, 2026Updated 4 months ago