Provider-agnostic, open-source evaluation infrastructure for language models
☆784 Jun 26, 2026 Updated last week
Alternatives and similar repositories for openbench
Users who are interested in openbench are comparing it to the libraries listed below.
- Realtime News and Information Eval ☆20 Jun 26, 2026 Updated last week
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️ ☆76 May 21, 2024 Updated 2 years ago
- Groq Compound Beta MCP Server ☆52 Jun 26, 2026 Updated last week
- Local Groq Desktop chat app with MCP support ☆397 Jun 26, 2026 Updated last week
- Groq Public Changelog ☆18 May 6, 2026 Updated last month
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo… ☆176 Apr 9, 2024 Updated 2 years ago
- The official Node.js / TypeScript library for the Groq API ☆254 Jun 21, 2026 Updated last week
- Build, enrich, and transform datasets using AI models with no code ☆1,634 May 26, 2026 Updated last month
- ☆11 Aug 26, 2024 Updated last year
- Claude Code for Kimi K2 ☆14 Sep 15, 2025 Updated 9 months ago
- ☆28 Feb 11, 2026 Updated 4 months ago
- Base project for bootstrapping frontend projects ☆16 Jan 28, 2026 Updated 5 months ago
- The Modern Data Stack in a (Smaller) Box ☆12 Jan 28, 2023 Updated 3 years ago
- SpectralQuant: Calibrated Eigenbasis Rotation and Water-Filled Bit Allocation for KV-Cache Compression ☆196 May 15, 2026 Updated last month
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resources ☆17 Jun 11, 2026 Updated 3 weeks ago
- A highly customizable, lightweight, and open-source coding CLI powered by Groq for instant iteration. ☆734 Dec 19, 2025 Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,463 Jun 23, 2026 Updated last week
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement… ☆10,198 Updated this week
- moodist ☆28 Apr 23, 2026 Updated 2 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho… ☆120 Jul 31, 2025 Updated 11 months ago
- A fun multiplayer game built on Convex using Dall-E. ☆18 Feb 6, 2026 Updated 4 months ago
- Renderer for the harmony response format to be used with gpt-oss ☆4,426 Apr 8, 2026 Updated 2 months ago
- Context Engineering Course with DSPy ☆226 Jul 27, 2025 Updated 11 months ago
- An autonomous AI agent that plays Pokemon FireRed in real time using OpenAI's LLM, with a live web dashboard for monitoring. ☆77 Feb 15, 2026 Updated 4 months ago
- ☆17 Feb 23, 2026 Updated 4 months ago
- Semantic search and document parsing tools for the command line ☆1,828 Mar 11, 2026 Updated 3 months ago
- Open GenAI Stack ☆8,418 Jun 27, 2026 Updated last week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app ☆2,353 Updated this week
- AI app to generate a blog from a YouTube video URL. ☆14 Nov 1, 2023 Updated 2 years ago
- The LLM Evaluation Framework ☆16,516 Jun 26, 2026 Updated last week
- ☆16 May 31, 2025 Updated last year
- Run evals using LLM ☆27 Jan 8, 2026 Updated 5 months ago
- Everything about the SmolLM and SmolVLM family of models ☆3,826 May 26, 2026 Updated last month
- Simple repository for training small reasoning models ☆51 Feb 17, 2026 Updated 4 months ago
- groq-gradio ☆18 Nov 19, 2025 Updated 7 months ago
- Our library for RL environments + evals ☆4,233 Jun 26, 2026 Updated last week
- Custom hooks for pi coding agent ☆121 May 18, 2026 Updated last month
- Transcribing audio files on Modal with open source ASR models is fast, cheap, and easy! ☆21 Jul 25, 2025 Updated 11 months ago
- dbSurface is a SQL editor made for pgvector. ☆24 Dec 6, 2025 Updated 6 months ago