groq / openbenchLinks
Provider-agnostic, open-source evaluation infrastructure for language models
☆709Updated 3 weeks ago
Alternatives and similar repositories for openbench
Users that are interested in openbench are comparing it to the libraries listed below
Sorting:
- Together Open Deep Research☆354Updated 9 months ago
- ☆348Updated this week
- Claude Deep Research config for Claude Code.☆225Updated 10 months ago
- Routing on Random Forest (RoRF)☆238Updated last year
- Deep Research for your internal data☆353Updated 7 months ago
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support…☆745Updated 7 months ago
- A lightweight express.js server implementing OpenAI’s Responses API, built on top of Chat Completions, powered by Hugging Face Inference …☆214Updated 5 months ago
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆634Updated this week
- Context Engineering Course with DSPy☆211Updated 5 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆435Updated last week
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 10 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆804Updated last week
- Tutorial for building LLM router☆242Updated last year
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆983Updated last month
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆446Updated 4 months ago
- Open-source versioning, tracing, and annotation tooling.☆211Updated 2 months ago
- ☆114Updated 6 months ago
- ☆259Updated last month
- An open-source tool for LLM prompt optimization.☆746Updated last week
- This repo tracks the opened and merged PRs by the top SWE coding agents by OpenAI, GitHub, and others. Updates every 3 hours.☆298Updated this week
- 🤖 Headless IDE for AI agents☆199Updated 3 months ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆341Updated 4 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆490Updated 5 months ago
- General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.☆1,230Updated this week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆409Updated 4 months ago
- The lightweight framework for building agents☆258Updated this week
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆274Updated 2 months ago
- Letting Claude Code develop his own MCP tools :)☆122Updated 10 months ago
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆634Updated last month
- Local Groq Desktop chat app with MCP support☆381Updated last week