groq / openbenchLinks
Provider-agnostic, open-source evaluation infrastructure for language models
☆698Updated this week
Alternatives and similar repositories for openbench
Users that are interested in openbench are comparing it to the libraries listed below
Sorting:
- Together Open Deep Research☆356Updated 8 months ago
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support…☆727Updated 6 months ago
- Deep Research for your internal data☆350Updated 6 months ago
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆981Updated last month
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆123Updated 9 months ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆327Updated 3 months ago
- Claude Deep Research config for Claude Code.☆224Updated 9 months ago
- Open-source versioning, tracing, and annotation tooling.☆210Updated last month
- Routing on Random Forest (RoRF)☆235Updated last year
- Local Groq Desktop chat app with MCP support☆379Updated 2 weeks ago
- ☆686Updated last week
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆759Updated 2 weeks ago
- The State Of The Art, intelligence☆157Updated 4 months ago
- Tutorial for building LLM router☆239Updated last year
- II-Researcher: a new open-source framework designed to aid building search / research agents☆487Updated 4 months ago
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆633Updated 3 weeks ago
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆654Updated this week
- A lightweight express.js server implementing OpenAI’s Responses API, built on top of Chat Completions, powered by Hugging Face Inference …☆203Updated 4 months ago
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆441Updated 3 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆270Updated 2 months ago
- The fastest, lightest, and easiest-to-integrate AI gateway on the market. Fully open-sourced.☆490Updated last month
- mkinf SDK to interact with mkinf hub MCP servers☆134Updated 9 months ago
- Hallucination Detector is a free and open-source tool that helps you verify the accuracy of your LLM generated content instantly.☆303Updated last month
- A tool kit for generating high quality prompts using DSPy GEPA optimizer☆289Updated 2 weeks ago
- Write YAML, execute Agent Workflows☆294Updated this week
- A toolkit for building computer use AI agents☆180Updated 6 months ago
- ☆113Updated 5 months ago
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆547Updated last week
- Living memory for AI☆320Updated this week
- Context Engineering Course with DSPy☆206Updated 5 months ago