groq / openbench
Provider-agnostic, open-source evaluation infrastructure for language models
☆681Updated this week
Alternatives and similar repositories for openbench
Users who are interested in openbench are comparing it to the libraries listed below
- ☆124Updated this week
- Together Open Deep Research☆355Updated 7 months ago
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support…☆715Updated 6 months ago
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆969Updated 2 weeks ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆323Updated 2 months ago
- A general library for generating high-quality synthetic data from scratch or based on your own seed data.☆403Updated this week
- An alignment auditing agent capable of quickly exploring alignment hypotheses☆696Updated 2 weeks ago
- Open-source versioning, tracing, and annotation tooling.☆207Updated last month
- Routing on Random Forest (RoRF)☆226Updated last year
- Claude Deep Research config for Claude Code.☆222Updated 8 months ago
- Context Engineering Course with DSPy☆202Updated 4 months ago
- Local Groq Desktop chat app with MCP support☆375Updated this week
- Deep Research for your internal data☆349Updated 6 months ago
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆436Updated 3 months ago
- Parallel Reasoning: llm-consortium orchestrates multiple LLMs, iteratively refines & achieves consensus.☆367Updated last month
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆269Updated last month
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆123Updated 9 months ago
- Official Python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆624Updated this week
- A lightweight express.js server implementing OpenAI’s Responses API, built on top of Chat Completions, powered by Hugging Face Inference …☆201Updated 4 months ago
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆474Updated last week
- The fastest, lightest, and easiest-to-integrate AI gateway on the market. Fully open-sourced.☆478Updated 2 weeks ago
- 🤖 Headless IDE for AI agents☆200Updated 2 months ago
- Semantic search and document parsing tools for the command line☆1,474Updated last week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆278Updated last month
- A comprehensive 0-to-1 guide for building self-improving LLM applications with DSPy framework☆194Updated 2 months ago
- Verify the precision of all Kimi K2 API vendors☆461Updated 2 weeks ago
- Evolve your language agent with Agentic Context Engineering (ACE)☆136Updated 3 weeks ago
- ☆216Updated last week
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.☆366Updated this week
- The toolkit for AI devtools context engineering. Build with codebase mapping, symbol extraction, and many kinds of code search.☆1,175Updated this week