groq / openbenchLinks
Provider-agnostic, open-source evaluation infrastructure for language models
☆719Updated last month
Alternatives and similar repositories for openbench
Users that are interested in openbench are comparing it to the libraries listed below
Sorting:
- Together Open Deep Research☆358Updated 9 months ago
- Deep Research for your internal data☆351Updated 8 months ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆348Updated 4 months ago
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support…☆755Updated 8 months ago
- Claude Deep Research config for Claude Code.☆226Updated 10 months ago
- Routing on Random Forest (RoRF)☆239Updated last year
- ☆757Updated last week
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 10 months ago
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆692Updated this week
- The State Of The Art, intelligence☆157Updated 5 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆592Updated last month
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆273Updated 3 months ago
- Tutorial for building LLM router☆244Updated last year
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆996Updated this week
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆448Updated 5 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆417Updated 4 months ago
- The lightweight framework for building agents☆291Updated this week
- 🤖 Headless IDE for AI agents☆200Updated 4 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆874Updated last week
- ☆274Updated 2 weeks ago
- Context Engineering Course with DSPy☆211Updated 6 months ago
- Local Groq Desktop chat app with MCP support☆382Updated this week
- A framework for optimizing DSPy programs with RL☆308Updated 3 weeks ago
- Parallel Reasoning: llm-consortium orchestrates mulitple LLMs, iteratively refines & achieves consensus.☆373Updated last month
- II-Researcher: a new open-source framework designed to aid building search / research agents☆493Updated 6 months ago
- Open-source versioning, tracing, and annotation tooling.☆214Updated 2 weeks ago
- Letting Claude Code develop his own MCP tools :)☆123Updated 11 months ago
- ☆114Updated 7 months ago
- Optimize Document Retrieval with Fine-Tuned KnowledgeBases☆180Updated 3 months ago
- A tool kit for generating high quality prompts using DSPy GEPA optimizer☆296Updated last week