groq / openbenchLinks
Provider-agnostic, open-source evaluation infrastructure for language models
☆531Updated this week
Alternatives and similar repositories for openbench
Users that are interested in openbench are comparing it to the libraries listed below
Sorting:
- Together Open Deep Research☆346Updated 5 months ago
- Deep Research for your internal data☆337Updated 3 months ago
- The toolkit for AI devtools context engineering. Build with codebase mapping, symbol extraction, and many kinds of code search.☆606Updated last week
- Open-source versioning, tracing, and annotation tooling.☆192Updated this week
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 6 months ago
- Context Engineering Course with DSPy☆174Updated last month
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support…☆569Updated 3 months ago
- Helping you select an AI agent framework☆377Updated last month
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆290Updated 3 weeks ago
- Routing on Random Forest (RoRF)☆203Updated 11 months ago
- Semantic search and document parsing tools for the command line☆929Updated last week
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆932Updated 3 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆470Updated last month
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆251Updated 2 weeks ago
- DS.js (Declarative Self‑learning JavaScript☆118Updated 6 months ago
- Hallucination Detector is a free and open-source tool that helps you verify the accuracy of your LLM generated content instantly.☆282Updated 3 months ago
- Local Groq Desktop chat app with MCP support☆354Updated this week
- ☆188Updated 9 months ago
- ☆113Updated 2 months ago
- Optimize prompts, code, and more with AI-powered Reflective Text Evolution☆580Updated last week
- Claude Deep Research config for Claude Code.☆212Updated 6 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆322Updated this week
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆377Updated last week
- The State Of The Art, intelligence☆152Updated last month
- ☆142Updated last month
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆458Updated last month
- Tutorial for building LLM router☆226Updated last year
- ☆295Updated last month
- Letting Claude Code develop his own MCP tools :)☆120Updated 6 months ago
- Pixelagent — Multimodal stateful agents☆217Updated 3 months ago