JudgmentLabs / judgevalLinks
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
☆1,013Updated this week
Alternatives and similar repositories for judgeval
Users that are interested in judgeval are comparing it to the libraries listed below
Sorting:
- OSS RL environment + evals toolkit☆181Updated this week
- A month-long, open-source AI Agent Hackathon — open to all builders and dreamers working on agents, RAG, tool use, and multi-agent system…☆236Updated 2 months ago
- A multi-agent orchestration framework that works with any agent framework☆196Updated 3 months ago
- AgentPro is a lightweight library for developing ReAct-style agents with tools, knowledge base, memory and reasoning.☆43Updated 3 months ago
- Open collaboration infrastructure that enables communication, coordination, trust and payments for The Internet of Agents.☆187Updated last week
- The official Python library for Arklex framework☆680Updated this week
- When Philosophy meets AI☆1,335Updated 3 months ago
- 🚀 MassGen: An Open-source Multi-Agent Scaling System Inspired by Grok Heavy and Gemini Deep Think. Join the discord channel: https://dis…☆449Updated last week
- An MCP Multimodal AI Agent with eyes and ears!☆441Updated last week
- BharatMLStack is an open-source, end-to-end machine learning infrastructure stack built at Meesho to support real-time and batch ML workl…☆578Updated this week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆129Updated last week
- This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fo…☆1,473Updated 7 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆232Updated 2 weeks ago
- Dynamiq is an orchestration framework for agentic AI and LLM applications☆932Updated last week
- The Orchestration Layer for AI agents. Connect your models, tools, and data into a smart interface to create agentic apps that can think,…☆225Updated last week
- On the Theoretical Limitations of Embedding-Based Retrieval☆552Updated last week
- Python SDK to build realtime AI applications on voice and video.☆398Updated 10 months ago
- GraphBit is the world’s first enterprise-grade Agentic AI framework, built on a Rust core with a Python wrapper for unmatched speed, secu…☆296Updated this week
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers☆330Updated 2 weeks ago
- ☆309Updated 5 months ago
- Provider-agnostic, open-source evaluation infrastructure for language models☆539Updated this week
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆365Updated 2 weeks ago
- 🤖 AI-powered software engineering multi-agent system with researcher and developer agents that automate code implementation through inte…☆556Updated 2 weeks ago
- The AI Browser Automation Framework☆261Updated last week
- The official Python SDK for Eval Protocol☆60Updated this week
- ☆54Updated 4 months ago
- A catalogue of existing Nanda servers☆185Updated 5 months ago
- 🧑🏫 Automatically transform documents into beautiful slide decks☆140Updated 6 months ago
- Readymade evaluators for your LLM apps☆733Updated 3 weeks ago
- Together Open Deep Research☆349Updated 5 months ago