JudgmentLabs / judgeval
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
☆1,016Updated this week
Alternatives and similar repositories for judgeval
Users who are interested in judgeval are comparing it to the libraries listed below.
- The CLI for GPUs☆125Updated last week
- OSS RL environment + evals toolkit☆267Updated this week
- A tutorial on how to use Model Context Protocol by Anthropic and Agent2Agent Protocol by Google☆99Updated 7 months ago
- An interface library for RL post training with environments.☆829Updated this week
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆242Updated last week
- An MCP Multimodal AI Agent with eyes and ears!☆508Updated last month
- ☆126Updated last month
- A general library for generating high-quality synthetic data from scratch or based on your own seed data.☆403Updated this week
- Repository of implementations of classic and SOTA RL algorithms from scratch in PyTorch☆213Updated last month
- BharatMLStack is an open-source, end-to-end machine learning infrastructure stack built at Meesho to support real-time and batch ML workl…☆598Updated this week
- 🚀 MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agents…☆637Updated last week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated 3 weeks ago
- Just like the beloved character Doraemon who pulls out gadgets from his pocket, this agent can dynamically create, save, and utilize its …☆17Updated 10 months ago
- Open source codebase for Scale Agentex☆234Updated this week
- ☆89Updated 10 months ago
- Practical system design, tools, and hands-on resources for building Gen-AI agents & agentic AI systems.☆158Updated last week
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile …☆83Updated 2 months ago
- ☆1,244Updated 2 months ago
- A category wise collection of 200+ LLM survey papers.☆238Updated 8 months ago
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆969Updated 3 weeks ago
- On the Theoretical Limitations of Embedding-Based Retrieval☆609Updated 2 months ago
- Collection of 2025 internships in Product Management!☆84Updated this week
- Open collaboration infrastructure that enables communication, coordination, trust and payments for The Internet of Agents.☆202Updated this week
- An alignment auditing agent capable of quickly exploring alignment hypotheses☆708Updated 2 weeks ago
- NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload a…☆298Updated 6 months ago
- repo of paper implementations☆20Updated 9 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch☆394Updated last month
- A month-long, open-source AI Agent Hackathon — open to all builders and dreamers working on agents, RAG, tool use, and multi-agent system…☆240Updated 5 months ago
- ☆101Updated 6 months ago
- Anemoi: A Semi-Centralized Multi-agent System Based on Agent-to-Agent Communication MCP server from Coral Protocol☆369Updated 3 months ago