JudgmentLabs / judgevalLinks
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
☆1,013Updated this week
Alternatives and similar repositories for judgeval
Users that are interested in judgeval are comparing it to the libraries listed below
Sorting:
- BharatMLStack is an open-source, end-to-end machine learning infrastructure stack built at Meesho to support real-time and batch ML workl…☆586Updated this week
- A month-long, open-source AI Agent Hackathon — open to all builders and dreamers working on agents, RAG, tool use, and multi-agent system…☆237Updated 4 months ago
- OSS RL environment + evals toolkit☆198Updated this week
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆240Updated last week
- curated collection of real world applications that use LLMs☆288Updated 6 months ago
- A multi-agent orchestration framework that works with any agent framework☆199Updated 5 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆38Updated last year
- Cookbooks for AI Agents☆149Updated 5 months ago
- Find the Root Cause in Your Code's Trace☆348Updated this week
- Pixeltable — Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.☆1,222Updated this week
- The official Python SDK for Eval Protocol☆65Updated this week
- A comprehensive face analysis library that provides unified APIs for various face-related tasks☆328Updated 4 months ago
- ☆61Updated 6 months ago
- 🚀 MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agents…☆585Updated last week
- A tutorial on how to use Model Context Protocol by Anthropic and Agent2Agent Protocol by Google☆96Updated 6 months ago
- An example showing how A2A and MCP can be used together☆182Updated 5 months ago
- building a Large Language Model (LLM) from scratch.☆34Updated 9 months ago
- Open-source AI agent for web automation and scraping.☆291Updated 9 months ago
- The official Python library for Arklex framework☆687Updated this week
- A curated list of open source repositories for AI Engineers☆119Updated 7 months ago
- The profiler that gives a unified view of your entire stack - from PyTorch down to GPU☆94Updated 2 months ago
- Readymade evaluators for agent trajectories☆373Updated 2 months ago
- A CLI for GPUs☆115Updated 2 weeks ago
- Enterprise-grade memory framework for LLMs featuring GPU-optimized inference, vector storage, and automated scaling. Enables hyper-person…☆88Updated 6 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆386Updated last month
- An open-source tool for LLM prompt optimization.☆698Updated this week
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆636Updated this week
- The AI Browser Automation Framework☆318Updated 2 weeks ago
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆204Updated 2 months ago
- Practical system design, tools, and hands-on resources for building Gen-AI agents & agentic AI systems.☆134Updated 4 months ago