JudgmentLabs / judgeval
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
☆985 Updated this week
Alternatives and similar repositories for judgeval
Users who are interested in judgeval are comparing it to the libraries listed below.
- Python SDK to build realtime AI applications on voice and video. ☆398 Updated 9 months ago
- BharatMLStack is an open-source, end-to-end machine learning infrastructure stack built at Meesho to support real-time and batch ML workl… ☆573 Updated this week
- Cookbooks for AI Agents ☆148 Updated 3 months ago
- An MCP Multimodal AI Agent with eyes and ears! ☆372 Updated this week
- Collection of 2025 internships in Product Management! ☆71 Updated this week
- A tutorial on how to use Model Context Protocol by Anthropic and Agent2Agent Protocol by Google ☆88 Updated 4 months ago
- Notion for AI Observability 📊 ☆309 Updated this week
- This repo contains assignments and projects specifically to the various Gen AI courses that I am auditing. ☆19 Updated 11 months ago
- Build hours code to share. ☆459 Updated last week
- Implement a reasoning LLM in PyTorch from scratch, step by step ☆225 Updated this week
- Find the Root Cause in Your Code's Trace ☆311 Updated this week
- A month-long, open-source AI Agent Hackathon — open to all builders and dreamers working on agents, RAG, tool use, and multi-agent system… ☆232 Updated 2 months ago
- 🦄 ai that works - every tuesday 10 AM PST ☆350 Updated this week
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers… ☆924 Updated 3 months ago
- The everything tool for model alignment ☆61 Updated last week
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments ☆230 Updated last week
- Tool for generating high quality Synthetic datasets ☆1,152 Updated 3 weeks ago
- ☆81 Updated last month
- A curated list of open source repositories for AI Engineers ☆116 Updated 5 months ago
- The official Python library for Arklex framework ☆663 Updated this week
- Multi Agentic Architectures for complex problem-solving. ☆27 Updated 5 months ago
- A CLI for GPUs ☆111 Updated 3 weeks ago
- Terminal-based AI Coding Agent, similar to Claude Code, OpenAI Codex etc. but works with many more LLMs e.g. Gemini, Groq, Deepseek ☆141 Updated 4 months ago
- agent-from-scratch is a Python-based repository designed for developers and researchers interested in understanding the inner workings of… ☆91 Updated 8 months ago
- Practice The CodeSignal Pre-screen for the Industry Coding Framework. ☆171 Updated 10 months ago
- A category wise collection of 200+ LLM survey papers. ☆176 Updated 4 months ago
- An example showing how A2A and MCP can be used together ☆179 Updated 3 months ago
- A practical RAG where you can download and chat with github repo ☆87 Updated 6 months ago
- 2025 & 2026 New grad full-time roles in SWE, Quant, and PM. ☆1,868 Updated this week
- An independent AI research program created by Harshit. ☆97 Updated last year