JudgmentLabs / judgevalLinks
The open source post-building layer for agents. Our traces + evals power agent post-training (RL, SFT), monitoring, and regression testing.
β563Updated this week
Alternatives and similar repositories for judgeval
Users that are interested in judgeval are comparing it to the libraries listed below
Sorting:
- Perplexity powered AI assistant for time based trivia gamesβ153Updated 4 months ago
- Notion for AI Observability πβ306Updated this week
- Multi Agentic Architectures for complex problem-solving.β26Updated 4 months ago
- Collection of 2025 internships in Product Management!β54Updated this week
- irresponsible innovation. Try now at https://chat.dev/β484Updated last year
- BharatMLStack is an open-source, end-to-end machine learning infrastructure stack built at Meesho to support real-time and batch ML worklβ¦β539Updated last week
- The easiest way to use GPUs.β110Updated last month
- Python SDK to build realtime AI applications on voice and video.β398Updated 7 months ago
- This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & foβ¦β1,372Updated 5 months ago
- An MCP Multimodal AI Agent with eyes and ears!β231Updated last week
- 2024 & 2025 New grad full-time roles in SWE, Quant, and PM.β1,406Updated this week
- β79Updated last year
- Hitchcock a multi-agent movie maker, powered by mahiloβ67Updated 4 months ago
- Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, intoβ¦β4,310Updated this week
- Portia Labs Python SDK for building agentic workflows.β281Updated this week
- Judging opportunities for devs to pass the O-1A / EB-1 judging requirement.β44Updated 5 months ago
- A month-long, open-source AI Agent Hackathon β open to all builders and dreamers working on agents, RAG, tool use, and multi-agent systemβ¦β223Updated 2 weeks ago
- Collection of 2025 New Grad Jobs in Software Engineer!β215Updated this week
- A multi-agent orchestration framework that works with any agent frameworkβ193Updated last month
- vscode extension to convert computationally intensive pytorch kernels to tritonβ22Updated 9 months ago
- A tutorial on how to use Model Context Protocol by Anthropic and Agent2Agent Protocol by Googleβ77Updated 2 months ago
- Everything about LLMs in production.β75Updated last year
- Personal portfolio site based off of ChatGPTβ23Updated this week
- β86Updated 6 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessmentsβ219Updated this week
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorchβ309Updated this week
- An independent AI research program created by Harshit.β96Updated 11 months ago
- β138Updated this week
- An example showing how A2A and MCP can be used togetherβ169Updated 2 months ago
- π§βπ« Automatically transform documents into beautiful slide decksβ121Updated 4 months ago