langchain-ai / agentevals
Readymade evaluators for agent trajectories
☆212 · Updated last week
Alternatives and similar repositories for agentevals
Users who are interested in agentevals are comparing it to the libraries listed below.
- Readymade evaluators for your LLM apps ☆510 · Updated last week
- Build LangGraph agents with large numbers of tools ☆291 · Updated 2 months ago
- Semantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text. ☆251 · Updated last month
- ☆355 · Updated 2 weeks ago
- Terminal-based AI Coding Agent, similar to Claude Code, OpenAI Codex etc. but works with many more LLMs e.g. Gemini, Groq, Deepseek ☆115 · Updated 3 weeks ago
- A managed RAG API server. ☆175 · Updated last week
- ☆128 · Updated last week
- A collection of generative UI agents written with LangGraph.js ☆240 · Updated last month
- ☆192 · Updated 4 months ago
- Together Open Deep Research ☆298 · Updated last month
- ☆99 · Updated 2 months ago
- CLI to generate LangGraph stubs from a specification ☆74 · Updated 2 months ago
- An Email manager that takes over your inbox, prioritizing messages, reading attachments and drafting replies so you can focus on what tru… ☆150 · Updated last month
- An implementation of a computer use agent (CUA) using LangGraph ☆150 · Updated 2 months ago
- ☆173 · Updated 5 months ago
- ☆497 · Updated 2 weeks ago
- ☆110 · Updated 2 weeks ago
- ☆106 · Updated 2 months ago
- Ranking LLMs on agentic tasks ☆134 · Updated 2 weeks ago
- A Generative UI app for interacting with Computer Use Agents ☆182 · Updated last month
- When RAG and agents fall in love ☆326 · Updated 6 months ago
- An assistant for Slack built with Arcade and Langgraph. Interact with Google Calendar, Mail, Github, Search Engines, Firecrawl and more a… ☆82 · Updated 2 months ago
- ☆102 · Updated last month
- A bot with memory, built on LangGraph Cloud. ☆118 · Updated 10 months ago
- ☆502 · Updated last week
- The open-source multi-agent chat interface that lets you manage multiple agents in one dynamic conversation and add MCP servers for deep … ☆389 · Updated last month
- ☆99 · Updated 8 months ago
- An example showing how A2A and MCP can be used together ☆158 · Updated last week
- ☆81 · Updated 6 months ago
- Safely run untrusted Python code using Pyodide and Deno ☆65 · Updated last week