langchain-ai / agentevalsLinks
Readymade evaluators for agent trajectories
☆381Updated 2 months ago
Alternatives and similar repositories for agentevals
Users that are interested in agentevals are comparing it to the libraries listed below
Sorting:
- Readymade evaluators for your LLM apps☆795Updated last week
- Build LangGraph agents with large numbers of tools☆463Updated 5 months ago
- ☆201Updated last month
- A managed RAG API server.☆328Updated 5 months ago
- ☆463Updated 5 months ago
- ☆157Updated 7 months ago
- Semantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text.☆275Updated 6 months ago
- ☆626Updated 5 months ago
- A collection of generative UI agents written with LangGraph.js☆352Updated 6 months ago
- ☆217Updated 4 months ago
- ☆270Updated 11 months ago
- ☆217Updated 6 months ago
- ☆357Updated 3 weeks ago
- ☆139Updated 3 weeks ago
- ☆1,135Updated 2 weeks ago
- The open-source multi-agent chat interface that lets you manage multiple agents in one dynamic conversation and add MCP servers for deep …☆460Updated 6 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆239Updated this week
- 📥 An inbox UX for interacting with human-in-the-loop agents.☆874Updated 6 months ago
- Salesforce Enterprise Deep Research☆732Updated last week
- An example of multi-agent orchestration with llama-index☆437Updated 9 months ago
- Named Entity Recognition using Claude Citations☆79Updated 5 months ago
- LangGraph solution template for MCP☆561Updated 8 months ago
- A Generative UI app for interacting with Computer Use Agents☆210Updated 7 months ago
- Doctor is a tool for discovering, crawl, and indexing web sites to be exposed as an MCP server for LLM agents.☆461Updated 5 months ago
- ☆425Updated last year
- An implementation of a computer use agent (CUA) using LangGraph☆184Updated 7 months ago
- 🤖 An open-source, AI agent-native research canvas application that performs real-time search with HITL (Human in The Loop) capabilities,…☆342Updated this week
- Ranking LLMs on agentic tasks☆199Updated 2 months ago
- Together Open Deep Research☆352Updated 6 months ago
- ☆210Updated 3 months ago