langchain-ai / claude-code-evalsLinks
Ablation study comparing Claude Code configurations
☆51Updated 2 months ago
Alternatives and similar repositories for claude-code-evals
Users that are interested in claude-code-evals are comparing it to the libraries listed below
Sorting:
- Readymade evaluators for agent trajectories☆373Updated 2 months ago
- ☆215Updated 6 months ago
- ☆155Updated 7 months ago
- A collection of generative UI agents written with LangGraph.js☆349Updated 6 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆239Updated this week
- ☆201Updated last month
- The open-source multi-agent chat interface that lets you manage multiple agents in one dynamic conversation and add MCP servers for deep …☆457Updated 6 months ago
- ☆86Updated 11 months ago
- A bot with memory, built on LangGraph Cloud.☆138Updated last year
- ☆166Updated 6 months ago
- Salesforce Enterprise Deep Research☆732Updated last week
- 🧍♂️LLM as a manager for approval processes.☆208Updated 6 months ago
- A managed RAG API server.☆326Updated 5 months ago
- Build LangGraph agents with large numbers of tools☆459Updated 5 months ago
- CLI to generate LangGraph stubs from a specification☆96Updated 7 months ago
- An implementation of a computer use agent (CUA) using LangGraph☆184Updated 7 months ago
- ☆49Updated 5 months ago
- A Generative UI app for interacting with Computer Use Agents☆209Updated 7 months ago
- ☆463Updated 5 months ago
- Extract structured data from CUAD contracts using LangChain, build a knowledge graph, and query insights through a LangGraph agent - tran…☆137Updated 5 months ago
- ☆449Updated 3 months ago
- ☆267Updated 11 months ago
- LangGraph Studio template for creating an agent that does web research to genearte or enrich structured data.☆205Updated 3 weeks ago
- Named Entity Recognition using Claude Citations☆79Updated 5 months ago
- fullstack chat agent with authentication, request credits and payments built in☆169Updated 4 months ago
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher☆168Updated last year
- ☆209Updated 9 months ago
- This open-source project & guide shows you exactly how to implement Canvas UX pattern + LangGraph human-in-the-loop workflows in your AI …☆86Updated 7 months ago
- Prepare for meetings with company and attendee research notes, integrated with Google Calendar through MCP☆172Updated 3 months ago
- 🤖 An open-source, AI agent-native research canvas application that performs real-time search with HITL (Human in The Loop) capabilities,…☆340Updated last week