strands-agents/evals

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/strands-agents/evals)

strands-agents / evals

A comprehensive evaluation framework for AI agents and LLM applications.

☆163

Alternatives and similar repositories for evals

Users that are interested in evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

strands-agents / sdk-typescript
View on GitHub
A model-driven approach to building AI agents in just a few lines of code.
☆694Jun 3, 2026Updated last month
strands-agents / shell
View on GitHub
Give your agent a shell without giving it the keys to your machine.
☆227Updated this week
strands-agents / agent-builder
View on GitHub
An example agent demonstrating streaming, tool use, and interactivity from your terminal. This agent builder can help you to build your o…
☆424May 12, 2026Updated 2 months ago
strands-agents / docs
View on GitHub
Documentation for the Strands Agents SDK. A model-driven approach to building AI agents in just a few lines of code.
☆194Jun 2, 2026Updated last month
strands-agents / tools
View on GitHub
A set of tools that gives agents powerful capabilities.
☆1,136Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aws / agentcore-cli
View on GitHub
The terminal experience for AgentCore!
☆224Updated this week
cagataycali / awesome-strands-agents
View on GitHub
Curated resources related Strands Agents.
☆61Jun 15, 2026Updated last month
strands-agents / agent-sop
View on GitHub
Natural language workflows that enable AI agents to perform complex, multi-step tasks with consistency and reliability.
☆1,109Updated this week
strands-agents / mcp-server
View on GitHub
This MCP server provides documentation about Strands Agents to your GenAI tools, so you can use your favorite AI coding assistant to vibe…
☆292Jun 25, 2026Updated 3 weeks ago
aws / bedrock-agentcore-sdk-typescript
View on GitHub
☆86Updated this week
cagataycali / devduck
View on GitHub
Minimalist AI agent that fixes itself when things break.
☆41May 11, 2026Updated 2 months ago
aws / bedrock-agentcore-sdk-python
View on GitHub
Python SDK for transforming any AI agent into a production-ready application. Framework-agnostic primitives for runtime, memory, authenti…
☆744Updated this week
strands-agents / harness-sdk
View on GitHub
Build an agent harness and control it end-to-end. Open-source SDK for production AI agents in Python & TypeScript - any model, any cloud.
☆6,688Updated this week
aws / bedrock-agentcore-starter-toolkit
View on GitHub
Python CLI toolkit for Amazon Bedrock AgentCore (legacy). For new projects, use the AgentCore CLI: https://github.com/aws/agentcore-cli
☆498Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
strands-labs / ai-functions
View on GitHub
Python functions powered by AI agents - with runtime post-conditions for reliable agentic workflows.
☆295Updated this week
strands-agents / samples
View on GitHub
Agent samples built using the Strands Agents SDK.
☆815Jul 17, 2026Updated last week
strands-labs / benchmark-harnesses
View on GitHub
Strands-based agents and harnesses for agentic benchmarks.
☆40Jul 7, 2026Updated 2 weeks ago
awslabs / agentcore-rl-toolkit
View on GitHub
Toolkit for Seamlessly Enabling RL Training on Any Agent with Bedrock AgentCore.
☆46Updated this week
cagataycali / strands-mlx
View on GitHub
Experimental: MLX model provider for Strands Agents - Build, train, and deploy AI agents on Apple Silicon.
☆37Apr 22, 2026Updated 3 months ago
mikegc-aws / async-agentic-tools
View on GitHub
True async agentic tools — the model keeps talking while tools run in the background
☆40May 28, 2026Updated last month
maxritter / aws-bedrock-multi-agent-blueprint
View on GitHub
Blueprint for running AWS Bedrock Multi-Agent AI collaboration with CDK, Graph DB, Streamlit and LangFuse
☆21May 2, 2025Updated last year
aws-samples / sample-agentic-platform
View on GitHub
A sample agentic ai platform to run agentic workflows on AWS using either EKS or Bedrock AgentCore with open source frameworks like LangC…
☆110Jun 18, 2026Updated last month
opensearch-project / agent-health
View on GitHub
☆30Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aws-samples / msk-powered-financial-data-feed
View on GitHub
Publish a real-time financial data feed to a Kafka client using Amazon MSK
☆14Nov 19, 2024Updated last year
awslabs / agentcore-samples
View on GitHub
Amazon Bedrock Agentcore accelerates AI agents into production with the scale, reliability, and security, critical to real-world deployme…
☆3,215Updated this week
cdklabs / cdk-ethereum-node
View on GitHub
CDK construct to deploy an Ethereum node running on Amazon Managed Blockchain
☆15Jul 1, 2026Updated 3 weeks ago
awslabs / fullstack-solution-template-for-agentcore
View on GitHub
Flexible Fullstack solution template for production-ready deployments of any use case on Amazon Bedrock AgentCore.
☆557Updated this week
amazon-archives / __template_MIT-0
View on GitHub
A template with a license appropriate for sample code, workshops, CloudFormation templates, and other small projects.
☆17Feb 1, 2023Updated 3 years ago
moritalous / bedrocksmith
View on GitHub
BedrockSmith - CloudWatch Logsに出力したBedrockの呼び出しログを整形して表示します
☆12Feb 3, 2025Updated last year
aws-samples / deploy-langfuse-on-ecs-with-fargate
View on GitHub
Self-hosting Langfuse on Amazon ECS with Fargate using CDK Python
☆77Jun 24, 2025Updated last year
aws-samples / sample-developer-environment
View on GitHub
AWS Cloud9 and CodeCommit alternative with GitOps pipeline
☆36Jul 2, 2026Updated 3 weeks ago
awslabs / neptuneml-toolkit
View on GitHub
☆28Apr 25, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
awslabs / stickler
View on GitHub
A library for evaluating structured data and AI outputs with weighted field comparison and custom comparators
☆37Updated this week
langchain-ai / langchain-aws
View on GitHub
Build LangChain Applications on AWS
☆334Updated this week
minorun365 / aws-level-checker
View on GitHub
AWSレベル判定くん
☆25Feb 22, 2026Updated 5 months ago
aristsakpinis93 / agentcore-langfuse-continous-eval-loop
View on GitHub
☆16Mar 26, 2026Updated 3 months ago
os1ma / cloud9-alternative
View on GitHub
AWS Cloud9 新規利用終了のための代替環境
☆27Mar 5, 2025Updated last year
aws-samples / anthropic-on-aws
View on GitHub
☆386Updated this week
gdamjan / uv-getting-started
View on GitHub
An example "getting started" python project based on `uv`
☆17Apr 20, 2026Updated 3 months ago