A comprehensive evaluation framework for AI agents and LLM applications.
☆153Jun 29, 2026Updated this week
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimalist AI agent that fixes itself when things break.☆40May 11, 2026Updated last month
- Amazon Nova Act is an AWS service for building and deploying highly reliable AI agents that automate UI-based workflows at scale.☆63Apr 30, 2026Updated 2 months ago
- From nothing to a deployed object detection model on SageMaker with Detectron2☆29Oct 17, 2023Updated 2 years ago
- The getting started sample demonstrates how to perform common tasks (CRUD operations) using the Azure Blob Service in Go.☆10Mar 24, 2018Updated 8 years ago
- A model-driven approach to building AI agents in just a few lines of code.☆698Jun 3, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆56Jun 11, 2026Updated 3 weeks ago
- Edit and Generate Anything in 3D world!☆13Apr 15, 2023Updated 3 years ago
- Python CLI toolkit for Amazon Bedrock AgentCore (legacy). For new projects, use the AgentCore CLI: https://github.com/aws/agentcore-cli☆497Updated this week
- CDK construct to deploy an Ethereum node running on Amazon Managed Blockchain☆15Jun 19, 2026Updated 2 weeks ago
- The new terminal experience for AgentCore!☆198Updated this week
- beko-translateは、Apple Silicon Mac向けのCLI翻訳ツールです。PDF見開き翻訳機能も同梱してあり原文・訳文を交互に表示できます。☆36Mar 25, 2026Updated 3 months ago
- The IDP Accelerator provides a scalable, serverless approach for automated document processing and information extraction using AWS servi…☆264Updated this week
- ☆13Mar 14, 2024Updated 2 years ago
- 👨💼Python Wrapper for the Linkedin API☆21Jun 30, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Manage Workflows with optional Scheduler or Event Arc triggers☆24Feb 24, 2026Updated 4 months ago
- Toolkit for Seamlessly Enabling RL Training on Any Agent with Bedrock AgentCore.☆46Jun 22, 2026Updated last week
- ☆15Jun 18, 2026Updated 2 weeks ago
- Deep learning for pedestrians: backpropagation in CNNs. Latex and PyTorch code to verify theoretical derivations.☆13Jun 21, 2022Updated 4 years ago
- ☆15Jul 4, 2025Updated last year
- An Ionic PWA using AWS Amplify and ML/AI services to do predictions on images☆13Mar 4, 2023Updated 3 years ago
- provides a Suricata Eve output for Kafka with Suricate Eve plugin☆15Nov 25, 2021Updated 4 years ago
- Blueprint for running AWS Bedrock Multi-Agent AI collaboration with CDK, Graph DB, Streamlit and LangFuse☆21May 2, 2025Updated last year
- ☆25Jun 5, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- GPT-4 を用いて、言語モデルの応答を自動評価するスクリプト☆17Jun 6, 2024Updated 2 years ago
- Self-hosting Langfuse on Amazon ECS with Fargate using CDK Python☆77Jun 24, 2025Updated last year
- Prompt Contracts☆48Oct 19, 2025Updated 8 months ago
- Insurance AI Assistant A smart system combining PostgreSQL, Milvus, and specialized AI agents (Life/Home/Auto) to answer insurance querie…☆30Apr 29, 2025Updated last year
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆17Aug 24, 2022Updated 3 years ago
- LLMPerf is a library for validating and benchmarking LLMs☆11Aug 13, 2024Updated last year
- Biologically-inspired persistent memory engine for Claude Code. 26 cognitive subsystems, Hopfield networks, predictive coding, causal dis…☆60Apr 1, 2026Updated 3 months ago
- BedrockSmith - CloudWatch Logsに出力したBedrockの呼び出しログを整形して表示します☆12Feb 3, 2025Updated last year
- Example project showing how you can use your fast.ai based scripts to let Amazon SageMaker perform the training and hosting of your model…