A comprehensive evaluation framework for AI agents and LLM applications.
☆103Apr 8, 2026Updated this week
Alternatives and similar repositories for evals
Users that are interested in evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minimalist AI agent that fixes itself when things break.☆37Apr 3, 2026Updated last week
- Amazon Nova Act is an AWS service for building and deploying highly reliable AI agents that automate UI-based workflows at scale.☆64Feb 20, 2026Updated last month
- StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション☆11Feb 15, 2025Updated last year
- The new terminal experience for AgentCore!☆79Updated this week
- A model-driven approach to building AI agents in just a few lines of code.☆568Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- ☆17Updated this week
- Rebalanser for .NET☆14Jan 7, 2019Updated 7 years ago
- ☆34Dec 13, 2025Updated 4 months ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing☆27Jan 27, 2026Updated 2 months ago
- Manage Workflows with optional Scheduler or Event Arc triggers☆22Feb 24, 2026Updated last month
- Blueprint for running AWS Bedrock Multi-Agent AI collaboration with CDK, Graph DB, Streamlit and LangFuse☆21May 2, 2025Updated 11 months ago
- ☆25Jun 5, 2024Updated last year
- A quick fix model for the Charm BubbleTea ecosystem.☆15Nov 27, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Useful command builder for spf13/cobra☆11Jun 29, 2020Updated 5 years ago
- Self-hosting Langfuse on Amazon ECS with Fargate using CDK Python☆77Jun 24, 2025Updated 9 months ago
- [CVPR 2026] Official Implementation of Edit2Perceive☆35Feb 21, 2026Updated last month
- LLMPerf is a library for validating and benchmarking LLMs☆11Aug 13, 2024Updated last year
- An example agent demonstrating streaming, tool use, and interactivity from your terminal. This agent builder can help you to build your o…☆402Jan 14, 2026Updated 3 months ago
- How to build a simplified Corrective RAG assistant with Amazon Bedrock using LLMs, Embeddings model, Knowledge Bases for Amazon Bedrock, …☆16May 22, 2024Updated last year
- A controller for Godot that handles similarly to Quake and Titanfall 2.☆39Oct 1, 2025Updated 6 months ago
- Random Pluto notebooks in Julia☆12Oct 23, 2025Updated 5 months ago
- Examples showing use of NGC containers and models withing Amazon SageMaker☆17Oct 4, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Multi-modal Assistant With Advanced RAG And Amazon Bedrock Claude 3☆20Feb 7, 2025Updated last year
- ☆25Nov 18, 2025Updated 4 months ago
- ☆11Mar 19, 2026Updated 3 weeks ago
- Create and manage Artifact Registry repositories☆24Feb 24, 2026Updated last month
- 한국어 소설 텍스트를 위한 자연어처리 라이브러리입니다. Natural Language Processing Library for Korean Literary Text. (Will be open in February, 2024)☆11Jan 16, 2024Updated 2 years ago
- My dotfiles managed by chezmoi☆34Updated this week
- Heroku/Dash app for inDelphi.☆11Dec 8, 2022Updated 3 years ago
- ☆25May 29, 2025Updated 10 months ago
- ☆13Jul 14, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Benchmarking data and script used for LLM multi-agent collaboration systems from AWS Bedrock Agents Science team.☆18Dec 10, 2024Updated last year
- E コマースにおける生成AI 4大ユースケースに関する Amazon Bedrock デモ☆18Feb 19, 2025Updated last year
- How to build an advanced RAG router based assistant with Amazon Bedrock using LLMs, Embeddings model, and Knowledge Bases for Amazon Bedr…☆22Dec 3, 2024Updated last year
- Implementation of a LangGraph.js CheckpointSaver that uses a AWS's DynamoDB☆16Feb 10, 2025Updated last year
- JavaScript examples that can be used in Autify.☆20Nov 15, 2025Updated 4 months ago
- Swallowプロジェクト 大規模言語モデル 評価スクリプト☆24Sep 17, 2025Updated 6 months ago
- Bayesball: Bayesian analysis of batting average☆12Mar 4, 2018Updated 8 years ago