A comprehensive evaluation framework for AI agents and LLM applications.
☆78Feb 27, 2026Updated this week
Alternatives and similar repositories for evals
Users who are interested in evals are comparing it to the libraries listed below
- From nothing to a deployed object detection model on SageMaker with Detectron2☆29Oct 17, 2023Updated 2 years ago
- Customization of kubernetes YAML configurations, with native integration to Hashicorp Vault.☆10Jan 27, 2022Updated 4 years ago
- hierarchical core-periphery structure☆10Jul 21, 2023Updated 2 years ago
- An OpenAPI to TypeScript generator.☆13Updated this week
- Alarm routing engine for security and platform incident response teams.☆12Oct 23, 2019Updated 6 years ago
- A tiny JavaScript library that allows for real-time formatting of numbers & currencies, ~1600 bytes minified + gzipped.☆10Feb 12, 2025Updated last year
- A password manager to share TOTP with your team☆14Jan 7, 2021Updated 5 years ago
- LLMPerf is a library for validating and benchmarking LLMs☆11Aug 13, 2024Updated last year
- Turn an arbitrary command into a Kubernetes Key Management Service GRPC server☆15Apr 4, 2018Updated 7 years ago
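- beko-translate is a CLI translation tool for Apple Silicon Macs. It also bundles a PDF facing-page translation feature that can display the original and translated text alternately.☆32Feb 12, 2026Updated 2 weeks ago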
- A tool for generating sample cost usage data for testing purposes☆10Updated this week
- Statically-typed localization messages.☆10Oct 11, 2020Updated 5 years ago
- Social network programming interface with support for Twitter, Facebook, ..., and easily add more.☆13Dec 4, 2017Updated 8 years ago
- Go crypto.Signer and crypto.Decrypter that uses AWS KMS asymmetric keys☆11Jan 2, 2024Updated 2 years ago
- code for epidemics spreading, heterogeneous random walk on network☆13Apr 12, 2021Updated 4 years ago
- ThRust is a software framework for thermodynamic and probabilistic computing.☆10Jun 14, 2023Updated 2 years ago
- Transform AWS Config snapshots to a more AWS Athena-friendly format.☆11Aug 26, 2020Updated 5 years ago
- Every element is an HTML.☆12Nov 6, 2023Updated 2 years ago
- Buildkite trigger for Gerrit☆11Feb 13, 2025Updated last year
- Collection of AWS Fault Injection Simulator (FIS) experiment templates. These templates let you perform chaos engineering experiments on …☆10Dec 3, 2021Updated 4 years ago
- ☆15Nov 19, 2020Updated 5 years ago
- A Golang implementation of the Jackson-Smile data format☆12Jan 28, 2026Updated last month
- gRPC mocks with Jsonnet☆13Mar 25, 2025Updated 11 months ago
- PKCS#7 Padding for Go☆11Apr 24, 2020Updated 5 years ago
- A set of tools designed for CQ5, for both usage and inspiration☆15Jul 8, 2022Updated 3 years ago
- CLI toolkit for deploying AI agents to Amazon Bedrock AgentCore. Zero infrastructure management with built-in gateway and memory integrat…☆428Feb 17, 2026Updated last week
- Heroku/Dash app for inDelphi.☆11Dec 8, 2022Updated 3 years ago
- ACMagent - automates ACM certificates approval using cli☆11Mar 25, 2021Updated 4 years ago
- A golang implementation of Amazon's Ion data notation☆13Jun 14, 2020Updated 5 years ago
- ☆15Jul 4, 2025Updated 7 months ago
- Benchmark of LLMs on real open-source projects against dependency hell, legacy toolchains, and complex build systems.☆52Dec 23, 2025Updated 2 months ago
- This utility verifies all commands used by a shell script against an allow list☆11Jan 1, 2024Updated 2 years ago
- Kye☆23Sep 30, 2022Updated 3 years ago
- Small projects and experiments with the BeagleBone AI-64 platform (mostly written in Go).☆10Dec 28, 2025Updated 2 months ago
- Tool for plotting Go benchmark results☆17Jul 10, 2024Updated last year
- Content for the Athena Guide (https://athena.guide)☆11Nov 4, 2024Updated last year
- A Modular System for Flexible, High-Performance Traffic http://www.ict-mplane.eu/☆24Oct 4, 2018Updated 7 years ago
- Library for creating fake OIDC providers in tests☆13Feb 9, 2026Updated 3 weeks ago
- ☆15Nov 20, 2025Updated 3 months ago