☑️ A curated list of tools, methods & platforms for evaluating AI reliability in real applications.
☆65Feb 12, 2026Updated 3 weeks ago
Alternatives and similar repositories for awesome-ai-eval
Users that are interested in awesome-ai-eval are comparing it to the libraries listed below
Sorting:
- A Free, Open Source MCP server for dynamic custom persona management with public a GitHub collection of personas, skills, templates, and …☆28Jan 7, 2026Updated 2 months ago
- A Ruby DSL for creating Claude Code hooks☆35Mar 2, 2026Updated last week
- React component for extracting thumbnails from video☆12Dec 11, 2024Updated last year
- Chemical Processes Control☆12Dec 9, 2025Updated 3 months ago
- A collection of Summoner clients and agents featuring example implementations and reusable templates☆22Feb 19, 2026Updated 2 weeks ago
- Chemical Processes Instrumentation☆14Jun 3, 2023Updated 2 years ago
- A curated list from our community of AI, ML and data science resources☆24Aug 21, 2025Updated 6 months ago
- Code used in the scientific article: Consequential life cycle assessment of carbon capture and utilization technologies within the chemic…☆13Apr 28, 2021Updated 4 years ago
- A Claude Code plugin that solves the same problems as community frameworks (GSD, BMAD, Ralph, Agent OS) — but using the tool's native arc…☆28Mar 1, 2026Updated last week
- ☆19Dec 20, 2025Updated 2 months ago
- Simple AutoHotKey Script for Minecraft. Includes AFK-Fishing, Auto-Sweep Attack and Nether Portal Calculator.☆11Jun 4, 2022Updated 3 years ago
- 🚀 A simple, modern, full-stack toolkit for Python 🐍☆38Oct 18, 2024Updated last year
- Collection of specialized agent definitions for Claude Code☆32Feb 2, 2026Updated last month
- Experimental framework taking inspiration from biological systems, combining compression-based architectures, group theory, and symmetry …☆14Nov 13, 2025Updated 3 months ago
- An MCP server that can spawn linux sandbox containers using docker and run commands in them via a TTY interface.☆25Sep 18, 2025Updated 5 months ago
- The Chemical Reaction Optimization (CRO) algorithm with dependent classes in python 3.☆11Apr 21, 2020Updated 5 years ago
- Cross-platform toolkit to enhance Claude Code with multi-LLM consensus, 8 specialist agents, semantic knowledge search, and one-command i…☆31Feb 16, 2026Updated 3 weeks ago
- CBE 30338 Chemical Process Control☆14Feb 27, 2024Updated 2 years ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago
- Linux Tools☆24Updated this week
- A modular framework for Spatial Code Navigation (SCNS) with Universal Code Coordinates (UCCS) protocols.☆19Sep 8, 2025Updated 6 months ago
- Backup program , menu based , CLI TUI utility for Linux distributions using Tar and rsync. Written in bash, CLI program.☆15Sep 24, 2022Updated 3 years ago
- ☆11Mar 25, 2021Updated 4 years ago
- ☆33Sep 25, 2025Updated 5 months ago
- Claude code slash commands creation for session management☆13Sep 13, 2025Updated 5 months ago
- A PHP library for Time-based One-Time Password (TOTP) authentication☆30Sep 5, 2025Updated 6 months ago
- This is a short introduction to data analysis in Jupyter notebooks for chemical engineering students.☆11Mar 16, 2022Updated 3 years ago
- Yii 2 Advanced Project Template using Shared Hosting☆10May 7, 2016Updated 9 years ago
- Stripped down versions of several archlinux packages, former name llvm-libs-debloated. [Maintainer=@Samueru-sama]☆14Updated this week
- VSCode Highlight Extension for DBML Language☆11Sep 14, 2019Updated 6 years ago
- Script to remove Linux bloatware☆10Jul 13, 2024Updated last year
- Gen AI Demo☆19Updated this week
- ☆28Mar 2, 2026Updated last week
- chemical master equation solver☆16May 2, 2018Updated 7 years ago
- [ArXiv 2025] A curated list of papers on on-device large language models, focusing on model compression and system optimization technique…☆23Jan 27, 2026Updated last month
- aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-firs…☆10May 9, 2015Updated 10 years ago
- Search over RDF schemas and OWL ontologies☆11Sep 28, 2013Updated 12 years ago
- Moonshot Bundler Bot is a lightweight Solana tool that builds and sends bundled transactions (priority fees, ATA creation, SOL transfers,…☆16Sep 10, 2025Updated 5 months ago
- ⚡A multithreaded toolkit for digital media processing using ffmpeg. It provides both a CLI and a GUI. If ffmpeg can do it, ffzap can do i…☆18Updated this week