☑️ A curated list of tools, methods & platforms for evaluating AI reliability in real applications
☆69Mar 25, 2026Updated 3 weeks ago
Alternatives and similar repositories for awesome-ai-eval
Users that are interested in awesome-ai-eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mass Mailer for sending big email campaigns from multiple smtp servers on windows.☆17Jan 5, 2026Updated 3 months ago
- An archiving/caching statistics tool for the Brawlhalla API☆12Apr 8, 2026Updated last week
- AI-powered Ethereum crypto trading bot using ChatGPT for automated DeFi strategies, arbitrage, and passive income generation.☆18Jan 12, 2026Updated 3 months ago
- Provide quick access to useful actions by adding context menus to your iOS app.☆29Dec 24, 2025Updated 3 months ago
- The pre-flight check for AI agents☆26Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Your personal AI assistant KIT for automated trading with openclaw, with full documentation available at https://devs.sidex.fun/documenta…☆21Feb 13, 2026Updated 2 months ago
- MCP server for spreadsheet analysis and editing. Slim, token-efficient tool surface designed for LLM agents.☆37Apr 1, 2026Updated 2 weeks ago
- Quick setup for projects or repositories using Claude Code with built-in agents, ticket system, and planning tools.☆53Jan 18, 2026Updated 3 months ago
- ☆67Feb 14, 2026Updated 2 months ago
- ☆88Mar 20, 2026Updated 3 weeks ago
- Production operations framework for AI-powered SaaS. The architectural patterns, failure modes, and operational playbooks that determine …☆14Mar 10, 2026Updated last month
- Turn ML models into APIs with one command☆22Jan 29, 2026Updated 2 months ago
- Make coding agents talk to each other☆42Updated this week
- Full-Stack Development Platform for Building Reliable Agents☆209Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆93Jan 27, 2026Updated 2 months ago
- Stop feeding your AI agent the entire codebase. Give it ctx instead. FIrst context engine for AI agents that actually works.☆61Feb 14, 2026Updated 2 months ago
- Scam intelligence, phishing attribution, drainer mapping. Legal OSINT only. Public data. Real cases. For researchers and victims.☆96Nov 30, 2025Updated 4 months ago
- The Zero-Config TypeScript Framework for Modern Backends.☆98Feb 2, 2026Updated 2 months ago
- Dockerized Hytale server with auto-updates, configurable env vars, persistent storage, and optional authentication.☆31Feb 16, 2026Updated 2 months ago
- Free poc, open-source hardware ID spoofer for bypassing game bans. Changes HWID, MAC, disk serials. We simply do not wish harm to anybody…☆23Dec 31, 2025Updated 3 months ago
- ☆167Oct 28, 2025Updated 5 months ago
- Lightweight & Fast Security Scanner for React Native & Expo☆453Mar 31, 2026Updated 2 weeks ago
- ☆186Apr 9, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Free, Open Source MCP server for dynamic custom persona management with public a GitHub collection of personas, skills, templates, and …☆29Updated this week
- A modern PowerShell-based GUI tool for Microsoft Intune administration - including device ownership analysis, configuration backup, assig…☆32Jan 14, 2026Updated 3 months ago
- Agent orchestration framework for spec-driven development☆49Mar 14, 2026Updated last month
- Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes☆817Updated this week
- A Swift Package for building a “code review bot” workflow in Swift — intended to automate review-style feedback (lint-like suggestions, h…☆18Jan 19, 2026Updated 3 months ago
- TeleMem is a high-performance drop-in replacement for Mem0, featuring semantic deduplication, long-term dialogue memory, and multimodal v…☆449Apr 7, 2026Updated last week
- The RL training platform. Use ReinforceNow to train reliable AI agents from raw data to production.☆32Feb 9, 2026Updated 2 months ago
- quantum_notary is a command-line tool for cryptographically signing and verifying Software Bills of Materials (SBOMs) using post-quantum …☆196Feb 20, 2026Updated last month
- Worlds first open-source real-time end-to-end spoken dialogue model with personalized voice cloning.☆531Jan 28, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A showcase of Nigeria's most innovative and disruptive digital entrepreneurs. This repo catalogs the robust infrastructure hosting next-g…☆75Aug 10, 2025Updated 8 months ago
- Polykalshi AI Agent (Rust Edition): Unleash the power of AI with Polykalshi, a blazing-fast, highly efficient AI agent built entirely in …☆148Apr 2, 2026Updated 2 weeks ago
- ☆18Apr 6, 2026Updated last week
- ☆343Feb 20, 2026Updated last month
- 🚀 Kill the Junior AI Era. 🤖 Level up your AI code to Principal standards. No more sloppy lines or junior mistakes. Automated ESLint ✨ T…☆33Apr 1, 2026Updated 2 weeks ago
- Teaching LLMs to reason in the Latent Space to precondition responses.☆98Updated this week
- Hyperliquid Copy Trading Bot — Perpetual DEX Trading Bot for Hyperliquid Perps, hyperliquid trading bot hyperliquid trading bot hyperliqu…☆76Apr 3, 2026Updated 2 weeks ago