hidai25/eval-view

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hidai25/eval-view)

hidai25 / eval-view

Regression testing for AI agents. Snapshot behavior,diff tool calls,catch regressions in CI. Works with LangGraph, CrewAI, OpenAI, Anthropic.

☆124

Alternatives and similar repositories for eval-view

Users that are interested in eval-view are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OriNachum / claude-code-guide
View on GitHub
An interactive plugin for Claude Code delivering guided onboarding, workflow automation, and gamified progression.
☆117Jun 17, 2026Updated last month
agentculture / culture
View on GitHub
Culture turns isolated stochastic agents into cooperative, inspectable, improvable artificial colleagues.
☆111Jul 15, 2026Updated last week
AvivK5498 / Golem
View on GitHub
A self-hosted platform for creating and managing personal AI agents. Each agent gets its own Telegram bot, custom persona, tools, memory,…
☆26Jun 22, 2026Updated last month
flakestorm / flakestorm
View on GitHub
Flakestorm — Automated Robustness Testing for AI Agents. Stop guessing if your agent really works. FlakeStorm generates adversarial mutat…
☆43Apr 16, 2026Updated 3 months ago
aerroberts / fire-aspect
View on GitHub
☆14Jun 6, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yuvalsuede / claudia
View on GitHub
CLI task manager for AI agents with MCP server support
☆15Feb 28, 2026Updated 4 months ago
teabranch / agentic-developer-mcp
View on GitHub
An MCP server that scales development into controllable agentic, recursive flows, and build a feature from bottom-up
☆45Jun 29, 2025Updated last year
tony-baseball / Pitcher-Report-Generator-using-R-Markdown
View on GitHub
This code helps you generate post-game pitcher reports in R Markdown from Yakkertech/Trackman Data
☆11Nov 12, 2024Updated last year
cage1016 / alfred-paletter
View on GitHub
Extract palette from an image
☆15Nov 20, 2022Updated 3 years ago
Second-Inc / second
View on GitHub
The factory for custom internal software, purpose-built for human2agent work.
☆74Updated this week
infrarely / infrarely
View on GitHub
Stop prompting your agents to behave. Start engineering them to.
☆25Apr 17, 2026Updated 3 months ago
Gizra / Gizra-Way-Book
View on GitHub
The Gizra Way Definitive Guide
☆10Dec 2, 2019Updated 6 years ago
spences10 / ccrecall
View on GitHub
🔄️ Sync Claude Code transcripts to SQLite for analytics, uses node:sqlite
☆27Updated this week
MKme / Arduino-MQ3-Alcohol-Sensor
View on GitHub
Eric's Arduino Breathalyzer
☆10Apr 8, 2016Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
alirezarezvani / claude-code-mastery
View on GitHub
☆22Feb 26, 2026Updated 5 months ago
romiluz13 / cc10x
View on GitHub
The Loop Engine for Claude Code — engineer the loop, not the prompt. 1 router · 9 agents · 16 skills · 4 workflows. Fail-closed gates, te…
☆157Jul 19, 2026Updated last week
clutcher / bh
View on GitHub
Issue tracker for Better Highlights Intellij IDEA plugin
☆12Jul 16, 2023Updated 3 years ago
famitzsy8 / opencode-tool-search-tool
View on GitHub
An implementation of the Tool Search Tool in OpenCode as offered by Claude
☆22Jan 19, 2026Updated 6 months ago
NWYLZW / idea-comment-queries
View on GitHub
☆11Apr 20, 2025Updated last year
shep-ai / shep
View on GitHub
Ship features 10x faster. Built In Auto: Memory, K8S Agent & Security (SDD+SDLC) . 😇
☆239Jul 17, 2026Updated last week
capsulerun / bash
View on GitHub
Sandboxed bash for agents. Track changes on every command.
☆15May 16, 2026Updated 2 months ago
ZiyuGong-proj / Assessment-of-ACL-Injury-Risk-Based-on-Openpose
View on GitHub
Aiming at the detection of the potential injury risk of the anterior cruciate ligament (ACL)
☆15Feb 8, 2023Updated 3 years ago
briltec / furniture_App_UI_flutter
View on GitHub
This is the UI of Furniture App made using Flutter SDK. The original design was made by someone else in dribble and I tried to create the…
☆13Nov 7, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
eqtylab / agent-console
View on GitHub
Live view of Claude Code sessions and the ability to search them
☆87Dec 30, 2025Updated 6 months ago
Gizra / message_notify
View on GitHub
https://www.drupal.org/project/message_notify
☆11Jun 12, 2020Updated 6 years ago
getagentseal / agentseal
View on GitHub
Security toolkit for AI agents. Scan your machine for dangerous skills and MCP configs, monitor for supply chain attacks, test prompt inj…
☆333Jun 11, 2026Updated last month
catcam / hads
View on GitHub
Human-AI Document Standard — lightweight convention for AI-optimized technical documentation
☆28Jul 9, 2026Updated 2 weeks ago
Lets7512 / rlm-skill
View on GitHub
☆23Mar 6, 2026Updated 4 months ago
PacktPublishing / WordPress-101---The-Complete-Guide
View on GitHub
☆19Mar 24, 2025Updated last year
baz-scm / baz-cli
View on GitHub
CLI for AI-assisted manual code review
☆47Updated this week
PatrickSys / codebase-context
View on GitHub
Codebase Context gives AI agents understanding of your codebase through semantic code search, team conventions, patterns, and memory, so …
☆57May 11, 2026Updated 2 months ago
gossipcat-ai / gossipcat-ai
View on GitHub
Multi-agent code review mesh — orchestrates AI agents from multiple providers to review code in parallel, cross-review each other's findi…
☆38Jul 19, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Mirascope / mcp-community
View on GitHub
Easily run, deploy, and connect to MCP servers
☆23Mar 15, 2025Updated last year
DragonShadows1978 / AI-AfterImage
View on GitHub
Episodic memory for AI coding agents. The ghost of code written, persisting across sessions.
☆22May 10, 2026Updated 2 months ago
guimatheus92 / mcp-video-analyzer
View on GitHub
MCP server that turns any video — YouTube, Instagram, TikTok, Loom, X, Vimeo, direct URLs, local files — into transcripts, key frames, OC…
☆28Updated this week
0-Vault / Vault-0
View on GitHub
Vault-0: Agent Security, Monitor & x402 Wallet for OpenClaw. Encrypted secret vault, real-time agent monitor, policy enforcement, and nat…
☆15Feb 13, 2026Updated 5 months ago
IgorWarzocha / pi-vent
View on GitHub
A tool for your agent to give you feedback on issues.
☆15May 28, 2026Updated last month
simplifylabs / WearAI
View on GitHub
Empower your wearable tech with AI. WearAI integrates OpenAI's ChatGPT with your smartwatch, enabling real-time voice-to-text and text-to…
☆16Jul 24, 2023Updated 3 years ago
preloop / preloop
View on GitHub
The open-source AI agent control plane: MCP firewall, model gateway with budgets, human approvals, runtime observability, and audit trail…
☆37Updated this week