☆132 · Mar 21, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for agent-eval
Users that are interested in agent-eval are comparing it to the libraries listed below.
- Agent Skills for AI coding agents. Compatible with Claude Code, Cursor, Copilot, Windsurf, and other skills-compatible agents. ☆102 · Mar 27, 2026 · Updated last week
- ☆76 · Mar 21, 2026 · Updated 2 weeks ago
- CLI tool to automate changeset-based releases ☆151 · Feb 3, 2026 · Updated 2 months ago
- Textlint rules for Japanese official documents [in Japanese] ☆20 · May 23, 2021 · Updated 4 years ago
- cos is a toy operating system in C Language with reference to https://operating-system-in-1000-lines.vercel.app ☆12 · Aug 17, 2023 · Updated 2 years ago
- Power Assert in Swift ☆22 · Jan 20, 2023 · Updated 3 years ago
- Run `publint` to lint npm packages after the build. ☆21 · Updated this week
- An Rsbuild plugin to compress images ☆22 · Updated this week
- The purpose of this repo is to demonstrate how easy it is to create "Human-In-The-Loop" Durable Tools for MCP servers by leveraging Tempo… ☆19 · Aug 14, 2025 · Updated 7 months ago
- ☆20 · Jul 17, 2025 · Updated 8 months ago
- A Datasette instance for searching WebVid-10M ☆15 · Sep 30, 2022 · Updated 3 years ago
- Sandcastle: a web-based linux desktop environment on top of Vercel Sandbox (PoC) ☆45 · Feb 16, 2026 · Updated last month
- A Modbus TCP browser application as a command-line client. ☆13 · May 7, 2022 · Updated 3 years ago
- Enable `gh` command in Claude Code on the Web environment, just adding SessionStartHooks ☆46 · Mar 14, 2026 · Updated 3 weeks ago
- an npm trends visualizer ☆51 · Mar 17, 2026 · Updated 3 weeks ago
- ☆10 · Mar 1, 2026 · Updated last month
- json-rpc-on-a-stream ☆14 · Jan 7, 2022 · Updated 4 years ago
- ☆29 · Jun 12, 2025 · Updated 9 months ago
- An MCP server for logging activity in Roo Code/Cline. ☆24 · Aug 8, 2025 · Updated 8 months ago
- Fastest way to get a bleeding-edge, fully fledged Next.js app up and running. Right from your terminal! ☆32 · Aug 29, 2025 · Updated 7 months ago
- Prevent accidental PII leakage in LLM prompts before they hit the model. ☆58 · Updated this week
- ATProto OAuth Client for Cloudflare Workers ☆24 · Sep 25, 2025 · Updated 6 months ago
- The @estruyf/vscode package contains a couple of helpers to make Visual Studio Code Extension development easier. ☆13 · Jul 31, 2025 · Updated 8 months ago
- n-wise coverage tool for combinatorial testing ☆11 · Sep 7, 2019 · Updated 6 years ago
- Derive a key and secret key file from a name ☆20 · Dec 11, 2020 · Updated 5 years ago
- A browser spreadsheet with an integrated AI chat (with MCP support) powered by Groq inference ☆32 · Feb 20, 2026 · Updated last month
- ☆54 · Jan 9, 2025 · Updated last year
- Sugar functions for manipulating paths in rust. ☆25 · Feb 24, 2026 · Updated last month
- This repo lets you run mistral-7b in Google Colab. ☆16 · Oct 1, 2023 · Updated 2 years ago
- A Spotlight Importer for Xcode Playground. ☆11 · May 2, 2016 · Updated 9 years ago
- We should dance with Mocks (a.k.a. Test Doubles), but we don't use any Mocking libraries. Why? ☆11 · Dec 10, 2022 · Updated 3 years ago
- [ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion models ☆48 · Feb 26, 2026 · Updated last month
- A stream that can mimic network latency ☆21 · Feb 6, 2020 · Updated 6 years ago
- A command-line bridge tool that orchestrates external image generation commands to convert text/code strings to images. ☆26 · Jan 12, 2026 · Updated 2 months ago
- Measuring execution time of functions ☆17 · Jul 2, 2018 · Updated 7 years ago
- Integrate with OpenAI API Codex to shape and evolve code through generative iterations. ☆12 · Mar 31, 2026 · Updated last week
- A collection of various technical indicators implemented in LitScript ☆10 · Apr 5, 2022 · Updated 4 years ago
- Developed a full stack application imitating the popular social media app - X (formerly Twitter) using a Next.js with Typescript, Sass, T… ☆14 · Aug 6, 2025 · Updated 8 months ago
- ☆11 · Jul 19, 2024 · Updated last year