Skills for AI Evals to complement the course: AI Evals For Engineers & PMs
☆1,290Mar 3, 2026Updated 2 months ago
Alternatives and similar repositories for evals-skills
Users that are interested in evals-skills are comparing it to the libraries listed below.
- Python port of the Flue: The Agent Harness Framework☆78Updated this week
- Claude Code CLI skill: Interactive assistant for intercepting, debugging, analyzing and reviewing Claude Code API requests using mitmprox…☆156Nov 8, 2025Updated 6 months ago
- ☆14Jul 28, 2024Updated last year
- Cloud-synced dashboards for OpenCode and Claude Code. Track sessions, search with semantic lookup, export eval datasets.☆357Feb 23, 2026Updated 2 months ago
- Implementation of Recursive Language Model paper from scratch☆45Feb 10, 2026Updated 3 months ago
- ☆69Updated this week
- Low code framework to build and launch a crew of AI agents with shared state. See docs https://axcrew.dev☆42Mar 30, 2026Updated last month
- walterra's collections of helpers for agentic coding☆34Mar 23, 2026Updated last month
- Direct Preference Optimization Implementation☆17Feb 1, 2024Updated 2 years ago
- Upload your Claude Code transcripts to the web☆20May 26, 2025Updated 11 months ago
- ☆86Apr 7, 2026Updated last month
- CoinGecko CLI - Real Time & Historical Crypto Data☆120Updated this week
- Minimal example of MCP for parsing llms.txt☆39Apr 8, 2025Updated last year
- My emacs setup. 25+ years in the making, not counting reboots.☆26Apr 28, 2026Updated 3 weeks ago
- ☆49Mar 9, 2026Updated 2 months ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i…☆29Mar 8, 2026Updated 2 months ago
- AI helpers for Elixir projects☆20Jan 28, 2024Updated 2 years ago
- ☆47Apr 25, 2026Updated 3 weeks ago
- Working reverse-engineered Claude Code CLI rebuilt from source analysis to reproduce the original terminal workflow☆109Apr 18, 2026Updated last month
- ChatBot App built using LangChain and Lightning AI☆17Mar 4, 2023Updated 3 years ago
- A Python package to dynamically load functions for OpenAI Assistant☆55Dec 6, 2023Updated 2 years ago
- ☆41Jan 28, 2026Updated 3 months ago
- Extract design tokens from Figma and convert them into code without plugins or Dev Mode required.☆16Jul 7, 2025Updated 10 months ago
- Implements a lightweight workflow for Codex inspired by Recursive Language Models (MIT). Now known as 'recursive-mode'☆57Apr 10, 2026Updated last month
- (Alternative) Visualizer for XState☆12Mar 2, 2023Updated 3 years ago
- Claude Code sub-agents definitions and prompts for building a YouTube social proof widget powered by ChatGPT widget☆27Sep 5, 2025Updated 8 months ago
- Reusable components for AI coding agents: skills, subagents, MCP servers, and extensions.☆41May 13, 2026Updated last week
- ☆143Jan 13, 2026Updated 4 months ago
- ☆116Mar 27, 2026Updated last month
- Automated code review loop extension for Pi coding agent☆82Apr 15, 2026Updated last month
- ☆25Apr 25, 2026Updated 3 weeks ago
- Sandboxed Ruby for AI agents☆50Feb 27, 2026Updated 2 months ago
- Drawing inspiration from Andrej Karpathy's LLM Council, this is an implementation for coding. LLMs evaluate each other and generate the b…☆23Dec 7, 2025Updated 5 months ago
- cursor can now read your terminal in realtime☆22Sep 14, 2025Updated 8 months ago
- ☆11Jun 13, 2024Updated last year
- A self-improving product system that reads reports, identifies priorities, and autonomously implements fixes☆522Jan 23, 2026Updated 3 months ago
- 7 GUIs implemented using XState☆14Apr 11, 2021Updated 5 years ago
- Production-grade agent orchestration for Claude Code - 11 agents, 46 MCP tools, SQLite+FTS5, drift detection, consensus checkpoints☆49Jan 30, 2026Updated 3 months ago
- The Destructive Command Guard (dcg) is for blocking dangerous git and shell commands from being executed by agents.☆1,020Updated this week