Skills for AI Evals to complement the course: AI Evals For Engineers & PMs
☆761Mar 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for evals-skills
Users that are interested in evals-skills are comparing it to the libraries listed below.
- A recursive coding agent inspired by RLMs☆167Updated this week
- Claude Code CLI skill: Interactive assistant for intercepting, debugging, analyzing and reviewing Claude Code API requests using mitmprox…☆151Nov 8, 2025Updated 4 months ago
- ☆14Jul 28, 2024Updated last year
- Track associated files of Ruby tests and cache their results☆19Mar 9, 2026Updated 2 weeks ago
- Implementation of Recursive Language Model paper from scratch☆38Feb 10, 2026Updated last month
- Rust implementation of KittenTTS. CLI and API server.☆189Updated this week
- Direct Preference Optimization Implementation☆17Feb 1, 2024Updated 2 years ago
- ☆49Mar 9, 2026Updated 2 weeks ago
- Minimal example of MCP for parsing llms.txt☆40Apr 8, 2025Updated 11 months ago
- Cloud-synced dashboards for OpenCode and Claude Code. Track sessions, search with semantic lookup, export eval datasets.☆319Feb 23, 2026Updated last month
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i…☆29Mar 8, 2026Updated 2 weeks ago
- ☆29Feb 11, 2026Updated last month
- AI helpers for Elixir projects☆20Jan 28, 2024Updated 2 years ago
- A Python package to dynamically load functions for OpenAI Assistant☆55Dec 6, 2023Updated 2 years ago
- ChatBot App built using LangChain and Lightning AI☆17Mar 4, 2023Updated 3 years ago
- It’s a coding agent, but I’m trying to strip it down to the bare essentials and offload everything else to MCP/subagents, snapping the pi…☆23Feb 23, 2026Updated last month
- ☆40Jan 28, 2026Updated last month
- Access Jina AI news via ssh guest@news.jina.ai☆13May 3, 2024Updated last year
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated 2 years ago
- 🚀 One-click setup for a modern terminal environment - Oh My Zsh + Powerlevel10k + a suite of modern CLI tools☆41Feb 4, 2026Updated last month
- (Alternative) Visualizer for XState☆12Mar 2, 2023Updated 3 years ago
- Claude Code sub-agents definitions and prompts for building a YouTube social proof widget powered by ChatGPT widget☆26Sep 5, 2025Updated 6 months ago
- Sandboxed Ruby for AI agents☆47Feb 27, 2026Updated 3 weeks ago
- Interactive form tool for pi-agent to gather user responses with keyboard navigation, themes, and image attachments☆162Mar 16, 2026Updated last week
- YLL is a lightweight and secure URL shortener built with Ruby on Rails. It provides a simple way to generate short links, track clicks, a…☆19May 26, 2025Updated 9 months ago
- 7 GUIs implemented using XState☆14Apr 11, 2021Updated 4 years ago
- ☆65Mar 2, 2026Updated 3 weeks ago
- Inspired by Cow Clicker, but on-chain. Available as a Farcaster Frame!☆32Jan 28, 2024Updated 2 years ago
- Python script for analyzing the effect your fantasy schedule has on the league standings☆11Dec 7, 2018Updated 7 years ago
- A tool for context compression of tool call results☆165Nov 21, 2025Updated 4 months ago
- 🌀 an Interactive Terminal for AI (interminai) - A PTY proxy enabling AI interaction with interactive CLI applications like vim, git reba…☆45Feb 9, 2026Updated last month
- Build the right thing from the start. Two AI agents collaborate to transform ideas into implementation-ready technical specs.☆34Aug 14, 2025Updated 7 months ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- Prompt-driven automation platform - Transform natural language into executable workflows☆32Jul 13, 2025Updated 8 months ago
- This is a project that aims to use Claude.ai's coding capabilities, artifact capabilities, and project capabilities to create a new metho…☆12Jan 31, 2025Updated last year
- ☆952Mar 15, 2026Updated last week
- This is the repository for the API backend of theseventhman.net☆10Dec 8, 2022Updated 3 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 6 months ago
- ☆12Nov 18, 2023Updated 2 years ago