AnswerLayer / sniffbench
Evaluate coding agents. Like a sniff test, but it's a benchmark.
☆25 Updated 3 weeks ago
Alternatives and similar repositories for sniffbench
Users who are interested in sniffbench are comparing it to the libraries listed below.
- An OpenSource Deep Research library with reasoning ☆170 Updated last month
- ☆39 Updated last week
- Turn any MCP server into a Python module ☆235 Updated 2 months ago
- Turn any question into multi-agent exploration. Recursive Claude agents that spawn sub-agents. ☆158 Updated last month
- Metadspy: The framework for specifying, not programming, language models ☆88 Updated 7 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance. ☆272 Updated 3 months ago
- ☆44 Updated 7 months ago
- DSPy module for OpenAI Codex SDK - signature-driven agentic workflows ☆152 Updated last month
- Deep research agents using MiniMax M2.1 interleaved thinking ☆194 Updated last month
- A prompt optimization system that adapts your prompts for different AI providers. ☆158 Updated last month
- ☆53 Updated 2 weeks ago
- ☆85 Updated 4 months ago
- A web-based Kanban board for viewing Claude Code tasks ☆102 Updated last week
- A Model Context Protocol (MCP) server that provides advanced code analysis and reasoning capabilities powered by Google's Gemini AI ☆102 Updated 3 weeks ago
- ☆37 Updated 5 months ago
- Claude Code infrastructure with auto-activating skills and framework-specific kits. Install complete Claude Code infrastructure in 30 se… ☆59 Updated 2 months ago
- An MCP server for loading skills (shim for non-claude clients). ☆334 Updated 2 months ago
- Cross-platform desktop app for agentic chat powered by Claude Agent SDK. ☆184 Updated 2 months ago
- Awesome list of apps that work with OpenRouter. OpenRouter provides access to 300+ AI Models through a single API. ☆156 Updated this week
- Z.AI API Playground - Complete examples for GLM-4.7, Vision, Image/Video Generation, Audio, and more. Powered by Z.AI-GLM-4.7-Coding Plan ☆47 Updated 3 weeks ago
- The Coral Reef is a collection of awesome agents for multi-agent systems, built by the Coral Protocol team. It’s organised into categorie… ☆47 Updated 3 months ago
- Your own Claude Code UI, local/e2b/modal sandbox, in-browser VS Code, terminal, multi-provider support (Max, Z.AI, OpenRouter), custom sk… ☆186 Updated this week
- Comprehensive UX Designer skill based on the Anthropic Agent Skills guide and our design philosophy ☆70 Updated 2 months ago
- Run Claude Agent (Claude Code) in a sandbox, control it via websocket ☆501 Updated last month
- Zero-setup CLI to download images from iCloud share links. Bridge iPhone screenshots to remote AI coding sessions: paste link, get image. … ☆25 Updated this week
- ☆98 Updated last month
- A simple TUI to watch tmux sessions ☆124 Updated 3 weeks ago
- Autonomous agent loop for implementing features ☆48 Updated 3 weeks ago
- 🧠 Advanced Claude streaming interface with interleaved thinking, dynamic tool discovery, and MCP integration. Watch Claude think through… ☆185 Updated 7 months ago
- ☆86 Updated 4 months ago