The agent benchmark that scores the full stack — harness, config, and model — not just the LLM. Trace-based scoring, reliability metrics, configuration diagnostics.
☆124Jun 24, 2026Updated last week
Alternatives and similar repositories for shellbench
Users that are interested in shellbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14May 12, 2025Updated last year
- ☆11Sep 26, 2022Updated 3 years ago
- ☆37Nov 26, 2025Updated 7 months ago
- ☆40Feb 17, 2026Updated 4 months ago
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆25Jan 9, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 软微新圣经----大兴究竟有什么可以输?☆14Sep 18, 2022Updated 3 years ago
- Implementation of Attention-based Fusion for Multi-source Human Image Generation, S. Lathuilière, E. Sangineto, A. Siarohin, N. Sebe, WAC…☆10Oct 9, 2020Updated 5 years ago
- Official implementation of our CVPR'22 paper.☆13Nov 18, 2022Updated 3 years ago
- ☆28Jun 12, 2025Updated last year
- ☆16Sep 17, 2021Updated 4 years ago
- ☆10May 25, 2023Updated 3 years ago
- Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020☆42Jul 14, 2020Updated 5 years ago
- ☆21Mar 5, 2025Updated last year
- ☆58May 28, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PostgreSQL 数据库中文手册☆14Dec 27, 2014Updated 11 years ago
- Generates graphs from Machinery state machines☆14Oct 18, 2020Updated 5 years ago
- Custom firmware for the HackRF+PortaPack H1/H2/H4☆21Updated this week
- A task focused web browser for working with Claude.ai chat to smooth the workflow for projects☆16Sep 3, 2025Updated 9 months ago
- ✏ Solidity support for VSCode☆10Jan 11, 2023Updated 3 years ago
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control.This project aims to investigate the potenti…☆16Jun 29, 2024Updated 2 years ago
- Multitask NLU architecture for text and token classification tasks.☆14Jan 7, 2023Updated 3 years ago
- A library for sending software performance metrics from Python libraries and apps to statsd.☆31May 19, 2026Updated last month
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Apr 14, 2025Updated last year
- Deterministic and Agentic patterns for installing and maintaining successful production applications☆100Jan 25, 2026Updated 5 months ago
- A tiny search engine.☆13Sep 6, 2022Updated 3 years ago
- Rigorously evaluating autonomous systems for cybersecurity at scale☆31Jul 9, 2025Updated 11 months ago
- Experimental stub files for PyMongo☆13Mar 3, 2022Updated 4 years ago
- Write SQL-like queries over JavaScript data structures☆10Jan 30, 2020Updated 6 years ago
- Claude Code hook that detects context compaction and injects a reminder to re-read AGENTS.md, preventing post-compaction rule amnesia in …☆44Apr 29, 2026Updated 2 months ago
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆36Jun 16, 2025Updated last year
- GenDB, an LLM-Powered Generative Query Engine Built for the Future☆66Jun 8, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Jun 23, 2026Updated last week
- 新版《Redis 设计与实现》的支持网站。☆12May 1, 2024Updated 2 years ago
- Next-Toggle is just a simple plug and use, theme toggle button with multiple light and dark themes.☆11May 9, 2024Updated 2 years ago
- Elastic Workplace Search Official Python Client☆10Aug 8, 2024Updated last year
- Automated Theorem Prover inspired by Aletheia. Claude Code for mathematicians.☆76Apr 20, 2026Updated 2 months ago
- Dongliang Mu de Blog☆10Apr 24, 2026Updated 2 months ago
- Fast, zero-copy HTML Parser written in Rust☆30Dec 6, 2025Updated 6 months ago