The agent benchmark that scores the full stack — harness, config, and model — not just the LLM. Trace-based scoring, reliability metrics, configuration diagnostics.
☆117Jun 22, 2026Updated last week
Alternatives and similar repositories for clawbench
Users that are interested in clawbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14May 12, 2025Updated last year
- [Paper][WWW2025] OntoTune: Ontology-Driven Self-training for Aligning Large Language Models☆57Jul 21, 2025Updated 11 months ago
- ☆37Nov 26, 2025Updated 7 months ago
- Sorty: The FOSS AI File Organiser☆37Updated this week
- ☆40Feb 17, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆25Jan 9, 2024Updated 2 years ago
- A lightweight library for working with JSON Lines (JSONL) data in Swift.☆18Jul 24, 2025Updated 11 months ago
- ☆11Nov 27, 2023Updated 2 years ago
- A Swift version of Marvis TTS, running locally on Apple Silicon using MLX Swift.☆23Jan 4, 2026Updated 5 months ago
- A template written in Ruby using Sinatra to create Facebook messenger robots☆11Apr 22, 2016Updated 10 years ago
- ☆36May 30, 2025Updated last year
- Implementation of Attention-based Fusion for Multi-source Human Image Generation, S. Lathuilière, E. Sangineto, A. Siarohin, N. Sebe, WAC…☆10Oct 9, 2020Updated 5 years ago
- Official implementation of our CVPR'22 paper.☆13Nov 18, 2022Updated 3 years ago
- Kubernetes operator for self-hosted LLM inference across a heterogeneous GPU fleet: NVIDIA CUDA, AMD Vulkan, and Apple Silicon Metal. Run…☆148Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆48Jan 13, 2026Updated 5 months ago
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents☆22Jan 6, 2025Updated last year
- Simple, modular graphs for iOS.☆22Mar 2, 2021Updated 5 years ago
- ☆14Jul 24, 2023Updated 2 years ago
- Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"☆23Mar 30, 2024Updated 2 years ago
- MVP for updated PEP 543 proposal☆14Jun 12, 2026Updated 2 weeks ago
- Code for "Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference", NeurIPS 2021☆15Dec 2, 2021Updated 4 years ago
- Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020☆42Jul 14, 2020Updated 5 years ago
- ☆21Mar 5, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆58May 28, 2024Updated 2 years ago
- Generates graphs from Machinery state machines☆14Oct 18, 2020Updated 5 years ago
- fuzzy matching with Levenshtein, Damerau-Levenshtein, Bitap and n-gram☆24Jul 31, 2025Updated 10 months ago
- ✏ Solidity support for VSCode☆10Jan 11, 2023Updated 3 years ago
- PPT2Fig 用来把 PPT 页面导出成适合论文、汇报和文档插图使用的 PDF,并自动裁掉多余白边。☆33Apr 24, 2026Updated 2 months ago
- ☆16Aug 5, 2022Updated 3 years ago
- Toolkit for allowing inference and serving with MXNet in SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https…☆29Sep 13, 2023Updated 2 years ago
- A library for sending software performance metrics from Python libraries and apps to statsd.☆31May 19, 2026Updated last month
- Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, …☆28Feb 23, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- OWASP Zed Attack Proxy plugin for py.test☆13Sep 10, 2015Updated 10 years ago
- Linux Security Module Stacking☆10Apr 25, 2026Updated 2 months ago
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning☆36Jun 10, 2025Updated last year
- ☆13Apr 14, 2025Updated last year
- Deterministic and Agentic patterns for installing and maintaining successful production applications☆100Jan 25, 2026Updated 5 months ago
- Project page of "GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation"☆21Apr 3, 2023Updated 3 years ago
- ☆26Mar 4, 2026Updated 3 months ago