openclaw/clawbench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/openclaw/clawbench)

openclaw / clawbench

The agent benchmark that scores the full stack — harness, config, and model — not just the LLM. Trace-based scoring, reliability metrics, configuration diagnostics.

☆117

Alternatives and similar repositories for clawbench

Users that are interested in clawbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eric-haibin-lin / verl-data
View on GitHub
☆14May 12, 2025Updated last year
zjukg / OntoTune
View on GitHub
[Paper][WWW2025] OntoTune: Ontology-Driven Self-training for Aligning Large Language Models
☆57Jul 21, 2025Updated 11 months ago
NovaSky-AI / SkyRL-OpenHands
View on GitHub
☆37Nov 26, 2025Updated 7 months ago
sorty-organizer / Sorty
View on GitHub
Sorty: The FOSS AI File Organiser
☆37Updated this week
ByteDance-Seed / Seed2.0
View on GitHub
☆40Feb 17, 2026Updated 4 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
facebookresearch / emphassess
View on GitHub
This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …
☆25Jan 9, 2024Updated 2 years ago
mattt / JSONLines
View on GitHub
A lightweight library for working with JSON Lines (JSONL) data in Swift.
☆18Jul 24, 2025Updated 11 months ago
o-ifeanyi / swift-trivia
View on GitHub
☆11Nov 27, 2023Updated 2 years ago
Marvis-Labs / marvis-tts-swift
View on GitHub
A Swift version of Marvis TTS, running locally on Apple Silicon using MLX Swift.
☆23Jan 4, 2026Updated 5 months ago
luisbebop / facebook-robot-sinatra
View on GitHub
A template written in Ruby using Sinatra to create Facebook messenger robots
☆11Apr 22, 2016Updated 10 years ago
amazon-science / PrefEval
View on GitHub
☆36May 30, 2025Updated last year
Stephlat / Multi-source-Human-Image-Generation
View on GitHub
Implementation of Attention-based Fusion for Multi-source Human Image Generation, S. Lathuilière, E. Sangineto, A. Siarohin, N. Sebe, WAC…
☆10Oct 9, 2020Updated 5 years ago
AllenXuuu / DCR
View on GitHub
Official implementation of our CVPR'22 paper.
☆13Nov 18, 2022Updated 3 years ago
defilantech / LLMKube
View on GitHub
Kubernetes operator for self-hosted LLM inference across a heterogeneous GPU fleet: NVIDIA CUDA, AMD Vulkan, and Apple Silicon Metal. Run…
☆148Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
MooreThreads / tutorial_on_musa
View on GitHub
☆48Jan 13, 2026Updated 5 months ago
Justherozen / FlowBench
View on GitHub
[EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
☆22Jan 6, 2025Updated last year
quickthyme / graffeine
View on GitHub
Simple, modular graphs for iOS.
☆22Mar 2, 2021Updated 5 years ago
wubowen416 / gesture-generation-using-WGAN
View on GitHub
☆14Jul 24, 2023Updated 2 years ago
thunlp / AutoForm
View on GitHub
Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"
☆23Mar 30, 2024Updated 2 years ago
trailofbits / tlslib.py
View on GitHub
MVP for updated PEP 543 proposal
☆14Jun 12, 2026Updated 2 weeks ago
kennethwdk / PINet
View on GitHub
Code for "Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference", NeurIPS 2021
☆15Dec 2, 2021Updated 4 years ago
scwangdyd / zero_shot_hoi
View on GitHub
Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020
☆42Jul 14, 2020Updated 5 years ago
PKU-ICST-MIPL / MAI_ICLR2025
View on GitHub
☆21Mar 5, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CodeEditorBench / CodeEditorBench
View on GitHub
☆58May 28, 2024Updated 2 years ago
Ianleeclark / machinery_display
View on GitHub
Generates graphs from Machinery state machines
☆14Oct 18, 2020Updated 5 years ago
raphamorim / fuzzy
View on GitHub
fuzzy matching with Levenshtein, Damerau-Levenshtein, Bitap and n-gram
☆24Jul 31, 2025Updated 10 months ago
beaugunderson / vscode-solidity-extended
View on GitHub
✏ Solidity support for VSCode
☆10Jan 11, 2023Updated 3 years ago
elliottzheng / ppt2fig
View on GitHub
PPT2Fig 用来把 PPT 页面导出成适合论文、汇报和文档插图使用的 PDF，并自动裁掉多余白边。
☆33Apr 24, 2026Updated 2 months ago
fujita / rust-nvme
View on GitHub
☆16Aug 5, 2022Updated 3 years ago
aws / sagemaker-mxnet-inference-toolkit
View on GitHub
Toolkit for allowing inference and serving with MXNet in SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https…
☆29Sep 13, 2023Updated 2 years ago
zodb / perfmetrics
View on GitHub
A library for sending software performance metrics from Python libraries and apps to statsd.
☆31May 19, 2026Updated last month
cnsdqd-dyb / Guide-GRPO
View on GitHub
Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, …
☆28Feb 23, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
davehunt / pytest-zap
View on GitHub
OWASP Zed Attack Proxy plugin for py.test
☆13Sep 10, 2015Updated 10 years ago
cschaufler / lsm-stacking
View on GitHub
Linux Security Module Stacking
☆10Apr 25, 2026Updated 2 months ago
yangjie-cv / WeThink
View on GitHub
WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning
☆36Jun 10, 2025Updated last year
mckaywrigley / takeoff-cursor-course-3
View on GitHub
☆13Apr 14, 2025Updated last year
disler / install-and-maintain
View on GitHub
Deterministic and Agentic patterns for installing and maintaining successful production applications
☆100Jan 25, 2026Updated 5 months ago
YuDeng / GRAM
View on GitHub
Project page of "GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation"
☆21Apr 3, 2023Updated 3 years ago
holi-lab / ToolDial
View on GitHub
☆26Mar 4, 2026Updated 3 months ago