HazyResearch/wonderbread

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HazyResearch/wonderbread)

HazyResearch / wonderbread

WONDERBREAD benchmark + dataset for BPM tasks

☆35

Alternatives and similar repositories for wonderbread

Users that are interested in wonderbread are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
xnancy / russ
View on GitHub
☆16Apr 9, 2021Updated 5 years ago
asappresearch / webagents-step
View on GitHub
☆41Jul 21, 2024Updated 2 years ago
Zhiyuan-Zeng / EvalTree
View on GitHub
[COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
☆31Jul 11, 2025Updated last year
Timothyxxx / KVCachePapers
View on GitHub
☆20May 24, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TheDuckAI / DuckTrack
View on GitHub
Multimodal computer agent data collection program
☆174Updated this week
Berkeley-NLP / Agent-Eval-Refine
View on GitHub
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
☆149Nov 26, 2024Updated last year
xiamengzhou / NLPerf
View on GitHub
Performance Prediction for NLP Tasks
☆17May 5, 2020Updated 6 years ago
JHU-CLSP / turking-bench
View on GitHub
Web-grounded natural language instructions
☆18Nov 25, 2024Updated last year
xlang-ai / Spider2-V
View on GitHub
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
☆153Aug 26, 2024Updated last year
Timothyxxx / TestTimeTrainingPapers
View on GitHub
☆59Apr 13, 2026Updated 3 months ago
uivision / UI-Vision
View on GitHub
☆33Jul 3, 2025Updated last year
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
microsoft / iclr2019-learning-to-represent-edits
View on GitHub
Code for the ICLR 2019 paper "Learning to Represent Edits"
☆13Dec 8, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
WildVision-AI / LMM-Engines
View on GitHub
☆17Oct 22, 2024Updated last year
chengyou-jia / AgentStore
View on GitHub
[ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
☆46Dec 19, 2024Updated last year
zorazrw / odex
View on GitHub
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆49Dec 22, 2023Updated 2 years ago
OS-Copilot / OS-Genesis
View on GitHub
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
☆188Oct 8, 2025Updated 9 months ago
peterbhase / ExplanationSearch
View on GitHub
Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"
☆18Oct 17, 2022Updated 3 years ago
XianyiCheng / HiDex
View on GitHub
☆13Jun 30, 2023Updated 3 years ago
ShuangLI59 / Pre-Trained-Language-Models-for-Interactive-Decision-Making
View on GitHub
Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]
☆131Jun 8, 2022Updated 4 years ago
cxcscmu / Montessori-Instruct
View on GitHub
Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
☆51Jan 24, 2025Updated last year
OSU-NLP-Group / ACuRL
View on GitHub
An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma…
☆38Jun 7, 2026Updated last month
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
theSergeyGusev / simple10GbaseR
View on GitHub
FPGA Low latency 10GBASE-R PCS
☆13May 23, 2023Updated 3 years ago
takuseno / d3rlpy-benchmarks
View on GitHub
Benchmark data for d3rlpy
☆22Nov 28, 2023Updated 2 years ago
violet-zct / swarm-distillation-zero-shot
View on GitHub
☆23Oct 15, 2022Updated 3 years ago
xlang-ai / OSWorld-G
View on GitHub
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172Jun 18, 2026Updated last month
josancamon19 / trace
View on GitHub
Trajectory Recording and Capture Environments
☆19Jan 24, 2026Updated 6 months ago
LlamaTouch / AgentEnv
View on GitHub
An environment for mobile angets to interact with realistic android device or android emulator
☆13Jul 19, 2024Updated 2 years ago
WukLab / osworld-human
View on GitHub
OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents
☆27May 17, 2026Updated 2 months ago
zharry29 / causal_reasoning_of_entities_and_events
View on GitHub
Data and code for the paper Causal Reasoning of Entities and Events in Procedural Texts.
☆11May 26, 2023Updated 3 years ago
chang-github-00 / LLM-Predictive-Decoding
View on GitHub
☆16Jul 9, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Ericcsr / DiffSRL
View on GitHub
Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"
☆12Apr 4, 2022Updated 4 years ago
tongshuangwu / llm-crowdsourcing-pipeline
View on GitHub
☆11Jul 6, 2023Updated 3 years ago
Zangir / LLM-for-CP
View on GitHub
☆13Oct 3, 2024Updated last year
JasonMa2016 / CODAC
View on GitHub
Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)
☆22Aug 1, 2021Updated 4 years ago
XianyiCheng / CMGMP
View on GitHub
Contact Mode Guided Motion Planning for Robotic Manipulation in 3D
☆17Jan 13, 2023Updated 3 years ago
apple / ml-cadd
View on GitHub
☆18Apr 24, 2026Updated 3 months ago
YuxiangChai / AMEX-codebase
View on GitHub
☆33Sep 27, 2024Updated last year