WONDERBREAD benchmark + dataset for BPM tasks
☆34Jul 30, 2025Updated 7 months ago
Alternatives and similar repositories for wonderbread
Users that are interested in wonderbread are comparing it to the libraries listed below
Sorting:
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Jul 11, 2025Updated 7 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆136Updated this week
- Web-grounded natural language instructions☆18Nov 25, 2024Updated last year
- ☆17Oct 22, 2024Updated last year
- Performance Prediction for NLP Tasks☆17May 5, 2020Updated 5 years ago
- ☆16Apr 9, 2021Updated 4 years ago
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"☆18Oct 17, 2022Updated 3 years ago
- ☆21May 24, 2024Updated last year
- Multimodal computer agent data collection program☆164Dec 5, 2025Updated 2 months ago
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆139Aug 26, 2024Updated last year
- ☆17Sep 1, 2024Updated last year
- ☆19Updated this week
- Self-hosted GPT-4V api☆27Nov 6, 2023Updated 2 years ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆34Jun 17, 2024Updated last year
- ☆31Sep 27, 2024Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Nov 26, 2024Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Dec 29, 2024Updated last year
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Feb 11, 2026Updated 3 weeks ago
- This repository contains the registries for components, agents and services, the second part of the autonolas-v1 protocol.☆15Updated this week
- ☆13Apr 27, 2021Updated 4 years ago
- Authors' implementation of the paper Adaptive Information Seeking for Open-Domain Question Answering, published in EMNLP 2021.☆38May 16, 2023Updated 2 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- The repository provides code for the paper RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders, CIKM'24☆11Oct 21, 2024Updated last year
- Pascal2 Harvest project QuEst☆14Sep 15, 2014Updated 11 years ago
- Designed to help lawyers and legal professionals find precedent fast and prepare for case negotiations by simulating trajectories☆10Oct 16, 2024Updated last year
- An Awesome, Feature Rich Discord Bot for Hosting and Managing CTF Challenges on Discord Written in Python3☆11Jun 29, 2024Updated last year
- Kait's Site☆14Sep 7, 2021Updated 4 years ago
- Photonic Quantum Machine Learning Framework☆19Feb 18, 2026Updated 2 weeks ago
- Ask AI to test your website with a specific goal☆15Dec 22, 2023Updated 2 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- grpo to train long form QA and instructions with long-form reward model☆17Jul 17, 2025Updated 7 months ago
- This repository contains numerous small utility packages. These packages serve various useful purposes and are written in nano ESModule w…☆10Feb 18, 2026Updated 2 weeks ago
- Concurrent data extraction from unstructured text and images using AI models.☆18Aug 10, 2025Updated 6 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆180Oct 8, 2025Updated 4 months ago
- 🤖 A list of latest AGI-related repos, resources and courses including LLMs and AI Agents.☆13Sep 24, 2024Updated last year
- Toy distributed PostgreSQL by implementing SQL over KV☆11Jan 14, 2026Updated last month
- 神经辐射场 论文学习☆10Sep 25, 2021Updated 4 years ago
- UCPR: User-Centric Path Reasoning towards Explainable Recommendation, SIGIR 2021☆12Jun 18, 2022Updated 3 years ago
- ☆11Sep 8, 2023Updated 2 years ago