OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation
☆36Apr 1, 2026Updated 2 months ago
Alternatives and similar repositories for OfficeBench
Users that are interested in OfficeBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agents☆24May 7, 2025Updated last year
- ☆13May 23, 2024Updated 2 years ago
- Agent Memory Playground: AI Agent Memory Design & Optimization Techniques☆38Aug 7, 2025Updated 10 months ago
- ☆28Oct 30, 2025Updated 7 months ago
- ☆18Feb 28, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?☆13Aug 16, 2023Updated 2 years ago
- MI and Formal Verification of NNs on Algorithmic tasks!☆18Mar 18, 2024Updated 2 years ago
- ☆41Jan 19, 2026Updated 4 months ago
- ☆21Apr 27, 2026Updated last month
- Fetch a random wallpaper from Konachan.☆10Jun 4, 2018Updated 8 years ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆27Nov 7, 2025Updated 7 months ago
- Code and data for the USENIX 2025 paper "We Have a Package for You! A Comprehensive Analysis of Package Hallucinations by Code Generating…☆29Aug 12, 2025Updated 10 months ago
- ☆52Mar 30, 2026Updated 2 months ago
- ☆15Oct 24, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).☆20Oct 10, 2022Updated 3 years ago
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆23Jul 28, 2025Updated 10 months ago
- AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents☆57Jan 28, 2025Updated last year
- Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Framework☆14May 31, 2023Updated 3 years ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆31Jun 4, 2023Updated 3 years ago
- Official Implementation for "EmojiLM: Modeling the New Emoji Language"☆12Feb 23, 2024Updated 2 years ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆20Apr 17, 2026Updated last month
- Official PyTorch implementation of RefRef: A Synthetic Dataset and Benchmark for Reconstructing Refractive and Reflective Objects☆15Mar 2, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation of SimFlow☆32Dec 16, 2025Updated 5 months ago
- Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)☆18Nov 24, 2022Updated 3 years ago
- UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience☆74Apr 3, 2026Updated 2 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆478Mar 19, 2024Updated 2 years ago
- Discovering Hypernymy in Text-Rich Heterogeneous Information Network by Exploiting Context Granularity (CIKM'19)☆18Nov 4, 2019Updated 6 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated 2 years ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆71Dec 9, 2024Updated last year
- Official Implementation of CAPEAM (ICCV'23)☆16Nov 30, 2024Updated last year
- Code for ICLR 2024 paper "Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection"☆17Apr 20, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Jun 24, 2025Updated 11 months ago
- ☆14May 9, 2024Updated 2 years ago
- Open source code for paper☆15May 27, 2024Updated 2 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- A Chrome/Edge extension to help you quickly scan through the flood of daily ArXiv papers.☆16Mar 29, 2025Updated last year
- [ECCV] HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning☆26Sep 6, 2025Updated 9 months ago
- [ICLR 2026] AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆48Apr 17, 2026Updated last month