[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆27Feb 17, 2026Updated last month
Alternatives and similar repositories for Explorer
Users that are interested in Explorer are comparing it to the libraries listed below
Sorting:
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆23Jan 6, 2026Updated 2 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Dec 29, 2024Updated last year
- ☆33Aug 17, 2025Updated 7 months ago
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge☆102Feb 28, 2026Updated 3 weeks ago
- Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022)☆11Nov 28, 2022Updated 3 years ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆41Mar 31, 2025Updated 11 months ago
- 🔵 [App][Android] AppLock for Android☆12Aug 31, 2020Updated 5 years ago
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)☆11Nov 15, 2023Updated 2 years ago
- ☆30Feb 4, 2026Updated last month
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- ☆24Apr 3, 2025Updated 11 months ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆28Feb 25, 2025Updated last year
- ☆13Oct 19, 2023Updated 2 years ago
- ☆10Nov 29, 2024Updated last year
- ☆10Nov 28, 2024Updated last year
- Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, l…☆29Mar 5, 2025Updated last year
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated 9 months ago
- ☆17Jul 20, 2022Updated 3 years ago
- ☆11Aug 1, 2024Updated last year
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024☆15Jul 11, 2024Updated last year
- ☆13Nov 29, 2024Updated last year
- ☆10Dec 18, 2020Updated 5 years ago
- ☆22May 3, 2025Updated 10 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆68Jan 7, 2026Updated 2 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆153May 29, 2025Updated 9 months ago
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- Code for the CIKM'23 paper "A Retrieve-and-Read Framework for Knowledge Graph Link Prediction"☆12Mar 23, 2025Updated 11 months ago
- A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning☆10Aug 4, 2023Updated 2 years ago
- ☆12Apr 18, 2025Updated 11 months ago
- Ongoing research training transformer models at scale☆18Jul 27, 2023Updated 2 years ago
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges☆28May 14, 2025Updated 10 months ago
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".☆35Dec 6, 2025Updated 3 months ago
- ☆32Sep 19, 2025Updated 6 months ago
- ☆37Dec 20, 2024Updated last year
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆14Jun 1, 2025Updated 9 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,388Nov 26, 2025Updated 3 months ago
- ☆17Apr 3, 2022Updated 3 years ago
- ☆19Mar 10, 2025Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆834Feb 3, 2025Updated last year