[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
☆99Oct 5, 2025Updated 7 months ago
Alternatives and similar repositories for WebDreamer
Users that are interested in WebDreamer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Jan 3, 2025Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Dec 29, 2024Updated last year
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- Code for the paper 🌳 Tree Search for Language Model Agents☆222Jul 25, 2024Updated last year
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆312Mar 11, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆20Jan 26, 2026Updated 3 months ago
- Measuring General Intelligence With Generated Games (Preprint)☆25Jul 30, 2025Updated 9 months ago
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge☆109Feb 28, 2026Updated 2 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆136Updated this week
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆121Apr 14, 2025Updated last year
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated 2 years ago
- An Illusion of Progress? Assessing the Current State of Web Agents☆172Jan 2, 2026Updated 4 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Feb 13, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆28Jun 5, 2025Updated 11 months ago
- True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning☆12Jul 6, 2022Updated 3 years ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆520Jun 6, 2025Updated 11 months ago
- Building a comprehensive and handy list of papers for GUI agents☆747Apr 25, 2026Updated last week
- Official code repository for "Web Agents with World Models [ICLR 2025]".☆28Mar 2, 2025Updated last year
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- Towards Large Multimodal Models as Visual Foundation Agents☆265Apr 24, 2025Updated last year
- A super simple html + css + js ollama chat interface for hacking on.☆16May 1, 2025Updated last year
- ☆36Sep 30, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆40Oct 2, 2024Updated last year
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- ☆21Aug 18, 2024Updated last year
- ☆23Oct 2, 2024Updated last year
- EMNLP 2025 | TongSearch-QR☆44Dec 4, 2025Updated 5 months ago
- ☆73May 23, 2025Updated 11 months ago
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆575Mar 17, 2026Updated last month
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 10 months ago
- Code for ScribeAgent paper☆63Mar 3, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Indranet Explorer, a simulated browser☆16Nov 12, 2024Updated last year
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated last year
- ☆124Feb 21, 2025Updated last year
- A curated list of tools, guides and resources for the Replicate AI model platform☆17Jan 10, 2024Updated 2 years ago
- Code for CVPR22 paper One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones☆13Jul 27, 2022Updated 3 years ago
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents☆138Feb 19, 2025Updated last year
- Official Implementation of the Baby-AIGS system☆24Nov 25, 2024Updated last year