[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
☆94Oct 5, 2025Updated 5 months ago
Alternatives and similar repositories for WebDreamer
Users who are interested in WebDreamer are comparing it to the repositories listed below.
- ☆18Jan 3, 2025Updated last year
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Dec 29, 2024Updated last year
- Code for the paper 🌳 Tree Search for Language Model Agents☆220Jul 25, 2024Updated last year
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆124Aug 26, 2025Updated 6 months ago
- Measuring General Intelligence With Generated Games (Preprint)☆25Jul 30, 2025Updated 7 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆256Apr 24, 2025Updated 10 months ago
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge☆100Feb 26, 2026Updated last week
- True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning☆12Jul 6, 2022Updated 3 years ago
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆112Apr 14, 2025Updated 10 months ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- Building a comprehensive and handy list of papers for GUI agents☆641Oct 27, 2025Updated 4 months ago
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Feb 13, 2023Updated 3 years ago
- Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image☆12May 10, 2025Updated 9 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 8 months ago
- ☆27Jun 5, 2025Updated 9 months ago
- ☆73May 23, 2025Updated 9 months ago
- ☆13Jul 2, 2025Updated 8 months ago
- Meme search engine for the real shitposters☆10Jan 27, 2026Updated last month
- A super simple html + css + js ollama chat interface for hacking on.☆16May 1, 2025Updated 10 months ago
- Official PyTorch implementation of “MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation”☆18Dec 5, 2024Updated last year
- This is the official project for our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆31Jan 13, 2024Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- ☆35Sep 30, 2024Updated last year
- Implementation code for ACL 2024: Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated last year
- Main repo for GIOROM☆18Sep 28, 2025Updated 5 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆64Jul 22, 2025Updated 7 months ago
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated 2 years ago
- ☆14Feb 13, 2025Updated last year
- Agentic Deep Graph Reasoning Implementation☆14Mar 4, 2025Updated last year
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Feb 17, 2026Updated 2 weeks ago
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated 11 months ago
- A dataset for training and evaluating LLMs on decision making about "when (not) to call" functions☆55Apr 29, 2025Updated 10 months ago
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents☆130Feb 19, 2025Updated last year
- ☆37Oct 2, 2024Updated last year
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆17Oct 4, 2024Updated last year
- ☆21Jan 23, 2024Updated 2 years ago
- ☆16Feb 28, 2025Updated last year
- A dataset of 80 million constraint-preserving transformations of CAD sketches☆13Nov 22, 2024Updated last year