OSU-NLP-Group/WebDreamer

[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"

☆104

Alternatives and similar repositories for WebDreamer

Users who are interested in WebDreamer are comparing it to the repositories listed below.

OSU-NLP-Group / SeeActChromeExtension
☆18 · Jan 3, 2025 · Updated last year
OSU-NLP-Group / Middleware
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)
☆37 · Dec 29, 2024 · Updated last year
language-agent-tutorial / language-agent-tutorial.github.io
[EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks
☆10 · Nov 27, 2024 · Updated last year
kohjingyu / search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
☆223 · Jul 25, 2024 · Updated 2 years ago
OSU-NLP-Group / UGround
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆315 · Mar 11, 2026 · Updated 4 months ago
RUCAIBox / LSVCR
☆14 · Apr 1, 2024 · Updated 2 years ago
mlwu22 / RED
Implementation code for ACL 2024: Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆15 · Apr 20, 2024 · Updated 2 years ago
OSU-NLP-Group / Mind2Web-2
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
☆111 · May 17, 2026 · Updated 2 months ago
OSU-NLP-Group / Online-Mind2Web
An Illusion of Progress? Assessing the Current State of Web Agents
☆192 · Jun 25, 2026 · Updated last month
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54 · Feb 23, 2024 · Updated 2 years ago
kyle8581 / WMA-Agents
Official code repository for "Web Agents with World Models [ICLR 2025]".
☆31 · Mar 2, 2025 · Updated last year
zorazrw / multilingual-conala
[EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
☆23 · Feb 13, 2023 · Updated 3 years ago
dki-lab / few-shot-bioIE
True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning
☆12 · Jul 6, 2022 · Updated 4 years ago
heaplax / ARMAP
☆29 · Jun 5, 2025 · Updated last year
OSU-NLP-Group / SkillWeaver
SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.
☆144 · Apr 14, 2025 · Updated last year
zorazrw / agent-skill-induction
Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"
☆42 · Apr 24, 2025 · Updated last year
jonathanherzig / zero-shot-semantic-parsing
Author implementation of the paper "Decoupling Structure and Lexicon for Zero-Shot Semantic Parsing"
☆18 · Nov 2, 2018 · Updated 7 years ago
3B-Group / ConvRe
🤖ConvRe🤯: An Investigation of LLMs' Inefficacy in Understanding Converse Relations (EMNLP 2023)
☆24 · Oct 10, 2023 · Updated 2 years ago
THUDM / WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆535 · Jun 6, 2025 · Updated last year
RenzeLou / Muffin
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16 · Oct 31, 2024 · Updated last year
THUDM / VisualAgentBench
Towards Large Multimodal Models as Visual Foundation Agents
☆274 · Apr 24, 2025 · Updated last year
MobileAgentBench / mobile-agent-bench
☆37 · Sep 30, 2024 · Updated last year
Philip-MIT / thread
☆22 · Aug 18, 2024 · Updated last year
Di-viner / LLM-Robustness-to-Irrelevant-Information
[COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?
☆23 · Oct 13, 2024 · Updated last year
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆240 · Jul 19, 2025 · Updated last year
satori-reasoning / Satori-SWE
☆21 · May 30, 2025 · Updated last year
mobilegptsys / MobileGPT
☆27 · Oct 2, 2024 · Updated last year
CyberAgentAILab / regularized-bon
Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14 · Apr 4, 2025 · Updated last year
colonylabs / ScribeAgent
Code for ScribeAgent paper
☆63 · Mar 3, 2025 · Updated last year
Shoalstone / helm
Interface for text continuations emphasizing autonomous exploration and complex tree management
☆26 · Nov 14, 2025 · Updated 8 months ago
cosmicoptima / indranet-explorer
Indranet Explorer, a simulated browser
☆16 · Nov 12, 2024 · Updated last year
amazon-science / AgentOccam
AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
☆59 · Jan 28, 2025 · Updated last year
brendanhogan / completion_tree_view
☆15 · Apr 26, 2025 · Updated last year
InfiXAI / InfiGUIAgent
☆74 · May 23, 2025 · Updated last year
OSU-NLP-Group / GUI-Agents-Paper-List
Awesome GUI Agent Paper List
☆864 · Jun 28, 2026 · Updated 3 weeks ago
RenzeLou / AAAR-1.0
The source code for running LLMs on the AAAR-1.0 benchmark.
☆20 · Apr 5, 2025 · Updated last year
gerred / mcp-server-replicate
☆16 · Feb 28, 2025 · Updated last year
deepfates / awesome-replicate
A curated list of tools, guides and resources for the Replicate AI model platform
☆17 · Jan 10, 2024 · Updated 2 years ago
Hritikbansal / jpo
☆13 · Jul 2, 2025 · Updated last year