McGill-NLP/weblinx

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/McGill-NLP/weblinx)

McGill-NLP / weblinx

WebLINX is a benchmark for building web navigation agents with conversational capabilities

☆162

Alternatives and similar repositories for weblinx

Users that are interested in weblinx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shulin16 / MMInA
View on GitHub
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆54Feb 27, 2025Updated last year
McGill-NLP / webllama
View on GitHub
Llama-3 agents that can browse the web by following instructions and talking to you
☆1,400Dec 10, 2024Updated last year
OSU-NLP-Group / SeeAct
View on GitHub
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…
☆851Feb 3, 2025Updated last year
asappresearch / webagents-step
View on GitHub
☆41Jul 21, 2024Updated 2 years ago
web-arena-x / visualwebarena
View on GitHub
VisualWebArena is a benchmark for multimodal agents.
☆484Nov 9, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OSU-NLP-Group / Mind2Web
View on GitHub
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…
☆1,015Nov 5, 2025Updated 8 months ago
ServiceNow / WorkArena
View on GitHub
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
☆261Apr 25, 2026Updated 2 months ago
THUDM / AutoWebGLM
View on GitHub
An LLM-based Web Navigating Agent (KDD'24)
☆930Sep 27, 2024Updated last year
ServiceNow / BrowserGym
View on GitHub
🌎💪 BrowserGym, a Gym environment for web task automation
☆1,288Jul 17, 2026Updated last week
VisualWebBench / VisualWebBench
View on GitHub
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
☆68Oct 19, 2024Updated last year
princeton-nlp / WebShop
View on GitHub
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
☆572Sep 6, 2024Updated last year
MinorJerry / WebVoyager
View on GitHub
Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
☆1,110Mar 4, 2024Updated 2 years ago
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
Berkeley-NLP / Agent-Eval-Refine
View on GitHub
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
☆149Nov 26, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
oriyor / assistantbench
View on GitHub
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆71Dec 9, 2024Updated last year
kohjingyu / search-agents
View on GitHub
Code for the paper 🌳 Tree Search for Language Model Agents
☆223Jul 25, 2024Updated 2 years ago
web-arena-x / webarena
View on GitHub
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
☆1,556Nov 26, 2025Updated 7 months ago
cooelf / Auto-GUI
View on GitHub
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
☆261Jul 16, 2024Updated 2 years ago
google-deepmind / pix2act
View on GitHub
☆60Jul 8, 2026Updated 2 weeks ago
camel-ai / crab
View on GitHub
🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/
☆423Updated this week
OpenGVLab / GUI-Odyssey
View on GitHub
[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…
☆159Jan 3, 2026Updated 6 months ago
OSU-NLP-Group / UGround
View on GitHub
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆315Mar 11, 2026Updated 4 months ago
McGill-NLP / CHASE
View on GitHub
Synthetic Data Generation for Evaluation
☆16Feb 21, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ServiceNow / AgentLab
View on GitHub
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…
☆606Jul 17, 2026Updated last week
niuzaisheng / ScreenAgent
View on GitHub
ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)
☆607Nov 25, 2024Updated last year
showlab / GUI-Narrator
View on GitHub
Repository of GUI Action Narrator
☆13Apr 8, 2025Updated last year
convergence-ai / webgames
View on GitHub
Challenges for general-purpose web-browsing AI agents
☆68Jun 2, 2025Updated last year
ltzheng / Synapse
View on GitHub
[ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control
☆69Jan 7, 2026Updated 6 months ago
aburns4 / textualforesight
View on GitHub
☆12Aug 8, 2024Updated last year
DigiRL-agent / digirl
View on GitHub
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆393Feb 22, 2025Updated last year
Farama-Foundation / MiniWoB-plusplus
View on GitHub
A collection of reinforcement learning environments for simple web interaction tasks
☆393Updated this week
posgnu / rci-agent
View on GitHub
A codebase for "Language Models can Solve Computer Tasks"
☆240May 1, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ddupont808 / GPT-4V-Act
View on GitHub
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
☆1,060Dec 9, 2024Updated last year
agentsea / surfkit
View on GitHub
A toolkit for building computer use AI agents
☆194Jun 26, 2025Updated last year
chuyg1005 / seeclick-crawler
View on GitHub
☆20Apr 24, 2024Updated 2 years ago
MurtyShikhar / NNetnav
View on GitHub
Interaction-first method for generating demonstrations for web-agents on any website
☆57Apr 29, 2025Updated last year
Destiner / oneshot
View on GitHub
Anthropic MCP client for macOS
☆16Jan 5, 2025Updated last year
OS-Copilot / OS-Atlas
View on GitHub
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
☆452Apr 20, 2025Updated last year
HazyResearch / wonderbread
View on GitHub
WONDERBREAD benchmark + dataset for BPM tasks
☆35Jul 30, 2025Updated 11 months ago