Setup scripts for the WebArena benchmark
☆19Jun 19, 2025Updated 8 months ago
Alternatives and similar repositories for webarena-setup
Users that are interested in webarena-setup are comparing it to the libraries listed below
Sorting:
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆75Nov 4, 2025Updated 3 months ago
- ☆13Nov 5, 2024Updated last year
- Resources for the Enigmata Project.☆77Aug 13, 2025Updated 6 months ago
- WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025)☆12Jan 5, 2025Updated last year
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 5 months ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- ☆14Mar 5, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆35Feb 12, 2026Updated 2 weeks ago
- The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023))☆13Dec 21, 2023Updated 2 years ago
- Structural Pre-training for Dialogue Comprehension (ACL 2021)☆10Apr 25, 2022Updated 3 years ago
- LLM as World Models using Bayesian inference☆16May 27, 2025Updated 9 months ago
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- ☆21Jun 4, 2025Updated 8 months ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago
- [WWW 2025 Oral] ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning☆20Jul 2, 2025Updated 7 months ago
- implementations sde-net☆14Dec 8, 2020Updated 5 years ago
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- Blockscout verified smart-contracts dumps☆12Feb 5, 2022Updated 4 years ago
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆54Jan 2, 2024Updated 2 years ago
- Unofficial implementation of 《Towards Photo-Realistic VisibleWatermark Removal with Conditional Generative Adversarial Networks》☆13Feb 24, 2023Updated 3 years ago
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.☆15May 27, 2023Updated 2 years ago
- Leveraging Type Descriptions for Zero-shot Named Entity Recognition and Classification, published in ACL 2021.☆13Apr 29, 2022Updated 3 years ago
- [EMNLP 2024 Main] Code for the paper "Dissecting Fine-Tuning Unlearning in Large Language Models"☆15Oct 10, 2024Updated last year
- Official Implementation of CAPEAM (ICCV'23)☆16Nov 30, 2024Updated last year
- Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue (TOIS)☆13Oct 18, 2025Updated 4 months ago
- Yet another RL Baseline repo.☆12May 28, 2024Updated last year
- A computer for your agent - sandboxed code execution for AI agents☆34Nov 29, 2025Updated 3 months ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- ☆11May 11, 2022Updated 3 years ago
- Challenges for general-purpose web-browsing AI agents☆67Jun 2, 2025Updated 8 months ago
- [ICLR 2025] SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models☆17Sep 17, 2025Updated 5 months ago
- code for "Fine-grained Entity Typing via Label Reasoning" EMNLP2021☆13May 27, 2022Updated 3 years ago