A verified version of the WebArena Benchmark
☆31Mar 8, 2026Updated 2 weeks ago
Alternatives and similar repositories for webarena-verified
Users that are interested in webarena-verified are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tensorflow-Based Approach to Handling Single-Step and Iterated Forecasting☆10Jun 24, 2024Updated last year
- ☆14Jul 18, 2025Updated 8 months ago
- [ACL 2022] CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations☆10Jun 5, 2022Updated 3 years ago
- ☆10Sep 14, 2023Updated 2 years ago
- Code for paper "Prompt Engineering a Prompt Engineer" (https://arxiv.org/abs/2311.05661)☆10Aug 1, 2024Updated last year
- An enterprise deep research benchmark☆35Updated this week
- Welcome to the DeepTrack GitHub repository, a cutting-edge solution for vehicle trajectory prediction in the realm of intelligent transpo…☆19Oct 27, 2023Updated 2 years ago
- N/A☆18Aug 15, 2022Updated 3 years ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- Code and dataset "ZEST" from "Learning from task descriptions", Weller et al, EMNLP 2020☆17Mar 15, 2021Updated 5 years ago
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆27Jun 5, 2025Updated 9 months ago
- ☆33Aug 17, 2025Updated 7 months ago
- French Machine Reading for Question Answering☆18Sep 21, 2022Updated 3 years ago
- RPG^2 is a pure-software system that operates on running C/C++ programs, profiling them, injecting prefetch instructions, and then tuning…☆12May 15, 2024Updated last year
- ☆16Mar 30, 2024Updated last year
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆23Oct 26, 2021Updated 4 years ago
- ☆10Jul 23, 2023Updated 2 years ago
- Code for WALT – Web Agents that Learn Tools☆67Oct 30, 2025Updated 4 months ago
- 不仅完成了作业的基础和提高,还为202扩展了其他算法:Efficient GPU SSR,Hiz-SSR,IBL,SVGF。GAMES101在另一个分支,完成了Final Project,还扩展了Roughness BSDF!☆19Sep 30, 2023Updated 2 years ago
- Gantry provides an API that streamlines running experiments in Beaker☆33Mar 11, 2026Updated last week
- 🌟✨一个纯粹基于requests的Python爬虫工具,专为获取拼多多商品分类和详情页面而设计!🛒🎉 给繁琐的自动化浏览器代码说再见👋,用这个轻量级🎈、高效🚀的工具轻松获取你需要的信息。📚🌈☆12Aug 31, 2023Updated 2 years ago
- ☆33Jul 8, 2024Updated last year
- Label Efficient Learning From Explanations☆22Mar 9, 2022Updated 4 years ago
- ☆11Dec 8, 2015Updated 10 years ago
- Conversational Neuro-Symbolic Commonsense Reasoning☆26Jun 18, 2020Updated 5 years ago
- [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly☆25Jan 6, 2026Updated 2 months ago
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.☆15Feb 8, 2023Updated 3 years ago
- A web-based RISC-V simulator https://riscv-simulator-five.vercel.app☆44Jan 22, 2026Updated 2 months ago
- A paper list of research conducted based on wikiHow☆27Mar 5, 2022Updated 4 years ago
- A unified interface for with Linux-based cloud sandbox providers☆32Oct 9, 2025Updated 5 months ago
- SyGra - Graph-oriented Synthetic data generation Pipeline☆75Updated this week
- This is the official Pytorch implementation of Generate Like Experts: Multi-Stage Font Generation by Incorporating Font Transfer Process …☆45Feb 5, 2025Updated last year
- ☆21May 26, 2024Updated last year
- ZJU_digital_logic_design 数字逻辑设计☆13Aug 9, 2023Updated 2 years ago
- ☆47Jun 5, 2023Updated 2 years ago
- My TechStack written in MkDocs☆21Mar 11, 2026Updated 2 weeks ago
- Developed a sophisticated machine learning model capable of generating diverse interview questions aligned with specific topics, ensuring…☆42Dec 13, 2025Updated 3 months ago
- 浙江大学 2023-2024 春夏学期《计算机组成与设计》实验文档(刘海风老师班)☆21Dec 10, 2024Updated last year
- ☆57Nov 18, 2024Updated last year