asappresearch / webagents-step
☆38Updated 5 months ago
Alternatives and similar repositories for webagents-step:
Users who are interested in webagents-step are comparing it to the libraries listed below.
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆47Updated last month
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆134Updated last month
- Code for the paper 🌳 Tree Search for Language Model Agents☆163Updated 5 months ago
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆175Updated this week
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆131Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆154Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆118Updated 5 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆77Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 3 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆54Updated last week
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated 11 months ago
- UGround: Universal GUI Visual Grounding for GUI Agents☆138Updated this week
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆57Updated 2 weeks ago
- AWM: Agent Workflow Memory☆231Updated last month
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆53Updated 10 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆66Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap☆82Updated 3 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆100Updated last month
- Code for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding☆44Updated last month
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆153Updated last month
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆50Updated 3 months ago