OSU-NLP-Group / WebDreamer
☆48 Updated last month
Alternatives and similar repositories for WebDreamer:
Users who are interested in WebDreamer are comparing it to the repositories listed below.
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆109 Updated 2 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆105 Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆85 Updated 2 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆52 Updated 10 months ago
- Repo of paper "Free Process Rewards without Process Labels" ☆82 Updated last week
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆54 Updated 8 months ago
- ☆57 Updated 4 months ago
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI" ☆89 Updated 3 weeks ago
- ☆111 Updated 3 weeks ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆71 Updated 7 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆134 Updated last month
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆53 Updated last week
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆47 Updated 2 months ago
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆44 Updated 3 weeks ago
- The Official Code Repository for GUI-World. ☆44 Updated 3 weeks ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆62 Updated last week
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆105 Updated 8 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆44 Updated 5 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024) ☆34 Updated last week
- ☆27 Updated 3 weeks ago
- ☆76 Updated 2 weeks ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆39 Updated 2 months ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073 ☆26 Updated 6 months ago
- ☆85 Updated 2 months ago
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis ☆53 Updated this week
- Self-Alignment with Principle-Following Reward Models ☆150 Updated 10 months ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" ☆43 Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆77 Updated 2 months ago
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024 ☆111 Updated 2 months ago
- The official repository of the Omni-MATH benchmark. ☆66 Updated 2 weeks ago