A codebase for "Language Models can Solve Computer Tasks"
☆240 · May 1, 2024 · Updated 2 years ago
Alternatives and similar repositories for rci-agent
Users that are interested in rci-agent are comparing it to the libraries listed below.
- MiniWoB++: a web interaction benchmark for reinforcement learning ☆380 · Apr 6, 2026 · Updated 3 weeks ago
- ☆60 · Jan 9, 2024 · Updated 2 years ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control ☆68 · Jan 7, 2026 · Updated 3 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w… ☆984 · Nov 5, 2025 · Updated 5 months ago
- Demos for the MiniWoB++ benchmark ☆21 · Feb 23, 2018 · Updated 8 years ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆526 · Sep 6, 2024 · Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult… ☆845 · Feb 3, 2025 · Updated last year
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" ☆1,443 · Nov 26, 2025 · Updated 5 months ago
- ☆41 · Jul 21, 2024 · Updated last year
- Yet another LLM ☆10 · Apr 6, 2023 · Updated 3 years ago
- [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning ☆3,133 · Jan 14, 2025 · Updated last year
- AI-agents that automatically generate and use Langchain Tools and ChatGPT plugins ☆533 · Apr 19, 2023 · Updated 3 years ago
- Reflexion: an autonomous agent with dynamic memory and self-reflection ☆388 · Nov 26, 2023 · Updated 2 years ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆149 · Nov 26, 2024 · Updated last year
- Workflow-Guided Exploration: sample-efficient RL agent for web tasks ☆118 · Jun 5, 2023 · Updated 2 years ago
- VisualWebArena is a benchmark for multimodal agents. ☆463 · Nov 9, 2024 · Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆160 · Feb 11, 2025 · Updated last year
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback ☆12 · Jul 13, 2022 · Updated 3 years ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆798 · Oct 4, 2024 · Updated last year
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World". ☆27 · Aug 20, 2025 · Updated 8 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆477 · Mar 19, 2024 · Updated 2 years ago
- ☆16 · Apr 9, 2021 · Updated 5 years ago
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James) ☆26 · Sep 1, 2023 · Updated 2 years ago
- a WIP architecture designed to allow transformers to think in a manner without tokens ☆20 · Apr 12, 2024 · Updated 2 years ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆248 · May 5, 2024 · Updated last year
- Web-grounded natural language instructions ☆18 · Nov 25, 2024 · Updated last year
- ☆12 · Aug 8, 2024 · Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 · Oct 12, 2023 · Updated 2 years ago
- Self-hosted GPT-4V api ☆27 · Nov 6, 2023 · Updated 2 years ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 · Jan 17, 2024 · Updated 2 years ago
- Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agents ☆24 · May 7, 2025 · Updated 11 months ago
- Multimodal computer agent data collection program ☆167 · Dec 5, 2025 · Updated 4 months ago
- AI Developer is an AI agent powered by GPT-4-Turbo that's using custom E2B Sandbox ☆55 · Feb 11, 2025 · Updated last year
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback ☆124 · Mar 31, 2025 · Updated last year
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024) ☆260 · Jul 16, 2024 · Updated last year
- ☆20 · Feb 28, 2024 · Updated 2 years ago
- ☆35 · Jun 20, 2024 · Updated last year
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models". ☆1,139 · Dec 23, 2023 · Updated 2 years ago
- ☆14 · Nov 5, 2022 · Updated 3 years ago