DigiRL-agent / digiqLinks
☆106Updated 2 months ago
Alternatives and similar repositories for digiq
Users that are interested in digiq are comparing it to the libraries listed below
Sorting:
- ☆61Updated 3 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆76Updated last week
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆138Updated 6 months ago
- ☆169Updated this week
- [IROS'25 Oral & NeurIPSw'24] Official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simula…☆91Updated last week
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆57Updated 5 months ago
- ☆130Updated 11 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆78Updated 3 weeks ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934☆45Updated 2 weeks ago
- Towards Large Multimodal Models as Visual Foundation Agents☆216Updated 2 months ago
- Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation)☆32Updated 6 months ago
- Paper collections of the continuous effort start from World Models.☆173Updated 11 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 7 months ago
- ☆44Updated last year
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR 2024 Spotlight)☆65Updated last year
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆138Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆104Updated 3 weeks ago
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆63Updated 2 weeks ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆108Updated 3 weeks ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆118Updated 2 weeks ago
- Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR 2025)☆41Updated 2 months ago
- ☆42Updated last month
- ☆114Updated 5 months ago
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆79Updated last week
- [CVPR2024] This is the official implement of MP5☆102Updated 11 months ago
- Official implementation of "Self-Improving Video Generation"☆66Updated last month
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆63Updated this week
- (ICLR 2025) The Official Code Repository for GUI-World.☆59Updated 6 months ago
- The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"