☆118Apr 8, 2025Updated 10 months ago
Alternatives and similar repositories for digiq
Users that are interested in digiq are comparing it to the libraries listed below
Sorting:
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆387Feb 22, 2025Updated last year
- ☆31Jul 3, 2025Updated 7 months ago
- ☆67Mar 6, 2025Updated 11 months ago
- ☆22May 23, 2025Updated 9 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Nov 26, 2024Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆202Apr 17, 2025Updated 10 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆261May 5, 2025Updated 9 months ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆110Jul 17, 2025Updated 7 months ago
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆60Jul 11, 2025Updated 7 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated 11 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆30Jan 10, 2026Updated last month
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆510Jun 6, 2025Updated 8 months ago
- ☆20Apr 24, 2024Updated last year
- Building a comprehensive and handy list of papers for GUI agents☆641Oct 27, 2025Updated 4 months ago
- ☆53Feb 19, 2025Updated last year
- Training VLM agents with multi-turn reinforcement learning☆416Feb 24, 2026Updated last week
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆381Mar 7, 2025Updated 11 months ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆199Dec 17, 2024Updated last year
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 6 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆120Dec 10, 2024Updated last year
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆255Jul 16, 2024Updated last year
- ☆14May 9, 2024Updated last year
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- Implementation of the LOSSGRAD optimization algorithm☆15Mar 21, 2019Updated 6 years ago
- ☆18Apr 5, 2025Updated 10 months ago
- A parallel PX4 flight controller for large-scale reinforcement learning.☆21Apr 19, 2025Updated 10 months ago
- [IEEE TVCG 2025] Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames☆11Jun 1, 2025Updated 9 months ago
- 基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.☆11Dec 16, 2022Updated 3 years ago
- ☆35Jan 12, 2026Updated last month
- AbstainQA, ACL 2024☆28Feb 4, 2026Updated 3 weeks ago
- AndroidWorld is an environment and benchmark for autonomous agents☆640Updated this week
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆488Sep 6, 2024Updated last year
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆406Dec 15, 2024Updated last year
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD"☆12Jan 29, 2020Updated 6 years ago
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆13Nov 11, 2024Updated last year
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆24Jun 17, 2025Updated 8 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆136Updated this week