wendell0218 / GVA-Survey
Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms
☆16Updated 2 months ago
Alternatives and similar repositories for GVA-Survey:
Users that are interested in GVA-Survey are comparing it to the libraries listed below
- ☆55Updated 6 months ago
- Code for "UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning"☆91Updated this week
- A Self-Training Framework for Vision-Language Reasoning☆77Updated 3 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆44Updated 4 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs.☆58Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆119Updated last month
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆57Updated 6 months ago
- ☆94Updated 3 weeks ago
- ☆153Updated last month
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆85Updated 6 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆39Updated 2 weeks ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆205Updated last week
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆67Updated 3 weeks ago
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆130Updated last month
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆72Updated 2 weeks ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆94Updated 3 weeks ago
- Building a comprehensive and handy list of papers for GUI agents☆313Updated last week
- The official code repository for PRMBench.☆73Updated 2 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆67Updated 2 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆70Updated 2 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆146Updated 3 weeks ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆100Updated 2 months ago
- ☆163Updated this week
- ☆73Updated 11 months ago
- Awesome Agent Training☆96Updated this week
- A comprehensive collection of process reward models.☆76Updated this week
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆111Updated 2 weeks ago
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆33Updated 2 weeks ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆135Updated 4 months ago