OS-Agent-Survey / OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025).
☆282 Updated 2 weeks ago
Alternatives and similar repositories for OS-Agent-Survey
Users who are interested in OS-Agent-Survey are comparing it to the libraries listed below:
- Train your Agent model via our easy and efficient framework ☆776 Updated this week
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in… ☆232 Updated this week
- ☆208 Updated last week
- Building a comprehensive and handy list of papers for GUI agents ☆371 Updated last week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning ☆518 Updated last week
- ☆193 Updated last week
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey" ☆162 Updated last month
- Awesome Agent Training ☆131 Updated this week
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent ☆124 Updated 6 months ago
- ☆140 Updated 4 months ago
- Latest Advances on Long Chain-of-Thought Reasoning ☆343 Updated this week
- minimal-cost for training 0.5B R1-Zero ☆730 Updated 3 weeks ago
- Controllable Text Generation for Large Language Models: A Survey ☆175 Updated 9 months ago
- The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space" ☆170 Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆409 Updated last month
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond ☆228 Updated this week
- ☆214 Updated 3 weeks ago
- Towards Large Multimodal Models as Visual Foundation Agents ☆216 Updated last month
- Codebase for Iterative DPO Using Rule-based Rewards ☆245 Updated last month
- The related works and background techniques about OpenAI o1 ☆221 Updated 4 months ago
- A series of technical reports on Slow Thinking with LLM ☆679 Updated last week
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis ☆135 Updated last week
- A recipe for online RLHF and online iterative DPO. ☆514 Updated 5 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆111 Updated 2 months ago
- ☆237 Updated last year
- ☆198 Updated last week
- adds Sequence Parallelism into LLaMA-Factory ☆498 Updated this week
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework ☆226 Updated 2 months ago
- Paper list for Personal LLM Agents ☆388 Updated last year
- Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning" ☆107 Updated last week