OS-Agent-Survey / OS-Agent-Survey
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".
☆233 Updated this week
Alternatives and similar repositories for OS-Agent-Survey:
Users who are interested in OS-Agent-Survey are comparing it to the repositories listed below.
- Controllable Text Generation for Large Language Models: A Survey ☆164 Updated 7 months ago
- Codebase for Iterative DPO Using Rule-based Rewards ☆230 Updated this week
- Building a comprehensive and handy list of papers for GUI agents ☆269 Updated 2 weeks ago
- The official repo for the paper "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods" ☆321 Updated 3 months ago
- The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space" ☆165 Updated 2 weeks ago
- Large Language Model Agent: A Survey on Methodology, Applications and Challenges ☆53 Updated this week
- Recipes to train the self-rewarding reasoning LLMs. ☆207 Updated 3 weeks ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc. ☆161 Updated 4 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆171 Updated this week
- ✨✨ Latest Papers and Datasets on Mobile and PC GUI Agent ☆117 Updated 4 months ago
- Minimal-cost training for 0.5B R1-Zero ☆673 Updated this week
- ☆216 Updated this week
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey" ☆136 Updated last month
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework ☆176 Updated last week
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis ☆118 Updated this week
- Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms ☆16 Updated 3 weeks ago
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024) ☆113 Updated 3 months ago
- ☆199 Updated this week
- Related works and background techniques for OpenAI o1 ☆217 Updated 2 months ago
- Towards Large Multimodal Models as Visual Foundation Agents ☆195 Updated last month
- A Survey on Efficient Reasoning for LLMs ☆204 Updated this week
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin… ☆162 Updated 3 months ago
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey" ☆162 Updated 5 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior. ☆216 Updated last week
- ☆138 Updated 2 weeks ago
- A recipe for online RLHF and online iterative DPO. ☆502 Updated 3 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆52 Updated 4 months ago
- Adds sequence parallelism to LLaMA-Factory ☆437 Updated this week
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024) ☆80 Updated 5 months ago
- Benchmarking LLMs via Uncertainty Quantification ☆217 Updated last year