MagicAgent-GUI / MagicGUILinks
☆69Updated 3 months ago
Alternatives and similar repositories for MagicGUI
Users that are interested in MagicGUI are comparing it to the libraries listed below
Sorting:
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆400Updated 7 months ago
- MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding☆75Updated 9 months ago
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆238Updated last month
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆275Updated 2 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆303Updated 7 months ago
- ☆241Updated 10 months ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆183Updated 2 years ago
- ☆261Updated 4 months ago
- ☆187Updated 10 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆329Updated 6 months ago
- [AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"☆142Updated 3 weeks ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆163Updated 2 months ago
- a toolkit on knowledge distillation for large language models☆221Updated last week
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆295Updated 5 months ago
- GLM Series Edge Models☆156Updated 6 months ago
- ☆703Updated last month
- ☆338Updated 2 months ago
- Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.☆222Updated last week
- [ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 e…☆137Updated 4 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆132Updated 5 months ago
- ☆62Updated 3 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆77Updated last year
- ☆40Updated last year
- MiMo-VL☆596Updated 4 months ago
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆61Updated 2 weeks ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆95Updated last year
- GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning☆170Updated 2 months ago
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources☆289Updated 3 months ago
- An automated pipeline for evaluating LLMs for role-playing.☆201Updated last year
- [NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆361Updated last month