MagicAgent-GUI / MagicGUI
☆65 Updated 2 months ago
Alternatives and similar repositories for MagicGUI
Users who are interested in MagicGUI are comparing it to the libraries listed below.
- MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding ☆75 Updated 9 months ago
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab. ☆265 Updated 2 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆396 Updated 7 months ago
- DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models) ☆183 Updated 2 years ago
- ☆186 Updated 9 months ago
- [COLM 2025] Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources ☆285 Updated 3 months ago
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval ☆236 Updated 3 weeks ago
- GLM Series Edge Models ☆153 Updated 5 months ago
- a toolkit on knowledge distillation for large language models ☆209 Updated 3 weeks ago
- ☆337 Updated last month
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World ☆298 Updated 6 months ago
- ☆249 Updated 3 months ago
- ☆40 Updated last year
- ☆699 Updated last week
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning. ☆160 Updated 2 months ago
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊 ☆269 Updated 10 months ago
- Splices the vision head of SmolVLM2 onto the Qwen3-0.6B model and fine-tunes the combination ☆445 Updated 2 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning ☆329 Updated 6 months ago
- Trains a LLaVA model with better Chinese support and open-sources the training code and data. ☆77 Updated last year
- ☆240 Updated 9 months ago
- ☆62 Updated 2 months ago
- GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning ☆168 Updated last month
- Scaling Preference Data Curation via Human-AI Synergy ☆130 Updated 4 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines ☆127 Updated last year
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi… ☆293 Updated 4 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too… ☆353 Updated 3 months ago
- MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources ☆208 Updated 2 months ago
- [AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning" ☆140 Updated 3 weeks ago
- MiMo-VL ☆591 Updated 3 months ago
- [ICCV2025] A Token-level Text Image Foundation Model for Document Understanding ☆124 Updated 3 months ago