CVPR25
☆27Jul 2, 2025Updated 8 months ago
Alternatives and similar repositories for MP-GUI
Users that are interested in MP-GUI are comparing it to the libraries listed below
Sorting:
- VisionDroid☆22Apr 2, 2024Updated last year
- The source code of Mem-Gallery: Benchmarking Multimodal Long-Term Conversational Memory for MLLM Agents.☆37Jan 31, 2026Updated last month
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models☆21Mar 10, 2026Updated last week
- ☆24Jul 8, 2023Updated 2 years ago
- DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents☆60Mar 12, 2024Updated 2 years ago
- Deep Graph Outlier Detection☆67Oct 5, 2023Updated 2 years ago
- Investigating and Mitigating the Side Effects of Noisy Views for Self-Supervised Clustering Algorithms in Practical Multi-View Scenarios☆11Mar 21, 2024Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated last year
- [CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆85Feb 13, 2026Updated last month
- ☆14Sep 11, 2025Updated 6 months ago
- ☆20Nov 21, 2025Updated 4 months ago
- Under construction☆13Jan 15, 2025Updated last year
- [Up-to-date] A curated list of resources on graph-empowered agents and agent-facilitated graph learning (Graphs Meet Agents).☆91Sep 13, 2025Updated 6 months ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 9 months ago
- GUIPilot: A Consistency-based Mobile GUI Testing Approach for Detecting Application-specific Bugs☆14Jan 5, 2026Updated 2 months ago
- Owl Eyes: Spotting UI Display Issues via Visual Understanding☆11Jul 31, 2020Updated 5 years ago
- [ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models☆46Jan 8, 2025Updated last year
- ☆12Aug 24, 2023Updated 2 years ago
- [TIP 2025] Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation☆10Jul 8, 2023Updated 2 years ago
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆19Feb 7, 2025Updated last year
- ☆10Nov 9, 2023Updated 2 years ago
- ☆63Dec 5, 2025Updated 3 months ago
- ☆13May 15, 2025Updated 10 months ago
- ☆19Sep 24, 2024Updated last year
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- K-means algorithm implementation in Javascript.☆20Mar 5, 2026Updated 2 weeks ago
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆137Jul 28, 2025Updated 7 months ago
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization☆15Apr 21, 2025Updated 11 months ago
- ☆10Nov 2, 2022Updated 3 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆24Mar 4, 2025Updated last year
- LLaVA-Next for STVG☆18Dec 5, 2025Updated 3 months ago
- SKT A.X LLM 3.1☆13Jul 24, 2025Updated 7 months ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆18Jun 19, 2025Updated 9 months ago
- [2022.05.16 ~ 2022.06.10] 🌤️미세먼지 없는 맑은 사진📷 - 부스트캠프 AI Tech 3기 최종 프로젝트☆14Jun 11, 2022Updated 3 years ago
- Code for The Web Conference 2022 Paper "Collaborative Knowledge Distillation for Heterogeneous Information Network Embedding"☆17Jan 21, 2022Updated 4 years ago
- ☆13Dec 18, 2024Updated last year
- Exposure-slot: Exposure-centric representations learning with Slot-in-Slot Attention for Region-aware Exposure Correction, Computer Visi…☆21Sep 2, 2025Updated 6 months ago
- ☆38Feb 8, 2024Updated 2 years ago