XiaoMi / MobileBenchLinks
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
☆26Updated last year
Alternatives and similar repositories for MobileBench
Users that are interested in MobileBench are comparing it to the libraries listed below
Sorting:
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"☆15Updated 11 months ago
- KV cache compression via sparse coding☆17Updated 3 months ago
- ☆82Updated 10 months ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆34Updated 8 months ago
- ☆210Updated last month
- ☆111Updated 4 months ago
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆107Updated 6 months ago
- [CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>☆155Updated 3 weeks ago
- [ICML 2025 Oral] Mixture of Lookup Experts☆70Updated 2 months ago
- ☆78Updated 7 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆193Updated 10 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆36Updated 2 months ago
- ☆35Updated 3 weeks ago
- ICML2025: Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning☆51Updated 9 months ago
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent☆149Updated last year
- ☆94Updated this week
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tok…☆76Updated this week
- Official Implementation of APB (ACL 2025 main Oral) and Spava.☆32Updated last week
- ☆71Updated 6 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆147Updated 10 months ago
- ☆23Updated last year
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆71Updated 6 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆14Updated last year
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆36Updated last year
- Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?☆119Updated last year
- RFTT: Reasoning with Reinforced Functional Token Tuning☆29Updated this week
- [TMLR] LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects☆152Updated 2 months ago
- ☆74Updated 8 months ago
- ☆75Updated 7 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆331Updated 8 months ago