🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task Automation".
☆27Oct 10, 2025Updated 5 months ago
Alternatives and similar repositories for MobA
Users that are interested in MobA are comparing it to the libraries listed below
Sorting:
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆16Nov 24, 2024Updated last year
- [SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆44Nov 19, 2025Updated 4 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year
- [ACL'25 Findings] Official repo for "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task"☆38Apr 7, 2025Updated 11 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 9 months ago
- Official Repo for MageBench: Bridging Large Multimodal Models to Agents☆22Jan 8, 2025Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆29Jul 24, 2025Updated 7 months ago
- [NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆53May 21, 2025Updated 9 months ago
- On Path to Multimodal Generalist: General-Level and General-Bench☆18Jul 11, 2025Updated 8 months ago
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".☆93Feb 6, 2026Updated last month
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- Base on React's UI lib☆14Jan 7, 2023Updated 3 years ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆60Mar 17, 2025Updated last year
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆45Mar 3, 2026Updated 2 weeks ago
- 公司现用的RF所有测试公共方法及资源组织架构,包括API测试、UI测试、JMeter性能测试 样例,自定义库NdolLibrary、redis、mysql、Opencv、xml、date、appium等公共封装,PAI基础关键字、UI基础关键字、业务组合关键字、常规逻辑的通…☆21Sep 13, 2017Updated 8 years ago
- 星搭低代码AI助手插件,使用 StableDiffusion 和 ChatGPT 生成插画和文案☆11Mar 22, 2023Updated 2 years ago
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration☆30Nov 22, 2025Updated 3 months ago
- ☆22Jun 10, 2025Updated 9 months ago
- ☆36Jan 8, 2026Updated 2 months ago
- VisionDroid☆22Apr 2, 2024Updated last year
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 9 months ago
- Code of the paper Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role Labeling☆15Nov 15, 2020Updated 5 years ago
- 在千问最新的多模态image-text模型Qwen3-VL-4B-Instruct 进行多种lora微调对比效果,通过langchain+RAG+多智能体(Multi-Agent)进行部署☆32Dec 14, 2025Updated 3 months ago
- Information relating to topics on Data Engineering, Data Infrastructure, Data Storing, Data Warehouses and Business Analysis. For those i…☆10Aug 8, 2021Updated 4 years ago
- ☆10Aug 16, 2024Updated last year
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆31Jun 14, 2024Updated last year
- [ACM MM2025] The official repository for the RealSyn dataset☆40Dec 14, 2025Updated 3 months ago
- Start WebContainer from VS Codde for the Web, and use Jsh and Preview URL.☆17Updated this week
- 基于nginx lua做前端防御,基于hadoop做用户行为分析的waf☆11Nov 17, 2016Updated 9 years ago
- ☆23Jan 16, 2023Updated 3 years ago
- ☆20Oct 25, 2022Updated 3 years ago
- ☆12Aug 24, 2023Updated 2 years ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆53Oct 19, 2024Updated last year
- Use bert by transformer and pytorch-lightning☆16Jul 9, 2024Updated last year
- Code for the Findings of NAACL 2022(Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NL…☆18May 4, 2022Updated 3 years ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025).☆70Sep 25, 2025Updated 5 months ago
- ☆16Oct 28, 2021Updated 4 years ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆34May 28, 2025Updated 9 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆100Oct 14, 2024Updated last year