OpenDFM / MobA
๐ฎManipulates mobile phones just like how you would. Official code for "MobA: A Two-Level Agent System for Efficient Mobile Task Automation".
โ22Updated 3 weeks ago
Alternatives and similar repositories for MobA:
Users that are interested in MobA are comparing it to the libraries listed below
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't relโฆโ13Updated last year
- โ56Updated 5 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocksโ16Updated 5 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understandingโ51Updated 4 months ago
- An open-source toolkit helping developers build natural language database query solutionsโ12Updated this week
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024โ58Updated 2 months ago
- โ21Updated 2 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generationโ10Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLMโ44Updated 11 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"โ42Updated 2 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Modelsโ28Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.โ33Updated last year
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"โ26Updated 9 months ago
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debuggingโ71Updated last month
- โ37Updated 2 years ago
- โ32Updated 3 months ago
- Official Repository for Task-Circuit Quantizationโ19Updated last week
- โ13Updated 2 years ago
- โ63Updated last month
- EfficientSAM + YOLO World base model for use with Autodistill.โ10Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant โฆโ16Updated last year
- โ39Updated 9 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectoriesโ12Updated 3 weeks ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.โ37Updated 7 months ago
- โ20Updated 11 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0โ24Updated 2 months ago
- MPI Code Generation through Domain-Specific Language Modelsโ13Updated 5 months ago
- โ16Updated 2 months ago
- โ29Updated 8 months ago
- โ27Updated 2 months ago