OpenDFM / MobA
๐ฎManipulates mobile phones just like how you would. Official code for "MobA: A Two-Level Agent System for Efficient Mobile Task Automation".
โ16Updated 2 months ago
Alternatives and similar repositories for MobA:
Users that are interested in MobA are comparing it to the libraries listed below
- โ28Updated last week
- Simple Implementation of TinyGPTV in super simple Zeta lego blocksโ15Updated 2 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generationโ10Updated 3 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"โ41Updated this week
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Modelsโ28Updated 10 months ago
- OLA-VLM: Elevating Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024โ47Updated this week
- Official Pytorch Implementation of Self-emerging Token Labelingโ32Updated 10 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant โฆโ15Updated 10 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"โ17Updated 3 weeks ago
- A Framework for Decoupling and Assessing the Capabilities of VLMsโ40Updated 7 months ago
- survery of small language modelsโ14Updated 6 months ago
- โ15Updated 6 months ago
- Representing Rule-based Chatbots with Transformersโ19Updated 6 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.โ37Updated 4 months ago
- โ31Updated last week
- โ12Updated 5 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];โ36Updated last year
- The open source implementation of "AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model"โ22Updated this week
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"โ19Updated this week
- EfficientSAM + YOLO World base model for use with Autodistill.โ9Updated 11 months ago
- Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxiang Li, Lu Yiโฆโ16Updated last month
- Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editingโ21Updated last month
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"โ11Updated 3 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)โ17Updated 3 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMsโ72Updated 3 months ago
- โ13Updated last year
- โ27Updated 4 months ago
- โ12Updated 3 weeks ago
- Exploration of the multi modal fuyu-8b model of Adept. ๐ค ๐โ28Updated last year