[ICLR-2025-SLLM Spotlight 🔥] MobiLlama: Small Language Model tailored for edge devices
☆669 May 10, 2025 Updated 10 months ago
Alternatives and similar repositories for MobiLlama
Users that are interested in MobiLlama are comparing it to the libraries listed below
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology ☆12 Jun 17, 2025 Updated 9 months ago
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024] ☆22 Oct 27, 2024 Updated last year
- [ACL 2025 🔥] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts ☆19 May 22, 2025 Updated 9 months ago
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi… ☆79 Sep 24, 2024 Updated last year
- ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark ☆17 May 25, 2025 Updated 9 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H… ☆84 Aug 5, 2025 Updated 7 months ago
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Cla… ☆47 Sep 28, 2023 Updated 2 years ago
- Pre-training code for Amber 7B LLM ☆172 May 10, 2024 Updated last year
- Strong and Open Vision Language Assistant for Mobile Devices ☆1,345 Apr 15, 2024 Updated last year
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos ☆23 Jan 26, 2026 Updated last month
- Composed Video Retrieval ☆63 May 2, 2024 Updated last year
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models" ☆26 Jun 8, 2025 Updated 9 months ago
- Bilingual Medical Mixture of Experts LLM ☆32 Nov 23, 2024 Updated last year
- [MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"… ☆52 Nov 14, 2023 Updated 2 years ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,417 Apr 21, 2025 Updated 11 months ago
- FuseAI Project ☆592 Jan 25, 2025 Updated last year
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the… ☆46 May 26, 2025 Updated 9 months ago
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models ☆29 Oct 20, 2025 Updated 5 months ago
- [BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models. ☆525 Aug 8, 2024 Updated last year
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models ☆15 Nov 1, 2024 Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars ☆989 Jul 23, 2024 Updated last year
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models” ☆127 Jan 14, 2025 Updated last year
- (ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation. ☆19 Sep 28, 2023 Updated 2 years ago
- Reinforcement Training of Robot ☆11 Dec 1, 2019 Updated 6 years ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones ☆1,308 Feb 5, 2026 Updated last month
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo… ☆50 Aug 23, 2024 Updated last year
- [CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses tha… ☆949 Aug 5, 2025 Updated 7 months ago
- (BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" … ☆34 Jan 8, 2023 Updated 3 years ago
- [CVPR 2025] GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model ☆132 Mar 22, 2025 Updated 11 months ago
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3) ☆848 Aug 5, 2025 Updated 7 months ago
- 3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers ☆14 Apr 17, 2023 Updated 2 years ago
- A family of highly capable yet efficient large multimodal models ☆193 Aug 23, 2024 Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,922 May 3, 2024 Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory ☆61 Feb 28, 2025 Updated last year
- [NAACL 2025 🔥] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions. ☆37 Apr 17, 2025 Updated 11 months ago
- AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding… ☆52 Mar 13, 2025 Updated last year
- [ACL 2025 🔥] Rethinking Step-by-step Visual Reasoning in LLMs ☆310 May 21, 2025 Updated 10 months ago
- 【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models ☆2,307 Jul 15, 2025 Updated 8 months ago
- Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024] ☆33 Oct 27, 2024 Updated last year