mbzuai-oryx / MobiLlamaView external linksLinks
[ICLR-2025-SLLM Spotlight π₯]MobiLlama : Small Language Model tailored for edge devices
β667May 10, 2025Updated 9 months ago
Alternatives and similar repositories for MobiLlama
Users that are interested in MobiLlama are comparing it to the libraries listed below
Sorting:
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathologyβ12Jun 17, 2025Updated 8 months ago
- Strong and Open Vision Language Assistant for Mobile Devicesβ1,330Apr 15, 2024Updated last year
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hβ¦β84Aug 5, 2025Updated 6 months ago
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]β22Oct 27, 2024Updated last year
- Pre-training code for Amber 7B LLMβ171May 10, 2024Updated last year
- FuseAI Projectβ587Jan 25, 2025Updated last year
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabiβ¦β79Sep 24, 2024Updated last year
- ARB: A Comprehensive Arabic Multimodal Reasoning Benchmarkβ17May 25, 2025Updated 8 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.β1,409Apr 21, 2025Updated 9 months ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbonesβ1,307Feb 5, 2026Updated last week
- Reaching LLaMA2 Performance with 0.1M Dollarsβ986Jul 23, 2024Updated last year
- a family of highly capabale yet efficient large multimodal modelsβ191Aug 23, 2024Updated last year
- γTMM 2025π₯γ Mixture-of-Experts for Large Vision-Language Modelsβ2,302Jul 15, 2025Updated 7 months ago
- [MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Claβ¦β47Sep 28, 2023Updated 2 years ago
- [ICML'24] The official implementation of βRethinking Optimization and Architecture for Tiny Language Modelsββ126Jan 14, 2025Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.β8,889May 3, 2024Updated last year
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pβ¦β1,315Aug 8, 2025Updated 6 months ago
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"β25Jun 8, 2025Updated 8 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Modelsβ126May 7, 2024Updated last year
- π₯π₯ LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)β849Aug 5, 2025Updated 6 months ago
- Official code for the CVPR 2025 paper "SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models."β587Jun 1, 2025Updated 8 months ago
- [ACL 2025 π₯] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifactsβ18May 22, 2025Updated 8 months ago
- Modeling, training, eval, and inference code for OLMoβ6,306Nov 24, 2025Updated 2 months ago
- [MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"β¦β52Nov 14, 2023Updated 2 years ago
- A simple library for working with Hugging Face models.β14Dec 30, 2024Updated last year
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Modelsβ15Nov 1, 2024Updated last year
- Universal LLM Deployment Engine with ML Compilationβ22,039Updated this week
- [NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"β350Mar 16, 2025Updated 11 months ago
- Run Mixtral-8x7B models in Colab or consumer desktopsβ2,325Apr 8, 2024Updated last year
- This repository is the project page for "Point Anywhere: Directed Object Estimation from Omnidirectional Images", including source code β¦β12Aug 25, 2023Updated 2 years ago
- Composed Video Retrievalβ62May 2, 2024Updated last year
- [CVPR 2024 π₯] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses thaβ¦β945Aug 5, 2025Updated 6 months ago
- MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasksβ8,606Updated this week
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skillsβ763Feb 1, 2024Updated 2 years ago
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ22Jan 26, 2026Updated 3 weeks ago
- Low-bit LLM inference on CPU/NPU with lookup tableβ919Jun 5, 2025Updated 8 months ago
- [BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.β526Aug 8, 2024Updated last year
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Modelsβ28Oct 20, 2025Updated 3 months ago
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Modelsβ237Oct 14, 2025Updated 4 months ago