mbzuai-oryx / MobiLlama
[ICLR-2025-SLLM Spotlight 🔥] MobiLlama: Small Language Model tailored for edge devices
☆664 Updated 6 months ago
Alternatives and similar repositories for MobiLlama
Users who are interested in MobiLlama are comparing it to the repositories listed below.
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3) ☆843 Updated 3 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ☆772 Updated last year
- Strong and Open Vision Language Assistant for Mobile Devices ☆1,297 Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills ☆760 Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars ☆987 Updated last year
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs ☆510 Updated 2 years ago
- Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens" ☆863 Updated 6 months ago
- [ACL 2024] Progressive LLaMA with Block Expansion. ☆511 Updated last year
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones ☆1,301 Updated last year
- Reference implementation of Megalodon 7B model ☆523 Updated 6 months ago
- Efficient AI Inference & Serving ☆478 Updated last year
- HPT - Open Multimodal LLMs from HyperGAI ☆315 Updated last year
- ☆713 Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ☆369 Updated last year
- Beyond Language Models: Byte Models are Digital World Simulators ☆329 Updated last year
- A family of lightweight multimodal models. ☆1,046 Updated last year
- AI for all: Build the large graph of the language models ☆277 Updated last year
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,390 Updated 6 months ago
- For releasing code related to compression methods for transformers, accompanying our publications ☆449 Updated 10 months ago
- Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling" ☆858 Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆822 Updated 2 years ago
- 【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models ☆2,270 Updated 4 months ago
- Codebase for Aria - an Open Multimodal Native MoE ☆1,079 Updated 9 months ago
- [TLLM'23] PandaGPT: One Model To Instruction-Follow Them All ☆831 Updated 2 years ago
- MiniCPM on Android platform. ☆636 Updated 7 months ago
- LLaVA-Interactive-Demo ☆378 Updated last year
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆660 Updated last year
- A family of highly capable yet efficient large multimodal models ☆191 Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆597 Updated 2 years ago
- The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google