mbzuai-oryx / MobiLlama
[ICLR-2025-SLLM Spotlight 🔥] MobiLlama: Small Language Model tailored for edge devices
★655 · Updated 3 months ago
Alternatives and similar repositories for MobiLlama
Users who are interested in MobiLlama are comparing it to the libraries listed below.
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3) ★840 · Updated 3 weeks ago
- Reaching LLaMA2 Performance with 0.1M Dollars ★988 · Updated last year
- Strong and Open Vision Language Assistant for Mobile Devices ★1,259 · Updated last year
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ★769 · Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills ★758 · Updated last year
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones ★1,297 · Updated last year
- Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens" ★861 · Updated 3 months ago
- HPT - Open Multimodal LLMs from HyperGAI ★315 · Updated last year
- ★712 · Updated last year
- [ACL 2024] Progressive LLaMA with Block Expansion. ★509 · Updated last year
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ★1,315 · Updated 4 months ago
- [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs. ★842 · Updated 3 months ago
- Reference implementation of Megalodon 7B model ★524 · Updated 3 months ago
- For releasing code related to compression methods for transformers, accompanying our publications ★441 · Updated 7 months ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ★821 · Updated 2 years ago
- Efficient AI Inference & Serving ★476 · Updated last year
- AI for all: Build the large graph of the language models ★274 · Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ★371 · Updated last year
- a family of highly capable yet efficient large multimodal models ★187 · Updated last year
- A family of lightweight multimodal models. ★1,030 · Updated 9 months ago
- 【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models ★2,221 · Updated last month
- LLaVA-Interactive-Demo ★378 · Updated last year
- MiniCPM on Android platform. ★634 · Updated 5 months ago
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs ★512 · Updated 2 years ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ★661 · Updated last year
- Implementation of DoRA ★301 · Updated last year
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention… ★291 · Updated last year
- Beyond Language Models: Byte Models are Digital World Simulators ★328 · Updated last year
- Codebase for Merging Language Models (ICML 2024) ★844 · Updated last year
- Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling" ★858 · Updated last year