mbzuai-oryx / MobiLlama
[ICLR-2025-SLLM Spotlight 🔥] MobiLlama: Small Language Model tailored for edge devices
☆641 · Updated 3 weeks ago
Alternatives and similar repositories for MobiLlama
Users who are interested in MobiLlama are comparing it to the libraries listed below.
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3) ☆835 · Updated 10 months ago
- Strong and Open Vision Language Assistant for Mobile Devices ☆1,221 · Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars ☆979 · Updated 10 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ☆767 · Updated last year
- Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens" ☆860 · Updated 3 weeks ago
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones ☆1,283 · Updated last year
- Beyond Language Models: Byte Models are Digital World Simulators ☆322 · Updated 11 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆651 · Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆821 · Updated 2 years ago
- [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs. ☆808 · Updated last week
- [ACL 2024] Progressive LLaMA with Block Expansion. ☆501 · Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills ☆743 · Updated last year
- MiniCPM on Android platform. ☆631 · Updated 2 months ago
- For releasing code related to compression methods for transformers, accompanying our publications ☆429 · Updated 4 months ago
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention… ☆1,040 · Updated this week
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding ☆1,249 · Updated 2 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆727 · Updated 8 months ago
- Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling" ☆842 · Updated 9 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,301 · Updated last month
- ☆536 · Updated 7 months ago
- Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation ☆765 · Updated 9 months ago
- A family of lightweight multimodal models. ☆1,017 · Updated 6 months ago
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs ☆509 · Updated last year
- HPT - Open Multimodal LLMs from HyperGAI ☆316 · Updated 11 months ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization ☆689 · Updated 9 months ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning ☆611 · Updated last year
- Official repository for LongChat and LongEval ☆518 · Updated last year
- Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU. Seamlessly integrated with Torchao, Tra… ☆483 · Updated this week
- [TLLM'23] PandaGPT: One Model To Instruction-Follow Them All ☆790 · Updated 2 years ago
- Mixture-of-Experts for Large Vision-Language Models ☆2,173 · Updated 6 months ago