mbzuai-oryx / MobiLlama
MobiLlama: Small Language Model tailored for edge devices
☆630 · Updated last year
Alternatives and similar repositories for MobiLlama:
Users who are interested in MobiLlama are comparing it to the libraries listed below.
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3) ☆836 · Updated 9 months ago
- Strong and Open Vision Language Assistant for Mobile Devices ☆1,184 · Updated 11 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars ☆981 · Updated 8 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ☆767 · Updated last year
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones ☆1,276 · Updated 11 months ago
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,274 · Updated last month
- [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs. ☆791 · Updated 6 months ago
- For releasing code related to compression methods for transformers, accompanying our publications ☆424 · Updated 2 months ago
- [NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which r… ☆965 · Updated this week
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills ☆735 · Updated last year
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding ☆1,233 · Updated last month
- [ACL 2024] Progressive LLaMA with Block Expansion. ☆499 · Updated 10 months ago
- Beyond Language Models: Byte Models are Digital World Simulators ☆322 · Updated 10 months ago
- Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens" ☆864 · Updated 3 months ago
- ☆707 · Updated last year
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,493 · Updated last year
- ☆530 · Updated 5 months ago
- AI for all: Build the large graph of the language models ☆263 · Updated 10 months ago
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs ☆508 · Updated last year
- Efficient AI Inference & Serving ☆470 · Updated last year
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆712 · Updated 6 months ago
- Advanced Quantization Algorithm for LLMs/VLMs. ☆417 · Updated this week
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆648 · Updated 10 months ago
- Official implementation of Half-Quadratic Quantization (HQQ) ☆778 · Updated last week
- Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3. ☆1,147 · Updated this week
- MINT-1T: A one trillion token multimodal interleaved dataset. ☆807 · Updated 8 months ago
- Official repository for LongChat and LongEval ☆517 · Updated 10 months ago
- [CVPR 2024] OneLLM: One Framework to Align All Modalities with Language ☆627 · Updated 5 months ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models". ☆273 · Updated last year
- ☆524 · Updated 7 months ago