facebookresearch / MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
☆1,287, updated last week
Alternatives and similar repositories for MobileLLM:
Users who are interested in MobileLLM are comparing it to the libraries listed below:
- Everything about the SmolLM2 and SmolVLM family of models ☆2,177, updated 2 weeks ago
- nanoGPT style version of Llama 3.1 ☆1,351, updated 8 months ago
- Minimalistic large language model 3D-parallelism training ☆1,786, updated this week
- DataComp for Language Models ☆1,275, updated 3 weeks ago
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration ☆2,922, updated this week
- VPTQ, A Flexible and Extreme low-bit quantization algorithm ☆626, updated 2 weeks ago
- Official implementation of Half-Quadratic Quantization (HQQ) ☆784, updated last week
- PyTorch native quantization and sparsity for training and inference ☆1,954, updated this week
- Reaching LLaMA2 Performance with 0.1M Dollars ☆982, updated 8 months ago
- OLMoE: Open Mixture-of-Experts Language Models ☆713, updated last month
- MINT-1T: A one trillion token multimodal interleaved dataset. ☆807, updated 8 months ago
- Code for BLT research paper ☆1,445, updated last week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling