facebookresearch / MobileLLMLinks
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
β1,395Updated 7 months ago
Alternatives and similar repositories for MobileLLM
Users that are interested in MobileLLM are comparing it to the libraries listed below
Sorting:
- DataComp for Language Modelsβ1,394Updated 2 months ago
- π MINT-1T: A one trillion token multimodal interleaved dataset.β827Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modelingβ932Updated 2 weeks ago
- TinyChatEngine: On-Device LLM Inference Libraryβ929Updated last year
- Everything about the SmolLM and SmolVLM family of modelsβ3,433Updated 2 weeks ago
- Reaching LLaMA2 Performance with 0.1M Dollarsβ988Updated last year
- Minimalistic large language model 3D-parallelism trainingβ2,351Updated last week
- Official implementation of Half-Quadratic Quantization (HQQ)β894Updated last month
- Open weights language model from Google DeepMind, based on Griffin.β654Updated 6 months ago
- VPTQ, A Flexible and Extreme low-bit quantization algorithmβ668Updated 7 months ago
- OLMoE: Open Mixture-of-Experts Language Modelsβ916Updated 2 months ago
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attentionβ¦β1,163Updated 2 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Modelsβ1,640Updated last year
- [ICLR-2025-SLLM Spotlight π₯]MobiLlama : Small Language Model tailored for edge devicesβ667Updated 6 months ago
- A modern model graph visualizer and debuggerβ1,342Updated this week
- Code for BLT research paperβ2,010Updated last month
- Recipes to scale inference-time compute of open modelsβ1,118Updated 6 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,682Updated 7 months ago
- Muon is Scalable for LLM Trainingβ1,372Updated 4 months ago
- A pytorch quantization backend for optimumβ1,011Updated last week
- llama3.np is a pure NumPy implementation for Llama 3 model.β992Updated 7 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.β2,068Updated last year
- Serving multiple LoRA finetuned LLM as oneβ1,121Updated last year
- PyTorch native quantization and sparsity for training and inferenceβ2,543Updated this week
- β581Updated last year
- Reference implementation of Megalodon 7B modelβ525Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ2,162Updated this week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAIβ1,406Updated last year
- Scalable data pre processing and curation toolkit for LLMsβ1,243Updated this week
- Official inference library for pre-processing of Mistral modelsβ818Updated this week