facebookresearch / MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
☆1,288 Updated this week
Alternatives and similar repositories for MobileLLM:
Users who are interested in MobileLLM are comparing it to the repositories listed below.
- Everything about the SmolLM2 and SmolVLM family of models ☆2,201 Updated 3 weeks ago
- nanoGPT style version of Llama 3.1 ☆1,356 Updated 8 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆862 Updated 2 months ago
- MINT-1T: A one trillion token multimodal interleaved dataset. ☆808 Updated 8 months ago
- DataComp for Language Models ☆1,279 Updated last month
- VPTQ, A Flexible and Extreme low-bit quantization algorithm ☆628 Updated 3 weeks ago
- Recipes to scale inference-time compute of open models ☆1,055 Updated last month
- Official implementation of Half-Quadratic Quantization (HQQ) ☆791 Updated this week
- [NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which r… ☆971 Updated last week
- OLMoE: Open Mixture-of-Experts Language Models ☆716 Updated last month
- Open weights language model from Google DeepMind, based on Griffin. ☆636 Updated 2 months ago
- PyTorch native quantization and sparsity for training and inference ☆1,974 Updated this week
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM ☆1,237 Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,438 Updated this week
- Reaching LLaMA2 Performance with 0.1M Dollars ☆982 Updated 9 months ago
- NanoGPT (124M) in 3 minutes ☆2,501 Updated this week
- MobiLlama: Small Language Model tailored for edge devices ☆632 Updated last year
- Minimalistic large language model 3D-parallelism training ☆1,793 Updated this week
- On-device AI across mobile, embedded and edge for PyTorch ☆2,747 Updated this week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,378 Updated last year
- A PyTorch quantization backend for Optimum ☆922 Updated last week
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects. ☆1,267 Updated 5 months ago
- A PyTorch native library for large-scale model training ☆3,607 Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆2,899 Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆445 Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,200 Updated this week
- Muon is Scalable for LLM Training ☆1,022 Updated 3 weeks ago
- Code for BLT research paper ☆1,513 Updated this week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,508 Updated last year
- PyTorch native post-training library ☆5,103 Updated this week