facebookresearch/MobileLLM-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/MobileLLM-R1)

facebookresearch / MobileLLM-R1

MobileLLM-R1

☆86

Alternatives and similar repositories for MobileLLM-R1

Users that are interested in MobileLLM-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenGVLab / LLMPrune-BESA
View on GitHub
BESA is a differentiable weight pruning technique for large language models.
☆17Mar 4, 2024Updated 2 years ago
AkideLiu / MiniCache
View on GitHub
☆14Sep 7, 2024Updated last year
IVUL-KAUST / VideoAuto-R1
View on GitHub
[CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
☆88Feb 27, 2026Updated 5 months ago
SakanaAI / fast-weight-product-key-memory
View on GitHub
Code for Fast-weight Product Key Memory (FwPKM)
☆19Mar 18, 2026Updated 4 months ago
stellalisy / PrefPalette
View on GitHub
☆21Apr 3, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aiha-lab / MX-QLLM
View on GitHub
LLM Inference with Microscaling Format
☆35Nov 12, 2024Updated last year
piotrpiekos / MoSA
View on GitHub
User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…
☆29May 3, 2025Updated last year
ASISys / AdaSkip
View on GitHub
AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference
☆21Jan 24, 2025Updated last year
actypedef / ARCQuant
View on GitHub
[ACL 2026 Main] Code for the paper "ARCQuant: Boosting NVFP4 Quantization with Augmented Residual Channels for LLMs"
☆28Jun 1, 2026Updated last month
hangeol / UniR
View on GitHub
Official repo for paper: Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
☆20Nov 26, 2025Updated 8 months ago
IDSIA / recurrent-fwp
View on GitHub
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
☆52Jun 11, 2025Updated last year
HuyNguyen-hust / flash-attn-101
View on GitHub
☆22Sep 3, 2024Updated last year
facebookresearch / ParetoQ
View on GitHub
This repository contains the training code of ParetoQ introduced in our work "ParetoQ Scaling Laws in Extremely Low-bit LLM Quantization"
☆131Oct 15, 2025Updated 9 months ago
FLOW-open-project / FLOW
View on GitHub
Codebase for layer wise N:M pruning pattern assignment for LLMs
☆15Aug 5, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / MobileLLM
View on GitHub
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
☆1,453Apr 30, 2026Updated 3 months ago
RTkenny / RiskPO
View on GitHub
Official implementation of 'RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training', accepted by ICLR 2026
☆18Oct 15, 2025Updated 9 months ago
BryceZhuo / PolyCom
View on GitHub
The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".
☆18Apr 25, 2025Updated last year
GATECH-EIC / Linearized-LLM
View on GitHub
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35Jun 12, 2024Updated 2 years ago
facebookresearch / Ternary_Binary_Transformer
View on GitHub
ACL 2023
☆39Jun 6, 2023Updated 3 years ago
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
liushulinle / MarsRL
View on GitHub
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
☆18Nov 18, 2025Updated 8 months ago
JinjieNi / dlms-are-super-data-learners
View on GitHub
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227Nov 6, 2025Updated 8 months ago
AI-Initiative-KAUST / VideoRLCS
View on GitHub
Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)
☆28Aug 19, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
irfan112 / yowov3-multistreaming-inferencing
View on GitHub
A real-time inferencing of multistreaming YOWOv3(Spatio Temporal Action Detection task) using (UCF101-24) dataset. The repo is extension …
☆26May 15, 2026Updated 2 months ago
mit-han-lab / vcpo
View on GitHub
[ICML 2026] Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs
☆29Apr 27, 2026Updated 3 months ago
facebookresearch / SpinQuant
View on GitHub
Code repo for the paper "SpinQuant LLM quantization with learned rotations"
☆419Feb 14, 2025Updated last year
zlab-princeton / vero
View on GitHub
Vero: An Open RL Recipe for General Visual Reasoning
☆138Jun 19, 2026Updated last month
HazyResearch / prefix-linear-attention
View on GitHub
☆62Jul 9, 2024Updated 2 years ago
Anonymous1252022 / fp4-all-the-way
View on GitHub
☆51May 20, 2025Updated last year
Yeyke / HBLLM
View on GitHub
[NeurIPS 2025 (spotlight)] HBLLM: Wavelet-Enhanced High-Fidelity 1-Bit Quantization for LLMs
☆16Dec 17, 2025Updated 7 months ago
hexiaoxiao-cs / DICE
View on GitHub
☆16May 10, 2026Updated 2 months ago
alansong1322 / VECA
View on GitHub
Elastic Attention Cores for Scalable Vision Transformers
☆15May 13, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
analokmaus / kaggle-aimo2-fast-math-r1
View on GitHub
Kaggle AIMO2 solution with token-efficient reasoning LLM recipes
☆50Aug 7, 2025Updated 11 months ago
sanyalsunny111 / LLM-Inheritune
View on GitHub
[TMLR 2025] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models
☆126Mar 6, 2026Updated 4 months ago
isLinXu / paper-read-notes
View on GitHub
paper-read-notes
☆13Sep 26, 2024Updated last year
duykhuongnguyen / MAT-Steer
View on GitHub
☆21Aug 19, 2025Updated 11 months ago
zlai0 / S-Seg
View on GitHub
☆23Jan 24, 2024Updated 2 years ago
NVlabs / FRAG
View on GitHub
☆15Apr 25, 2025Updated last year
qiuk2 / RobusTok
View on GitHub
Image Tokenizer Needs Post-Training
☆24Oct 4, 2025Updated 9 months ago