huggingface / smollm

Everything about the SmolLM and SmolVLM family of models

☆3,086

Alternatives and similar repositories for smollm

Users who are interested in smollm are comparing it to the libraries listed below.

facebookresearch / MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
☆1,315 Updated 3 months ago
facebookresearch / large_concept_model
Large Concept Models: Language modeling in a sentence representation space
☆2,257 Updated 6 months ago
facebookresearch / blt
Code for BLT research paper
☆1,765 Updated 2 months ago
merveenoyan / smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
☆1,546 Updated 2 weeks ago
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆3,962 Updated last week
huggingface / nanoVLM
The simplest, fastest repository for training/finetuning small-sized VLMs.
☆3,855 Updated this week
NovaSky-AI / SkyThought
Sky-T1: Train your own O1 preview model within $450
☆3,320 Updated 3 weeks ago
Blaizzy / mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
☆1,563 Updated 2 weeks ago
bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆1,468 Updated last week
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,833 Updated last week
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,132 Updated 6 months ago
OpenCoder-llm / OpenCoder-llm
The Open Cookbook for Top-Tier Code Large Language Model
☆1,791 Updated 8 months ago
natolambert / rlhf-book
Textbook on reinforcement learning from human feedback
☆1,147 Updated 2 weeks ago
codelion / optillm
Optimizing inference proxy for LLMs
☆2,722 Updated last week
AnswerDotAI / ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
☆1,473 Updated last month
MoonshotAI / Kimi-k1.5
☆3,456 Updated 5 months ago
karpathy / nano-llama31
nanoGPT style version of Llama 3.1
☆1,412 Updated last year
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,508 Updated 3 months ago
microsoft / Magma
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
☆1,765 Updated 2 months ago
MinishLab / model2vec
Fast State-of-the-Art Static Embeddings
☆1,786 Updated last week
StarsfieldAI / R1-V
Witness the aha moment of VLM with less than $3.
☆3,882 Updated 2 months ago
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,344 Updated 2 months ago
open-thoughts / open-thoughts
Fully open data curation for reasoning models
☆2,022 Updated 2 weeks ago
KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆2,985 Updated 3 weeks ago
huggingface / smol-course
A course on aligning smol models.
☆6,055 Updated last month
openai / harmony
Renderer for the harmony response format to be used with gpt-oss
☆2,637 Updated this week
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆1,793 Updated this week
meta-llama / synthetic-data-kit
Tool for generating high quality Synthetic datasets
☆1,100 Updated this week
facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,224 Updated 6 months ago
PKU-YuanGroup / LLaVA-CoT
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
☆2,043 Updated 2 weeks ago