huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆3,423 Updated last week
Alternatives and similar repositories for smollm
Users that are interested in smollm are comparing it to the libraries listed below
- Recipes for shrinking, optimizing, customizing cutting edge vision models. ☆1,794 Updated last month
- The simplest, fastest repository for training/finetuning small-sized VLMs. ☆4,294 Updated last month
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,394 Updated 7 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,867 Updated last week
- The Open Cookbook for Top-Tier Code Large Language Model ☆1,952 Updated 11 months ago
- Sky-T1: Train your own O1 preview model within $450 ☆3,356 Updated 4 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,950 Updated this week
- Democratizing Reinforcement Learning for LLMs ☆4,770 Updated this week
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents ☆1,861 Updated last month
- Code for BLT research paper ☆2,010 Updated 3 weeks ago
- Synthetic data curation for post-training and structured data extraction ☆1,557 Updated 4 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,141 Updated this week
- Minimalistic large language model 3D-parallelism training ☆2,334 Updated last week
- Large Concept Models: Language modeling in a sentence representation space ☆2,306 Updated 10 months ago
- Optimizing inference proxy for LLMs ☆3,192 Updated last week
- Bringing BERT into modernity via both architecture changes and scaling ☆1,567 Updated 5 months ago
- Fully open data curation for reasoning models ☆2,147 Updated 2 months ago
- Tool for generating high quality Synthetic datasets ☆1,400 Updated last month
- DataComp for Language Models ☆1,394 Updated 2 months ago
- Textbook on reinforcement learning from human feedback ☆1,329 Updated this week
- Witness the aha moment of VLM with less than $3. ☆3,994 Updated 6 months ago
- A Self-adaptation Framework that adapts LLMs for unseen tasks in real-time! ☆1,167 Updated 10 months ago
- [ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning ☆2,099 Updated 3 weeks ago
- A course on aligning smol models. ☆6,522 Updated 2 weeks ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,675 Updated 7 months ago
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im… ☆2,966 Updated last month
- Renderer for the harmony response format to be used with gpt-oss ☆4,033 Updated 3 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,742 Updated last week
- AllenAI's post-training codebase ☆3,373 Updated this week
- Fast State-of-the-Art Static Embeddings ☆1,907 Updated 2 weeks ago