huggingface / smollmLinks
Everything about the SmolLM and SmolVLM family of models
β3,423Updated last week
Alternatives and similar repositories for smollm
Users that are interested in smollm are comparing it to the libraries listed below
Sorting:
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,794Updated last month
- The simplest, fastest repository for training/finetuning small-sized VLMs.β4,331Updated last month
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.β1,395Updated 7 months ago
- Code for BLT research paperβ2,010Updated 3 weeks ago
- Textbook on reinforcement learning from human feedbackβ1,329Updated this week
- Synthetic data curation for post-training and structured data extractionβ1,557Updated 4 months ago
- Sky-T1: Train your own O1 preview model within $450β3,356Updated 4 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,950Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,675Updated 7 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.β1,887Updated this week
- Bringing BERT into modernity via both architecture changes and scalingβ1,572Updated 5 months ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,167Updated 10 months ago
- The Open Cookbook for Top-Tier Code Large Language Modelβ1,955Updated 11 months ago
- NanoGPT (124M) in 3 minutesβ3,878Updated last week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ2,141Updated this week
- Large Concept Models: Language modeling in a sentence representation spaceβ2,308Updated 10 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,965Updated last month
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee β¦β3,100Updated 6 months ago
- Democratizing Reinforcement Learning for LLMsβ4,770Updated this week
- Minimalistic large language model 3D-parallelism trainingβ2,334Updated last week
- Optimizing inference proxy for LLMsβ3,192Updated last week
- Tool for generating high quality Synthetic datasetsβ1,400Updated last month
- DataComp for Language Modelsβ1,394Updated 2 months ago
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.β4,735Updated 4 months ago
- Environments for LLM Reinforcement Learningβ3,514Updated this week
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agentsβ1,861Updated last month
- [ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoningβ2,099Updated 3 weeks ago
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, imβ¦β2,966Updated last month
- Fully open data curation for reasoning modelsβ2,147Updated 2 months ago
- Witness the aha moment of VLM with less than $3.β3,994Updated 6 months ago