huggingface / smollmLinks
Everything about the SmolLM2 and SmolVLM family of models
β2,606Updated last week
Alternatives and similar repositories for smollm
Users that are interested in smollm are comparing it to the libraries listed below
Sorting:
- The simplest, fastest repository for training/finetuning small-sized VLMs.β3,625Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,504Updated last month
- Code for BLT research paperβ1,720Updated last month
- Sky-T1: Train your own O1 preview model within $450β3,286Updated last month
- Witness the aha moment of VLM with less than $3.β3,821Updated last month
- Synthetic data curation for post-training and structured data extractionβ1,425Updated last week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.β1,305Updated 2 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,788Updated last week
- Fast State-of-the-Art Static Embeddingsβ1,746Updated last month
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,458Updated 2 months ago
- Bringing BERT into modernity via both architecture changes and scalingβ1,426Updated this week
- Democratizing Reinforcement Learning for LLMsβ3,600Updated this week
- Minimalistic large language model 3D-parallelism trainingβ1,965Updated last week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ2,773Updated 2 weeks ago
- Minimalistic 4D-parallelism distributed training framework for education purposeβ1,561Updated last month
- An Open Large Reasoning Model for Real-World Solutionsβ1,502Updated last month
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the inputβ762Updated 3 weeks ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,448Updated 5 months ago
- Large Concept Models: Language modeling in a sentence representation spaceβ2,239Updated 5 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ1,670Updated this week
- Fully open data curation for reasoning modelsβ1,959Updated last month
- Recipes to scale inference-time compute of open modelsβ1,099Updated last month
- OLMoE: Open Mixture-of-Experts Language Modelsβ792Updated 3 months ago
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agentsβ1,745Updated last month
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoningβ2,023Updated last month
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.β1,426Updated this week
- MoBA: Mixture of Block Attention for Long-Context LLMsβ1,813Updated 3 months ago
- Textbook on reinforcement learning from human feedbackβ1,068Updated this week
- nanoGPT style version of Llama 3.1β1,389Updated 10 months ago
- The Open Cookbook for Top-Tier Code Large Language Modelβ1,735Updated 6 months ago