huggingface / smollmLinks
Everything about the SmolLM and SmolVLM family of models
β3,539Updated last month
Alternatives and similar repositories for smollm
Users that are interested in smollm are comparing it to the libraries listed below
Sorting:
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.β1,404Updated 8 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,826Updated 2 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.β4,494Updated 2 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β3,015Updated 2 weeks ago
- Large Concept Models: Language modeling in a sentence representation spaceβ2,324Updated 11 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMsβ3,661Updated 7 months ago
- NanoGPT (124M) in 3 minutesβ4,116Updated this week
- Code for BLT research paperβ2,024Updated 2 months ago
- β3,062Updated last month
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speeβ¦β3,114Updated 7 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ2,233Updated this week
- Bringing BERT into modernity via both architecture changes and scalingβ1,607Updated 6 months ago
- DataComp for Language Modelsβ1,404Updated 4 months ago
- Minimalistic large language model 3D-parallelism trainingβ2,407Updated last month
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,729Updated 8 months ago
- Textbook on reinforcement learning from human feedbackβ1,396Updated this week
- AllenAI's post-training codebaseβ3,515Updated this week
- [ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoningβ2,110Updated last month
- Synthetic data curation for post-training and structured data extractionβ1,595Updated last week
- The Open Cookbook for Top-Tier Code Large Language Modelβ1,977Updated last year
- Tool for generating high quality Synthetic datasetsβ1,455Updated 2 months ago
- Fast State-of-the-Art Static Embeddingsβ1,969Updated last week
- Optimizing inference proxy for LLMsβ3,266Updated 2 weeks ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β2,029Updated last month
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.β2,427Updated this week
- PyTorch native post-training libraryβ5,646Updated this week
- β695Updated 8 months ago
- Sky-T1: Train your own O1 preview model within $450β3,367Updated 6 months ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,179Updated 11 months ago
- Witness the aha moment of VLM with less than $3.β4,020Updated 7 months ago