huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
★3,314 Updated last month
Alternatives and similar repositories for smollm
Users who are interested in smollm are comparing it to the repositories listed below.
- Recipes for shrinking, optimizing, customizing cutting edge vision models. ★1,635 Updated last month
- The simplest, fastest repository for training/finetuning small-sized VLMs. ★4,100 Updated last month
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ★1,374 Updated 5 months ago
- Code for BLT research paper ★1,989 Updated 4 months ago
- Large Concept Models: Language modeling in a sentence representation space ★2,290 Updated 8 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ★2,901 Updated this week
- nanoGPT style version of Llama 3.1 ★1,432 Updated last year
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ★1,685 Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch ★1,609 Updated 5 months ago
- Sky-T1: Train your own O1 preview model within $450 ★3,341 Updated 3 months ago
- A course on aligning smol models. ★6,440 Updated 2 weeks ago
- Fast State-of-the-Art Static Embeddings ★1,858 Updated last week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ★3,076 Updated 4 months ago
- Synthetic data curation for post-training and structured data extraction ★1,526 Updated 2 months ago
- Optimizing inference proxy for LLMs ★2,988 Updated 2 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ★3,484 Updated 4 months ago
- Bringing BERT into modernity via both architecture changes and scaling ★1,537 Updated 3 months ago
- ★3,030 Updated last year
- NanoGPT (124M) in 3 minutes ★3,176 Updated 2 months ago
- The Open Cookbook for Top-Tier Code Large Language Model ★1,925 Updated 10 months ago
- A Self-adaptation Framework that adapts LLMs for unseen tasks in real-time! ★1,150 Updated 8 months ago
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents ★1,821 Updated last week
- Textbook on reinforcement learning from human feedback ★1,259 Updated 2 weeks ago
- Witness the aha moment of VLM with less than $3. ★3,955 Updated 4 months ago
- Tools for merging pretrained large language models. ★6,352 Updated 3 weeks ago
- [ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning ★2,076 Updated last month
- Democratizing Reinforcement Learning for LLMs ★4,458 Updated this week
- Tool for generating high quality Synthetic datasets ★1,282 Updated 2 weeks ago
- 4M: Massively Multimodal Masked Modeling ★1,765 Updated 4 months ago
- Minimalistic large language model 3D-parallelism training ★2,252 Updated last month