huggingface / smollmLinks
Everything about the SmolLM and SmolVLM family of models
☆3,602Updated 3 weeks ago
Alternatives and similar repositories for smollm
Users that are interested in smollm are comparing it to the libraries listed below
Sorting:
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,625Updated 3 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,406Updated 9 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,875Updated last month
- Democratizing Reinforcement Learning for LLMs☆5,081Updated this week
- Code for BLT research paper☆2,027Updated 3 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,084Updated 2 weeks ago
- Sky-T1: Train your own O1 preview model within $450☆3,370Updated 7 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,718Updated 8 months ago
- Textbook on reinforcement learning from human feedback☆1,560Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,293Updated 3 weeks ago
- Synthetic data curation for post-training and structured data extraction☆1,626Updated 2 weeks ago
- DataComp for Language Models☆1,416Updated 5 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,332Updated last year
- Fast State-of-the-Art Static Embeddings☆1,992Updated last month
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,187Updated last year
- Bringing BERT into modernity via both architecture changes and scaling☆1,627Updated 7 months ago
- Minimalistic large language model 3D-parallelism training☆2,544Updated 2 months ago
- The Open Cookbook for Top-Tier Code Large Language Model☆2,035Updated last year
- A course on aligning smol models.☆6,579Updated this week
- Optimizing inference proxy for LLMs☆3,317Updated 2 weeks ago
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,895Updated 2 weeks ago
- Tool for generating high quality Synthetic datasets☆1,491Updated 3 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆2,108Updated last week
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,503Updated last week
- Curated list of datasets and tools for post-training.☆4,229Updated 3 months ago
- Fully open data curation for reasoning models☆2,206Updated 2 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆2,050Updated 2 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,762Updated 9 months ago
- Our library for RL environments + evals☆3,809Updated this week
- Tools for merging pretrained large language models.☆6,783Updated 2 weeks ago