huggingface / smollmLinks
Everything about the SmolLM2 and SmolVLM family of models
☆2,623Updated last week
Alternatives and similar repositories for smollm
Users that are interested in smollm are comparing it to the libraries listed below
Sorting:
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,625Updated last week
- Democratizing Reinforcement Learning for LLMs☆3,600Updated this week
- nanoGPT style version of Llama 3.1☆1,390Updated 10 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,305Updated 2 months ago
- Synthetic data curation for post-training and structured data extraction☆1,425Updated last week
- A course on aligning smol models.☆5,990Updated this week
- Optimizing inference proxy for LLMs☆2,589Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,796Updated last week
- ☆2,977Updated 9 months ago
- NanoGPT (124M) in 3 minutes☆2,751Updated 2 weeks ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,944Updated last month
- Witness the aha moment of VLM with less than $3.☆3,821Updated last month
- Fast State-of-the-Art Static Embeddings☆1,746Updated last month
- Official PyTorch implementation for "Large Language Diffusion Models"☆2,480Updated 3 weeks ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,458Updated 2 months ago
- Textbook on reinforcement learning from human feedback☆1,068Updated this week
- Code for BLT research paper☆1,720Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,684Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆1,426Updated this week
- Large Concept Models: Language modeling in a sentence representation space☆2,239Updated 5 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,507Updated this week
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,745Updated last month
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆3,390Updated last week
- Minimalistic large language model 3D-parallelism training☆1,965Updated last week
- An Open Large Reasoning Model for Real-World Solutions☆1,502Updated last month
- OLMoE: Open Mixture-of-Experts Language Models☆798Updated 3 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,490Updated 5 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆780Updated 3 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,263Updated last month
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,071Updated 10 months ago