Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
☆1,894Jan 9, 2026Updated 2 months ago
Alternatives and similar repositories for smol-vision
Users that are interested in smol-vision are comparing it to the libraries listed below
Sorting:
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆845Jan 28, 2025Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆354Jun 2, 2025Updated 9 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,542Mar 1, 2026Updated last week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,660Mar 2, 2026Updated last week
- Everything about the SmolLM and SmolVLM family of models☆3,652Jan 13, 2026Updated last month
- A course on aligning smol models.☆6,590Feb 6, 2026Updated last month
- tiny vision language model☆9,386Nov 14, 2025Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,868May 17, 2025Updated 9 months ago
- ☆697Apr 30, 2025Updated 10 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,114Mar 2, 2026Updated last week
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,511Updated this week
- Quick exploration into fine tuning florence 2☆338Sep 19, 2024Updated last year
- Go ahead and axolotl questions☆11,395Updated this week
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,685Oct 27, 2025Updated 4 months ago
- Fast State-of-the-Art Static Embeddings☆2,007Feb 28, 2026Updated last week
- Late Interaction Models Training & Retrieval☆740Updated this week
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,371May 19, 2025Updated 9 months ago
- Tools for merging pretrained large language models.☆6,842Feb 28, 2026Updated last week
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf☆198Apr 1, 2024Updated last year
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆2,212Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,519Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,206Mar 1, 2026Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,915Updated this week
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.☆1,157Jan 23, 2025Updated last year
- Structured Outputs☆13,488Mar 2, 2026Updated last week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆53,029Updated this week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,507Feb 17, 2026Updated 2 weeks ago
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l…☆9,236Feb 26, 2026Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,392Mar 1, 2026Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,602Dec 20, 2025Updated 2 months ago
- Robust recipes to align language models with human and AI preferences☆5,510Sep 8, 2025Updated 6 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,234Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,732May 21, 2025Updated 9 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆4,486Updated this week
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,129Updated this week
- PyTorch native post-training library☆5,697Updated this week
- ☆2,188Jan 9, 2026Updated 2 months ago
- Train transformer language models with reinforcement learning.☆17,523Updated this week
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year