merveenoyan / smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
☆1,875 · Jan 9, 2026 · Updated last month
Alternatives and similar repositories for smol-vision
Users that are interested in smol-vision are comparing it to the libraries listed below
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆842 · Jan 28, 2025 · Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳 ☆353 · Jun 2, 2025 · Updated 8 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. ☆2,512 · Feb 3, 2026 · Updated 2 weeks ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,659 · Updated this week
- Everything about the SmolLM and SmolVLM family of models ☆3,621 · Jan 13, 2026 · Updated last month
- A course on aligning smol models. ☆6,577 · Feb 6, 2026 · Updated last week
- tiny vision language model ☆9,329 · Nov 14, 2025 · Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,852 · May 17, 2025 · Updated 9 months ago
- ☆696 · Apr 30, 2025 · Updated 9 months ago
- This repository contains demos I made with the Transformers library by HuggingFace. ☆11,502 · Jan 13, 2026 · Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,095 · Feb 9, 2026 · Updated last week
- Quick exploration into fine tuning florence 2 ☆339 · Sep 19, 2024 · Updated last year
- Go ahead and axolotl questions ☆11,289 · Updated this week
- Late Interaction Models Training & Retrieval ☆711 · Updated this week
- The simplest, fastest repository for training/finetuning small-sized VLMs. ☆4,647 · Oct 27, 2025 · Updated 3 months ago
- Fast State-of-the-Art Static Embeddings ☆1,996 · Feb 8, 2026 · Updated last week
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆3,355 · May 19, 2025 · Updated 8 months ago
- Tools for merging pretrained large language models. ☆6,783 · Jan 26, 2026 · Updated 3 weeks ago
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf ☆198 · Apr 1, 2024 · Updated last year
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆2,135 · Updated this week
- DSPy: The framework for programming—not prompting—language models ☆32,156 · Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,155 · Feb 8, 2026 · Updated last week
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. ☆1,155 · Jan 23, 2025 · Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,885 · Updated this week
- Structured Outputs ☆13,403 · Feb 6, 2026 · Updated last week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆51,922 · Updated this week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ☆1,486 · Feb 4, 2026 · Updated last week
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l… ☆9,172 · Feb 3, 2026 · Updated 2 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,263 · Feb 4, 2026 · Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,593 · Dec 20, 2025 · Updated last month
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆18,199 · Nov 3, 2025 · Updated 3 months ago
- Robust recipes to align language models with human and AI preferences ☆5,495 · Sep 8, 2025 · Updated 5 months ago
- Structured data extraction and instruction calling with ML, LLM and Vision LLM ☆5,122 · Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,719 · May 21, 2025 · Updated 8 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆4,456 · Updated this week
- ☆2,166 · Jan 9, 2026 · Updated last month
- 🤗 smolagents: a barebones library for agents that think in code. ☆25,422 · Jan 23, 2026 · Updated 3 weeks ago
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,754 · Jul 18, 2025 · Updated 6 months ago