Recipes for shrinking, optimizing, and customizing cutting-edge vision models. 💜
☆1,914Jan 9, 2026Updated 4 months ago
Alternatives and similar repositories for smol-vision
Users that are interested in smol-vision are comparing it to the libraries listed below.
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆847Jan 28, 2025Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆356Jun 2, 2025Updated 11 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,610Apr 21, 2026Updated 2 weeks ago
- Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,671May 1, 2026Updated last week
- Everything about the SmolLM and SmolVLM family of models☆3,755Apr 2, 2026Updated last month
- A course on aligning smol models.☆6,638Apr 17, 2026Updated 3 weeks ago
- This repository contains demos I made with the Transformers library by HuggingFace.☆11,620Apr 20, 2026Updated 2 weeks ago
- tiny vision language model☆9,651Apr 20, 2026Updated 2 weeks ago
- ☆698Apr 30, 2025Updated last year
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,850Oct 27, 2025Updated 6 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,918May 17, 2025Updated 11 months ago
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf☆197Apr 1, 2024Updated 2 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,209Apr 27, 2026Updated last week
- Quick exploration into fine tuning florence 2☆340Sep 19, 2024Updated last year
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,436May 19, 2025Updated 11 months ago
- Go ahead and axolotl questions☆11,842May 1, 2026Updated last week
- Fast State-of-the-Art Static Embeddings☆2,053Updated this week
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.☆1,165Jan 23, 2025Updated last year
- Tools for merging pretrained large language models.☆7,052Mar 15, 2026Updated last month
- DSPy: The framework for programming—not prompting—language models☆34,180May 2, 2026Updated last week
- Late Interaction Models Training & Retrieval☆796Apr 30, 2026Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,337May 1, 2026Updated last week
- Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆63,536Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,707Apr 24, 2026Updated 2 weeks ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆500Jul 23, 2025Updated 9 months ago
- Structured Outputs☆13,776Apr 16, 2026Updated 3 weeks ago
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l…☆9,364Mar 27, 2026Updated last month
- Fast BM25 search in Python, powered by Numpy and Numba☆1,656Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆3,033Apr 20, 2026Updated 2 weeks ago
- 4M: Massively Multimodal Masked Modeling☆1,794Jun 2, 2025Updated 11 months ago
- Robust recipes to align language models with human and AI preferences☆5,593Apr 8, 2026Updated last month
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 9 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,613Dec 20, 2025Updated 4 months ago
- Structured information extraction from documents☆316Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,318Apr 21, 2026Updated 2 weeks ago
- PyTorch native post-training library☆5,750May 1, 2026Updated last week
- Train transformer language models with reinforcement learning.☆18,282Updated this week
- Build local voice agents with open-source models☆4,716Updated this week
- Machine Learning Engineering Open Book☆17,854Mar 16, 2026Updated last month