merveenoyan / smol-visionLinks

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

☆1,540

Alternatives and similar repositories for smol-vision

Users that are interested in smol-vision are comparing it to the libraries listed below

Sorting:

huggingface / huggingface-llama-recipes
☆677Updated 3 months ago
AnswerDotAI / byaldi
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
☆806Updated 6 months ago
meta-llama / synthetic-data-kit
Tool for generating high quality Synthetic datasets
☆1,081Updated last week
huggingface / evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…
☆1,498Updated 6 months ago
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆3,032Updated this week
illuin-tech / colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,088Updated this week
roboflow / maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
☆2,604Updated this week
SkalskiP / awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
☆627Updated last year
apple / ml-4m
4M: Massively Multimodal Masked Modeling
☆1,756Updated 2 months ago
SkalskiP / vlms-zero-to-hero
This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.
☆1,110Updated 6 months ago
MinishLab / model2vec
Fast State-of-the-Art Static Embeddings
☆1,782Updated this week
tonywu71 / colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆317Updated 2 months ago
AnswerDotAI / ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
☆1,469Updated last month
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆1,766Updated this week
togethercomputer / together-cookbook
A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.
☆987Updated last week
facebookresearch / blt
Code for BLT research paper
☆1,760Updated 2 months ago
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,821Updated this week
mistralai / cookbook
☆1,927Updated this week
bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆1,464Updated 3 weeks ago
AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,505Updated 2 months ago
huggingface / yourbench
🤗 Benchmark Large Language Models Reliably On Your Data
☆367Updated this week
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆899Updated 3 months ago
PKU-YuanGroup / LLaVA-CoT
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
☆2,037Updated last week
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆231Updated last year
argilla-io / synthetic-data-generator
Build datasets using natural language
☆505Updated 2 months ago
facebookresearch / large_concept_model
Large Concept Models: Language modeling in a sentence representation space
☆2,254Updated 6 months ago
zou-group / textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
☆2,796Updated last week
prometheus-eval / prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
☆974Updated 3 months ago
meta-llama / llama-prompt-ops
An open-source tool for general prompt optimization.
☆576Updated this week
mistralai / mistral-finetune
☆2,990Updated 10 months ago