merveenoyan / smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. π
β1,093Updated 3 weeks ago
Alternatives and similar repositories for smol-vision:
Users that are interested in smol-vision are comparing it to the libraries listed below
- The code used to train and run inference with the ColPali architecture.β1,386Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β693Updated 2 months ago
- β1,403Updated last week
- β590Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β1,879Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,238Updated last month
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β946Updated last week
- Everything about the SmolLM & SmolLM2 family of modelsβ1,554Updated last week
- Bringing BERT into modernity via both architecture changes and scalingβ1,045Updated last week
- streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VLβ1,427Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.β706Updated this week
- 4M: Massively Multimodal Masked Modelingβ1,666Updated 3 months ago
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"β831Updated last month
- ποΈ + π¬ + π§ = π€ Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]β596Updated 10 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ970Updated this week
- Curated list of datasets and tools for post-training.β2,467Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β246Updated last month
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024β1,649Updated this week
- π Automatically annotate papers using LLMsβ259Updated 3 weeks ago
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.β1,992Updated last month
- Automatically evaluate your LLMs in Google Colabβ575Updated 8 months ago
- Scalable data pre processing and curation toolkit for LLMsβ743Updated this week
- ReFT: Representation Finetuning for Language Modelsβ1,373Updated 2 weeks ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engineβ426Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 π―β841Updated last week
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.β2,705Updated this week
- β774Updated 4 months ago
- Large Concept Models: Language modeling in a sentence representation spaceβ1,713Updated this week
- System 2 Reasoning Link Collectionβ722Updated this week
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.β547Updated this week