SkalskiP / awesome-foundation-and-multimodal-modelsLinks

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

☆636

Alternatives and similar repositories for awesome-foundation-and-multimodal-models

Users that are interested in awesome-foundation-and-multimodal-models are comparing it to the libraries listed below

Sorting:

SkalskiP / top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
☆741Updated 4 months ago
merveenoyan / smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
☆1,642Updated last month
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆244Updated last year
SkunkworksAI / BakLLaVA
☆714Updated last year
merveenoyan / awesome-osml-for-devs
List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf
☆198Updated last year
andimarafioti / florence2-finetuning
Quick exploration into fine tuning florence 2
☆334Updated last year
camenduru / LLaVA-colab
☆227Updated last year
roboflow / maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
☆2,641Updated this week
kyegomez / MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest…
☆458Updated 2 weeks ago
ContextualAI / lens
This is the official repository for the LENS (Large Language Models Enhanced to See) system.
☆354Updated 3 months ago
SkalskiP / top-cvpr-2023-papers
This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code]
☆653Updated 4 months ago
voxel51 / voxelgpt
AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions
☆248Updated 10 months ago
LLaVA-VL / LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
☆758Updated last year
rasbt / dora-from-scratch
LoRA and DoRA from Scratch Implementations
☆211Updated last year
huggingface / computer-vision-course
This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face …
☆723Updated last month
apple / ml-4m
4M: Massively Multimodal Masked Modeling
☆1,764Updated 4 months ago
apple / ml-aim
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
☆1,377Updated 2 months ago
penghao-wu / vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
☆681Updated last year
facebookresearch / MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…
☆1,694Updated 3 weeks ago
mistralai-sf24 / hackathon
☆446Updated last year
merveenoyan / siglip
Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗
☆277Updated 8 months ago
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆289Updated 7 months ago
gokayfem / awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
☆1,041Updated 8 months ago
amitsangani / Llama
All the projects related to Llama
☆379Updated 6 months ago
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆117Updated 6 months ago
muellerzr / minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆195Updated last year
apple / ml-veclip
The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"
☆246Updated 9 months ago
osanseviero / ml_timeline
☆587Updated 2 years ago
mbzuai-oryx / groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses tha…
☆922Updated 2 months ago
voxel51 / papers-with-data
A curated list of papers that released datasets along with their work
☆126Updated last year