uncbiag / Awesome-Foundation-ModelsLinks

A curated list of foundation models for vision and language tasks

☆1,112

Alternatives and similar repositories for Awesome-Foundation-Models

Users that are interested in Awesome-Foundation-Models are comparing it to the libraries listed below

Sorting:

Computer-Vision-in-the-Wild / CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
☆1,340Updated last year
awaisrauf / Awesome-CV-Foundational-Models
☆530Updated 11 months ago
JindongGu / Awesome-Prompting-on-Vision-Language-Model
This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation …
☆494Updated 7 months ago
NVlabs / RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
☆1,376Updated 2 weeks ago
EdisonLeeeee / Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).
☆856Updated last year
DirtyHarryLYL / LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
☆873Updated 7 months ago
gokayfem / awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
☆1,064Updated 8 months ago
mbzuai-oryx / groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses tha…
☆922Updated 2 months ago
DmitryRyumin / ICCV-2023-25-Papers
ICCV 2023-2025 Papers: Discover cutting-edge research from ICCV 2023-25, the leading computer vision conference. Stay updated on the late…
☆954Updated this week
yzhuoning / Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
☆1,219Updated last year
google-research / big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
☆3,203Updated 5 months ago
OpenGVLab / VisionLLM
VisionLLM Series
☆1,119Updated 8 months ago
facebookresearch / hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
☆1,036Updated last year
DmitryRyumin / CVPR-2023-24-Papers
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest d…
☆457Updated last year
ttengwang / Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based paper in computer vision and vision-language learning.
☆925Updated last year
Qinying-Liu / Awesome-Open-Vocabulary-Semantic-Segmentation
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
☆751Updated last week
facebookresearch / perception_models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆1,698Updated last month
mhamilton723 / FeatUp
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
☆1,594Updated last year
KMnP / vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
☆1,177Updated 2 years ago
facebookresearch / multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
☆1,661Updated this week
facebookresearch / MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…
☆1,699Updated last month
huytransformer / Awesome-Out-Of-Distribution-Detection
Out-of-distribution detection, robustness, and generalization resources. The repository contains a curated list of papers, tutorials, boo…
☆956Updated last month
mlfoundations / wise-ft
Robust fine-tuning of zero-shot models
☆744Updated 3 years ago
jianzongwu / Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
☆956Updated 7 months ago
SunzeY / AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
☆847Updated 3 months ago
dvlab-research / LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
☆2,458Updated 8 months ago
allenai / visprog
Official code for VisProg (CVPR 2023 Best Paper!)
☆749Updated last year
JamesQFreeman / LoRA-ViT
Low rank adaptation for Vision Transformer
☆425Updated last year
czczup / ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
☆1,429Updated 4 months ago
tim-learn / awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
☆1,118Updated 2 weeks ago