SuperBruceJia / Awesome-Large-Vision-Language-Model
Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Models
☆15Updated 2 months ago
Related projects
Alternatives and complementary repositories for Awesome-Large-Vision-Language-Model
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆28Updated last month
- ☆32Updated this week
- Symile is a flexible, architecture-agnostic contrastive loss that enables training modality-specific representations for any number of mo…☆16Updated last week
- HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Vision-Language M…☆13Updated 3 months ago
- ☆31Updated last month
- Bag of MLP☆20Updated 3 years ago
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Updated 7 months ago
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆23Updated last year
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]☆20Updated last year
- Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis"☆49Updated 2 months ago
- ☆16Updated last month
- ☆19Updated 3 months ago
- ☆30Updated this week
- Official code repository for the WACV 2022 paper "Visualizing Paired Image Similarity in Transformer Networks"☆20Updated 2 years ago
- [BMVC 2022] Information Theoretic Representation Distillation☆18Updated last year
- ☆26Updated last year
- ☆14Updated 2 years ago
- Implementation of "PaLM2-VAdapter" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆16Updated last week
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated 5 months ago
- Code for the ICML 2023 paper "Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation"☆35Updated last year
- Code for the paper "Interpretability-Aware Vision Transformer"☆22Updated last year
- Advances in recent large vision language models (LVLMs)☆13Updated 2 months ago
- PyTorch implementation of the paper "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆23Updated this week
- ☆20Updated 3 years ago
- [WACV2023] This is the official PyTorch implementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Updated last year
- Official code for the paper "Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation"☆25Updated 10 months ago
- ☆28Updated 2 weeks ago
- Official implementation of "Continual Learning by Modeling Intra-Class Variation" (MOCA). [TMLR 2023]☆16Updated last year
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆37Updated last year
- ☆19Updated last month