THU-MIG / PYRA
The official implementation of our ECCV 2024 publication, PYRA (Parallel Yielding Re-Activation).
☆13Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for PYRA
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆64Updated last month
- FreeVA: Offline MLLM as Training-Free Video Assistant☆49Updated 5 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆60Updated 4 months ago
- Official implementation of TagAlign☆32Updated 7 months ago
- ☆12Updated last week
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆52Updated 3 months ago
- ☆29Updated 8 months ago
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆55Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆27Updated 2 years ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆57Updated 2 months ago
- ☆32Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆35Updated last year
- The official implementation of our ICCV 2023 publication, C-VisDiT☆10Updated last month
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆45Updated 4 months ago
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆42Updated 3 weeks ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆41Updated 9 months ago
- ☆14Updated 6 months ago
- ☆16Updated last year
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆29Updated last year
- ☆22Updated last year
- This is the official repo for the incoming work: ByteVideoLLM☆15Updated 3 weeks ago
- ☆28Updated last month
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆72Updated 2 months ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆41Updated last year
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆94Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆64Updated 3 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆70Updated 3 months ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆51Updated 10 months ago