IMPlus-PCALab / GrowGrowUp
Some experiences for new researchers to grow grow up
☆39Updated 2 years ago
Alternatives and similar repositories for GrowGrowUp:
Users that are interested in GrowGrowUp are comparing it to the libraries listed below
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆83Updated 6 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)☆85Updated 2 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆20Updated last week
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 5 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆28Updated last year
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆69Updated 5 months ago
- [ECCV24] The official code repository for paper "Training-Free Model Merging for Multi-target Domain Adaptation".☆13Updated 5 months ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆86Updated last year
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆38Updated this week
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆35Updated 7 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆32Updated 9 months ago
- The repository contains the official implementation of "Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation"☆37Updated 2 weeks ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆66Updated last month
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆27Updated this week
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆57Updated 3 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆28Updated this week
- [ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders☆15Updated last month
- cliptrase☆33Updated 6 months ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆20Updated 3 weeks ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆69Updated 8 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆81Updated last year
- ☆25Updated 9 months ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆77Updated 8 months ago
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆25Updated 4 months ago
- FreeVA: Offline MLLM as Training-Free Video Assistant☆57Updated 9 months ago
- [CVPR2022, TPAMI2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆20Updated 2 months ago
- Code for "DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets", accepted at Neurips 2023 (Main confer…☆21Updated 11 months ago