IMPlus-PCALab / GrowGrowUp
Some experiences for new researchers to grow grow up
☆40 Updated 2 years ago
Alternatives and similar repositories for GrowGrowUp:
Users who are interested in GrowGrowUp are comparing it to the repositories listed below:
- [CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi… ☆16 Updated 2 months ago
- ☆12 Updated last year
- Learning 1D Causal Visual Representation with De-focus Attention Networks ☆34 Updated 11 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation ☆29 Updated last year
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆84 Updated 8 months ago
- Segment Anything with Deictic Prompting ☆25 Updated 6 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration" ☆70 Updated 7 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP) ☆89 Updated 2 weeks ago
- Awesome paper for multi-modal llm with grounding ability ☆17 Updated 9 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation" ☆27 Updated last month
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use. ☆23 Updated 4 months ago
- [ECCV24] The official code repository for paper "Training-Free Model Merging for Multi-target Domain Adaptation". ☆14 Updated 7 months ago
- MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation ☆25 Updated last year
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation ☆20 Updated 3 months ago
- The repository contains the official implementation of "Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation" ☆40 Updated 2 months ago
- (NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models" ☆27 Updated last month
- ☆16 Updated 6 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization ☆49 Updated last month
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". … ☆52 Updated 6 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation ☆37 Updated last year
- [IJCV 2024] ☆15 Updated 5 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding ☆49 Updated 3 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution ☆47 Updated 2 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation ☆47 Updated 9 months ago
- [ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders ☆17 Updated 2 months ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM). ☆44 Updated last month
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training ☆39 Updated last month
- ☆16 Updated 4 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta… ☆36 Updated 3 weeks ago
- Official repository of InLine attention (NeurIPS 2024) ☆46 Updated 4 months ago