PKU-ICST-MIPL / DyFo_CVPR2025Links
☆22Updated 3 weeks ago
Alternatives and similar repositories for DyFo_CVPR2025
Users that are interested in DyFo_CVPR2025 are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆58Updated 7 months ago
- ☆84Updated last year
- ☆44Updated 5 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆50Updated 3 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆20Updated 3 months ago
- ☆78Updated 6 months ago
- Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"☆26Updated 2 months ago
- ☆62Updated last month
- (NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"☆27Updated 2 months ago
- The official implement of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"☆143Updated last week
- ☆28Updated 4 months ago
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆103Updated 2 years ago
- Open-vocabulary Semantic Segmentation☆34Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆43Updated 4 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆29Updated last year
- ☆32Updated last year
- GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)☆67Updated last year
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)☆91Updated 3 weeks ago
- Official implementation of TagAlign☆35Updated 5 months ago
- Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆143Updated 5 months ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆79Updated 11 months ago
- [CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception☆55Updated 2 weeks ago
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"☆39Updated last month
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆53Updated 7 months ago
- This is the official repo for ByteVideoLLM/Dynamic-VLM☆20Updated 5 months ago
- ☆67Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆52Updated last year
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference☆37Updated 2 months ago
- ☆29Updated 11 months ago
- Zone Evaluation: Revealing Spatial Bias in Object Detection (TPAMI 2024)☆44Updated 6 months ago