sotayang / SEIZELinks
[ACM MM'2024] Official repository for "Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval"
☆43Updated last year
Alternatives and similar repositories for SEIZE
Users that are interested in SEIZE are comparing it to the libraries listed below
Sorting:
- PyTorch implementation for "Unlearning the Noisy Correspondence Makes CLIP More Robust (ICCV 2025)"☆70Updated 4 months ago
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs☆163Updated 11 months ago
- Explain Before You Answer: A Survey on Compositional Visual Reasoning☆307Updated 3 months ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆518Updated 7 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better☆186Updated last week
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆66Updated last month
- **Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.☆351Updated 3 months ago
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation☆274Updated 5 months ago
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Experts☆87Updated 3 months ago
- [SIGIR'2024 Best Paper Honorable Mention] Official repository for "LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Compose…☆72Updated 10 months ago
- The official repository for ArGue: Attribute-Guided Prompt Tuning For Vision-Language Models☆141Updated last year
- ☆207Updated 8 months ago
- [NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering☆105Updated 10 months ago
- Official repository of MMGenBench☆120Updated 11 months ago
- Code for paper 'Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity…☆92Updated last year
- 🔥 [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptat…☆75Updated last year
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation.☆119Updated 5 months ago
- High Quality Video Reasoning Segmentation☆144Updated 2 months ago
- ☆69Updated 6 months ago
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity☆117Updated last month
- ☆90Updated last year
- Official implementation of "DAW: Exploring the Better Weighting Function for Semi-supervised Semantic Segmentation" (NeurIPS 2023)☆36Updated 11 months ago
- (IJCV 2024 & ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation☆119Updated 3 years ago
- Official repo for "More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models"☆77Updated 4 months ago
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep…☆574Updated 5 months ago
- ☆198Updated 4 months ago
- Practical New Tasks and Inspiring Modeling Solutions for Diverse Open Vision Problems☆139Updated 4 months ago
- Pytorch implementation for Negation-Aware Test-Time Adaptation for Vision-Language Models.☆35Updated 6 months ago
- Source code for our CVPR paper Learning from Noisy Labels with Decoupled Meta Label Purifier☆73Updated 2 years ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆350Updated last month