lili0415 / DPU-OOD-Detection
☆13 Updated 8 months ago
Alternatives and similar repositories for DPU-OOD-Detection
Users who are interested in DPU-OOD-Detection are comparing it to the repositories listed below.
- [NeurIPS 2024] Mixture of Experts for Audio-Visual Learning ☆15 Updated 5 months ago
- [NeurIPS 2024, spotlight] Scaling Out-of-Distribution Detection for Multiple Modalities ☆62 Updated last month
- ☆18 Updated 9 months ago
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps ☆62 Updated 2 months ago
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference ☆39 Updated 4 months ago
- [WIP🚧] 2025 up-to-date list of resources on visual tokenizers (primarily for visual generation). Give it a star 🌟 if you find it useful… ☆14 Updated 6 months ago
- [ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding ☆9 Updated 3 months ago
- Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models ☆108 Updated last week
- ☆28 Updated 3 months ago
- ☆25 Updated 3 months ago
- ☆12 Updated 6 months ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training ☆8 Updated 5 months ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations ☆86 Updated 3 weeks ago
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models ☆68 Updated 2 months ago
- [ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification ☆20 Updated 9 months ago
- The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25). ☆18 Updated 6 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models ☆34 Updated 5 months ago
- ☆21 Updated last year
- Collection of awesome Continual Test-Time Adaptation methods ☆18 Updated last year
- Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment, arXiv 2024 / CVPR 2025 ☆30 Updated 4 months ago
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation". ☆39 Updated 3 months ago
- Official Repository of "Learn to Categorize or Categorize to Learn? Self-Coding for Generalized Category Discovery" (NeurIPS 2023) ☆20 Updated 2 weeks ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering ☆36 Updated 3 months ago
- [CVPR 2025] Official PyTorch Code for "MMRL: Multi-Modal Representation Learning for Vision-Language Models" and its extension "MMRL++: P… ☆57 Updated 3 weeks ago
- Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks ☆14 Updated last month
- Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025) ☆35 Updated 2 months ago
- Advances in recent large vision language models (LVLMs) ☆14 Updated 9 months ago
- ☆18 Updated 6 months ago
- Official Repository of Personalized Visual Instruct Tuning ☆31 Updated 4 months ago
- This is a curated list of "Continual Learning with Pretrained Models" research. ☆18 Updated last month