dvlab-research / MOOD
Official PyTorch implementation of the MOOD series: (1) MOODv1: Rethinking Out-of-distribution Detection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution Detection.
☆133 Updated 4 months ago
Related projects
Alternatives and complementary repositories for MOOD
- [AAAI 2023] Zero-Shot Enhancement of CLIP with Parameter-free Attention ☆83 Updated last year
- Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization". ☆199 Updated 7 months ago
- PyTorch implementation for the paper Class-incremental Novel Class Discovery (ECCV 2022) ☆98 Updated 2 months ago
- [CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection ☆72 Updated 4 months ago
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator ☆108 Updated last month
- GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition? ☆207 Updated 6 months ago
- GMoE could be the next backbone model for many kinds of generalization task. ☆296 Updated last year
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models ☆85 Updated 8 months ago
- [ICCV'23] Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation ☆142 Updated 7 months ago
- [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization. ☆262 Updated 4 months ago
- [NeurIPS 2021] Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning ☆115 Updated 3 years ago
- Code release for "UniVS: Unified and Universal Video Segmentation with Prompts as Queries" (CVPR2024) ☆170 Updated 4 months ago
- [ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation. ☆81 Updated last month
- [ICPR'24 Oral] Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery ☆34 Updated 4 months ago
- [ECCV2022] Learning Quality-aware Dynamic Memory for Video Object Segmentation ☆142 Updated last year
- [Survey] Awesome List of Mixup Augmentation and Beyond (https://arxiv.org/abs/2409.05202) ☆135 Updated last month
- FACTUAL benchmark dataset, the pre-trained textual scene graph parser trained on FACTUAL. ☆104 Updated 2 weeks ago
- [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models ☆220 Updated 4 months ago
- Mathematical Visual Instruction Tuning for Multi-modal Large Language Models ☆109 Updated 3 months ago
- ☆101 Updated 8 months ago
- [ICCV 2023] BoxSnake official repository. ☆67 Updated 5 months ago
- [ECCV'22 Oral] Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving Scenes ☆141 Updated last year
- (ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'… ☆56 Updated 7 months ago
- Code for AAAI 2024 paper: Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects ☆139 Updated last month
- Official Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion" ☆84 Updated 3 months ago
- Visualization of DiT self attention features ☆163 Updated 3 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023) ☆66 Updated last year
- 🚀 [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2… ☆66 Updated 2 weeks ago
- Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022] ☆128 Updated last month
- 【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling? ☆75 Updated 9 months ago