chenshuang-zhang / imagenet_d
[CVPR 2024 Highlight] ImageNet-D
☆41 · Updated 4 months ago
Alternatives and similar repositories for imagenet_d:
Users who are interested in imagenet_d are comparing it to the repositories listed below:
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆32 · Updated 8 months ago
- [CVPR 2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt… ☆36 · Updated 2 months ago
- Official Repository of Personalized Visual Instruct Tuning ☆26 · Updated 3 months ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives" ☆34 · Updated 2 months ago
- [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation" ☆30 · Updated 6 months ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights ☆24 · Updated 3 months ago
- Benchmarking and Analyzing Generative Data for Visual Recognition ☆26 · Updated last year
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models ☆62 · Updated 6 months ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation ☆22 · Updated 3 weeks ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones". ☆27 · Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023) ☆38 · Updated last year
- Code for CVPR 2024 Oral "Neural Lineage" ☆16 · Updated 8 months ago
- Official code for ICLR 2024 paper Do Generated Data Always Help Contrastive Learning? ☆30 · Updated 10 months ago
- PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models ☆25 · Updated 2 months ago
- ☆52 · Updated last year
- Official PyTorch Code for "Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?" (https://arxiv.org/abs/2305.12954) ☆45 · Updated last year
- ☆16 · Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation". ☆20 · Updated 4 months ago
- (ECCV 2024) Can OOD Object Detectors Learn from Foundation Models? ☆25 · Updated 2 months ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs. ☆91 · Updated 10 months ago
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect… ☆35 · Updated 8 months ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs ☆25 · Updated last month
- Official implementation of LaVin-DiT ☆20 · Updated 3 weeks ago
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24) ☆25 · Updated 3 months ago
- Augmenting with Language-guided Image Augmentation (ALIA) ☆73 · Updated last year
- ☆30 · Updated last week
- ☆27 · Updated last year
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models" ☆19 · Updated 3 months ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets" ☆18 · Updated 2 months ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation" ☆25 · Updated 8 months ago