chenshuang-zhang / imagenet_d
β36Updated 4 months ago
Related projects: β
- π₯ [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"β23Updated 3 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Modelsβ35Updated last month
- β52Updated last year
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Promptβ¦β26Updated 2 months ago
- Benchmarking and Analyzing Generative Data for Visual Recognitionβ26Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)β33Updated 3 weeks ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"β26Updated 3 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inferenceβ37Updated 3 weeks ago
- β57Updated last year
- Official code for ICLR 2024 paper Do Generated Data Always Help Contrastive Learning?β25Updated 5 months ago
- Code for paper "Unsegment Anything by Simulating Deformation" (CVPR 2024)β21Updated 3 months ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]β47Updated 9 months ago
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)β32Updated last year
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.β84Updated 5 months ago
- Official PyTorch Code for "Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?" (https://arxiv.org/abs/2305.12954)β43Updated 9 months ago
- MIMIC: Masked Image Modeling with Image Correspondencesβ15Updated 3 months ago
- Augmenting with Language-guided Image Augmentation (ALIA)β62Updated 10 months ago
- β20Updated 9 months ago
- MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentationβ24Updated last year
- Towards Unified and Effective Domain Generalizationβ28Updated 9 months ago
- β13Updated this week
- β13Updated this week
- (ICLR 2024, CVPR 2024) SparseFormerβ62Updated 5 months ago
- Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insightsβ16Updated 3 months ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learningβ¦β19Updated last month
- β41Updated this week
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentationβ35Updated last year
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)β43Updated last year
- REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasetsβ11Updated 11 months ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"β24Updated 4 months ago