zsnoob / EfficientDDP-4-Contrastive-Train
Optimizing contrastive learning for multi-GPU training with PyTorch DDP (DistributedDataParallel)
☆32 · Updated last year
Alternatives and similar repositories for EfficientDDP-4-Contrastive-Train:
Users who are interested in EfficientDDP-4-Contrastive-Train are comparing it to the repositories listed below.
- Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition" ☆208 · Updated 5 months ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024) ☆79 · Updated 6 months ago
- ☆53 · Updated 5 months ago
- Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution ☆48 · Updated last year
- XL-VLMs: General Repository for eXplainable Large Vision Language Models ☆21 · Updated 3 months ago
- ☆73 · Updated last month
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning ☆21 · Updated last month
- ✌ CLoG: Benchmarking Continual Learning of Image Generation Models ☆18 · Updated 10 months ago
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025) ☆35 · Updated last month
- Open source implementation of "Vision Transformers Need Registers" ☆175 · Updated 3 weeks ago
- CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification ☆91 · Updated 11 months ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning". ☆44 · Updated last year
- ☆10 · Updated 3 months ago
- Code for our ICML'24 paper on multimodal dataset distillation ☆37 · Updated 6 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆60 · Updated 11 months ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long … ☆87 · Updated 11 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models ☆85 · Updated 6 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs ☆31 · Updated 3 months ago
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models. ☆72 · Updated last year
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? ☆28 · Updated 5 months ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention ☆53 · Updated 4 months ago
- The official PyTorch implementation of the paper "MLAE: Masked LoRA Experts for Visual Parameter-Efficient Fine-Tuning" ☆29 · Updated 4 months ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning ☆153 · Updated 2 years ago
- ☆9 · Updated 3 weeks ago
- Sparse Linear Concept Embeddings ☆91 · Updated last month
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion ☆90 · Updated last year
- Code for ICML 2024 paper (Oral) — Test-Time Model Adaptation with Only Forward Passes ☆76 · Updated 8 months ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass ☆185 · Updated last year
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision ☆10 · Updated 9 months ago
- PyTorch implementation of our CVPR 2024 paper "Unified Entropy Optimization for Open-Set Test-Time Adaptation" ☆20 · Updated 7 months ago