zsnoob / EfficientDDP-4-Contrastive-TrainLinks
Optimizing the way of contrastive learning in PyTorch-DDP(DistributedDataParallel) multi-GPU training
☆35Updated 2 years ago
Alternatives and similar repositories for EfficientDDP-4-Contrastive-Train
Users that are interested in EfficientDDP-4-Contrastive-Train are comparing it to the libraries listed below
Sorting:
- Best Papers of Top Venues like CVPR, NeurIPS, ICLR, ICML, ICCV, ECCV, ...☆266Updated 3 weeks ago
- Open source implementation of "Vision Transformers Need Registers"☆204Updated 2 months ago
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆233Updated 7 months ago
- Visualizing the attention of vision-language models☆270Updated 10 months ago
- The official PyTorch implementation of the paper "MLAE: Masked LoRA Experts for Visual Parameter-Efficient Fine-Tuning"☆28Updated last year
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆58Updated 11 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆234Updated last year
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues☆42Updated 7 months ago
- MokA: Multimodal Low-Rank Adaptation for MLLMs☆62Updated last week
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆96Updated last year
- ☆201Updated 2 weeks ago
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆178Updated 2 years ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆246Updated last year
- ☆77Updated last year
- Processed / Cleaned Data for Paper Copilot☆803Updated last month
- CKA (Centered Kernel Alignment) implemented in PyTorch☆52Updated 3 weeks ago
- One-shot Entropy Minimization☆187Updated 6 months ago
- A paper list of Awesome Latent Space.☆276Updated last week
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆168Updated 3 years ago
- ☆83Updated last year
- ☆154Updated 10 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆99Updated last year
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆107Updated last year
- CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification☆105Updated last year
- ☆45Updated 4 months ago
- A curated list of awesome Multimodal studies.☆308Updated 3 weeks ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆311Updated 8 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey☆290Updated last week
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆365Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆68Updated last year