zsnoob / EfficientDDP-4-Contrastive-TrainLinks
Optimizing the way of contrastive learning in PyTorch-DDP(DistributedDataParallel) multi-GPU training
☆36Updated 2 years ago
Alternatives and similar repositories for EfficientDDP-4-Contrastive-Train
Users that are interested in EfficientDDP-4-Contrastive-Train are comparing it to the libraries listed below
Sorting:
- Visualizing the attention of vision-language models☆277Updated 11 months ago
- Open source implementation of "Vision Transformers Need Registers"☆209Updated last week
- Processed / Cleaned Data for Paper Copilot☆833Updated this week
- Best Papers of Top Venues like CVPR, NeurIPS, ICLR, ICML, ICCV, ECCV, ...☆272Updated last month
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆233Updated 8 months ago
- ☆204Updated last month
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆178Updated 2 years ago
- A collection of papers on discrete diffusion models☆168Updated 7 months ago
- MokA: Multimodal Low-Rank Adaptation for MLLMs☆73Updated last month
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆233Updated last year
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆58Updated last year
- ☆79Updated last year
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆199Updated 9 months ago
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆435Updated 3 months ago
- The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink…☆818Updated last month
- A curated list of awesome Multimodal studies.☆312Updated last month
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆247Updated 2 years ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆95Updated last year
- CKA (Centered Kernel Alignment) implemented in PyTorch☆56Updated last month
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆45Updated 4 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2025.☆651Updated this week
- One-shot Entropy Minimization☆188Updated 7 months ago
- The official PyTorch implementation of the paper "MLAE: Masked LoRA Experts for Visual Parameter-Efficient Fine-Tuning"☆28Updated last year
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues☆44Updated 8 months ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆168Updated 3 years ago
- [TMLR 2025] Efficient Reasoning Models: A Survey☆296Updated 3 weeks ago
- Collection of papers on state-space models☆615Updated 2 months ago
- Awesome Low-Rank Adaptation☆59Updated 5 months ago
- ☆64Updated last year
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆102Updated 3 weeks ago