zsnoob / EfficientDDP-4-Contrastive-TrainLinks
Optimizing the way of contrastive learning in PyTorch-DDP(DistributedDataParallel) multi-GPU training
☆34Updated last year
Alternatives and similar repositories for EfficientDDP-4-Contrastive-Train
Users that are interested in EfficientDDP-4-Contrastive-Train are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆222Updated 9 months ago
- ☆101Updated 6 months ago
- Best Papers of Top Venues like CVPR, NeurIPS, ICLR, ICML, ICCV, ECCV, ...☆164Updated last month
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆222Updated 3 months ago
- Open source implementation of "Vision Transformers Need Registers"☆191Updated 2 weeks ago
- Visualizing the attention of vision-language models☆236Updated 6 months ago
- The official PyTorch implementation of the paper "MLAE: Masked LoRA Experts for Visual Parameter-Efficient Fine-Tuning"☆28Updated 9 months ago
- Awesome Low-Rank Adaptation☆44Updated last month
- Processed / Cleaned Data for Paper Copilot☆577Updated 2 weeks ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆91Updated 11 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆33Updated 2 weeks ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆52Updated 8 months ago
- ☆27Updated 6 months ago
- ☆69Updated 10 months ago
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues☆42Updated 4 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆102Updated 11 months ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆243Updated last year
- Survey: https://arxiv.org/pdf/2507.20198☆145Updated 2 weeks ago
- A curated list of awesome Multimodal studies.☆271Updated 2 months ago
- ☆80Updated last year
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆319Updated 11 months ago
- Code for our ICML'24 on multimodal dataset distillation☆39Updated 11 months ago
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆85Updated 3 months ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)☆50Updated 8 months ago
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆171Updated last year
- The trainer for HF to record losses of different tasks and objectives.☆46Updated 6 months ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆38Updated 10 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆133Updated 2 months ago
- A collection of papers on discrete diffusion models☆161Updated 2 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆374Updated 9 months ago