zsnoob / EfficientDDP-4-Contrastive-TrainLinks
Optimizing the way of contrastive learning in PyTorch-DDP(DistributedDataParallel) multi-GPU training
☆34Updated last year
Alternatives and similar repositories for EfficientDDP-4-Contrastive-Train
Users that are interested in EfficientDDP-4-Contrastive-Train are comparing it to the libraries listed below
Sorting:
- Best Papers of Top Venues like CVPR, NeurIPS, ICLR, ICML, ICCV, ECCV, ...☆181Updated 2 months ago
- Visualizing the attention of vision-language models☆240Updated 7 months ago
- Open source implementation of "Vision Transformers Need Registers"☆194Updated last week
- ☆103Updated 6 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆53Updated 8 months ago
- Processed / Cleaned Data for Paper Copilot☆593Updated last month
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆232Updated 4 months ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆244Updated last year
- ☆28Updated 7 months ago
- ☆69Updated 11 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆37Updated last month
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆227Updated 10 months ago
- A paper list for spatial reasoning☆143Updated 4 months ago
- Survey: https://arxiv.org/pdf/2507.20198☆172Updated this week
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆172Updated last year
- ☆81Updated last year
- Monitor Google Scholar author citation counts and track changes automatically without opening tabs.☆66Updated 2 months ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆185Updated 5 months ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆91Updated last year
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆313Updated last week
- ☆259Updated last year
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆63Updated last month
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆82Updated 2 weeks ago
- CKA (Centered Kernel Alignment) implemented in PyTorch☆44Updated 2 weeks ago
- A collection of papers on discrete diffusion models☆164Updated 3 months ago
- Code for Scaling Language-Free Visual Representation Learning (WebSSL)☆245Updated 5 months ago
- Awesome Low-Rank Adaptation☆47Updated 2 months ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆324Updated last year
- The official PyTorch implementation of the paper "MLAE: Masked LoRA Experts for Visual Parameter-Efficient Fine-Tuning"☆28Updated 10 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆266Updated 10 months ago