☆21Jun 4, 2025Updated last year
Alternatives and similar repositories for multimodal_alignment
Users who are interested in multimodal_alignment are comparing it to the libraries listed below.
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆31Nov 12, 2024Updated last year
- ☆78Jul 30, 2025Updated 10 months ago
- Official Implementation of wd1☆30Sep 25, 2025Updated 8 months ago
- Fork of Flame repo for training of some new stuff in development☆19Jun 1, 2026Updated last week
- Official repository for Robust Multimodal Large Language Models Against Modality Conflict☆20Jul 9, 2025Updated 11 months ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- ☆19Jun 20, 2025Updated 11 months ago
- (CVPR 2024) FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning☆20Jun 21, 2024Updated last year
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆27Jan 17, 2026Updated 4 months ago
- ☆13Sep 8, 2024Updated last year
- NegCLIP.☆41Feb 6, 2023Updated 3 years ago
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆19Jun 4, 2025Updated last year
- ☆31Aug 21, 2023Updated 2 years ago
- ☆15Mar 20, 2025Updated last year
- ☆13Apr 10, 2025Updated last year
- Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities".☆25Aug 16, 2023Updated 2 years ago
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆12Jun 18, 2025Updated 11 months ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- ☆20Dec 13, 2023Updated 2 years ago
- ☆21Apr 3, 2026Updated 2 months ago
- G^3: Geolocation via Guidebook Grounding, Findings of EMNLP 2022☆17Sep 10, 2024Updated last year
- [CVPR'26] UniGame code implementation☆20Apr 21, 2026Updated last month
- Dataset for people walk on the roads☆16Mar 2, 2024Updated 2 years ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Apr 6, 2025Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- under review☆14Mar 1, 2021Updated 5 years ago
- ☆12Jun 26, 2024Updated last year
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆36Oct 15, 2025Updated 7 months ago
- [ICML 24] A novel automated neuron explanation framework that can accurately describe poly-semantic concepts in deep neural networks☆14May 2, 2025Updated last year
- [CVPR 2022] HINT: Hierarchical Neuron Concept Explainer☆20Apr 19, 2023Updated 3 years ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆102May 20, 2025Updated last year
- ☆19Jun 26, 2025Updated 11 months ago
- Large Language Models Powered Context-aware Motion Prediction☆15Jan 12, 2026Updated 5 months ago
- This is the official source code for CVPR 2024 paper [WWW: A Unified Framework for Explaining What, Where and Why of Neural Networks by I…☆16Mar 26, 2024Updated 2 years ago
- Holistic evaluation of multimodal foundation models☆48Aug 11, 2024Updated last year
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23☆38Oct 13, 2023Updated 2 years ago
- ☆33Jul 11, 2024Updated last year
- Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players☆591May 28, 2026Updated 2 weeks ago
- Submission Under Review☆17May 15, 2025Updated last year