[NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy
☆73Nov 13, 2023Updated 2 years ago
Alternatives and similar repositories for FactorCL
Users that are interested in FactorCL are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023, ICMI 2023] Quantifying & Modeling Multimodal Interactions☆86Oct 28, 2024Updated last year
- ☆27Mar 3, 2025Updated 11 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- ☆13Oct 30, 2024Updated last year
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆12Oct 9, 2024Updated last year
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- ☆14May 20, 2025Updated 9 months ago
- ☆10Nov 23, 2023Updated 2 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- ☆13Jul 10, 2024Updated last year
- Comparing performance of different InfoNCE type losses used in contrastive learning.☆14Jun 12, 2024Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆39Sep 30, 2024Updated last year
- Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks☆12Apr 13, 2020Updated 5 years ago
- ☆15Mar 30, 2025Updated 11 months ago
- Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks☆30May 25, 2022Updated 3 years ago
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆17Jul 1, 2024Updated last year
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- ☆30Jun 14, 2022Updated 3 years ago
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models☆99Aug 22, 2024Updated last year
- code for "BEAT-ALIGNED SPECTROGRAM-TO-SEQUENCE GENERATION OF RHYTHM-GAME CHARTS" (ISMIR 2023 LBD)☆18Jan 29, 2024Updated 2 years ago
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated 11 months ago
- ☆17Jan 1, 2024Updated 2 years ago
- The official implementation of OpenSR (ACL2023 Oral)☆16Nov 29, 2023Updated 2 years ago
- Official implementation of Transpotter, published in BMVC 2021☆16Aug 6, 2022Updated 3 years ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆170Sep 26, 2022Updated 3 years ago
- (TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation☆22Aug 8, 2024Updated last year
- This is an official implementation of GRIT-VLP☆20Aug 8, 2022Updated 3 years ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆20Nov 25, 2024Updated last year
- ☆28Nov 25, 2024Updated last year
- Embedder with binary sparse distributed representation.☆20May 15, 2025Updated 9 months ago
- Multi-view-AE: An extensive collection of multi-modal autoencoders implemented in a modular, scikit-learn style framework.☆56Aug 1, 2024Updated last year
- Immunology Informatics - Big Data Analysis in Immunology - Tutorials☆24Sep 9, 2017Updated 8 years ago
- Official codebase for Human Guided Exploration (HuGE)☆22Aug 16, 2023Updated 2 years ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 7 months ago
- ☆24Jul 18, 2024Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Mar 18, 2025Updated 11 months ago
- Benchmarking GNNs for fMRI analysis☆25Nov 27, 2022Updated 3 years ago
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year