☆44May 20, 2025Updated 10 months ago
Alternatives and similar repositories for C-MCR
Users that are interested in C-MCR are comparing it to the libraries listed below
Sorting:
- ☆45May 20, 2025Updated 10 months ago
- ☆22Apr 22, 2025Updated 10 months ago
- Code repository for MMUGL: Multi-modal Graph Learning over UMLS Knowledge Graphs☆11Dec 7, 2023Updated 2 years ago
- ☆23Jul 29, 2023Updated 2 years ago
- Official Repository for ICML 2024 Paper "OT-CLIP: Understanding and Generalizing CLIP via Optimal Transport"☆23Dec 4, 2025Updated 3 months ago
- CLUE code☆14May 1, 2025Updated 10 months ago
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆58Sep 4, 2024Updated last year
- Official implementation of MICCAI2023【Knowledge Boosting: Rethinking Medical Contrastive Vision-Langauge Pre-training】☆16Mar 19, 2024Updated 2 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated 2 years ago
- Accepted at ICCV '23☆15Oct 4, 2023Updated 2 years ago
- PMMRec: Multi-Modality is All You Need for Transferable Recommender Systems☆23Aug 8, 2023Updated 2 years ago
- KDD 2024 | FlexCare: Leveraging Cross-Task Synergy for Flexible Multimodal Healthcare Prediction☆17Sep 4, 2024Updated last year
- Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)☆20Dec 6, 2022Updated 3 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27May 30, 2025Updated 9 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆54Jun 28, 2024Updated last year
- ☆19May 19, 2024Updated last year
- Joint learning of images and text via maximization of mutual information☆19Dec 14, 2021Updated 4 years ago
- A platform for reinforcement learning in Terraria☆12Nov 20, 2019Updated 6 years ago
- A python implement for Certifiable Robust Multi-modal Training☆19Jun 21, 2025Updated 9 months ago
- [ACCV2024 (Oral)] Official pytorch implementation of X-RGen☆19Jan 20, 2025Updated last year
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- ☆17Jul 15, 2024Updated last year
- ☆33Apr 11, 2025Updated 11 months ago
- [T-PAMI] A curated list of self-supervised multimodal learning resources.☆276Aug 16, 2024Updated last year
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆27Feb 2, 2025Updated last year
- A PyTorch implementation of ACNet based on TCSVT 2023 paper "ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image…☆11Dec 8, 2023Updated 2 years ago
- personalized recommendation☆12Mar 26, 2020Updated 5 years ago
- The source code for the paper: Yirong Mao, Ruiping Wang, Shiguang Shan, Xilin Chen. COSONet: Compact Second-Order Network for Video Face …☆12Dec 27, 2018Updated 7 years ago
- The efficient tuning method for VLMs☆81Mar 10, 2024Updated 2 years ago
- 3D Face Alignment ---The 10th International Conference on Image and Graphics(ICIG2019)-Oral☆11Dec 3, 2019Updated 6 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Sep 1, 2023Updated 2 years ago
- Trust Region Policy Optimization with Generalized Advantage Estimator☆16Nov 15, 2018Updated 7 years ago
- baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift☆18Mar 16, 2024Updated 2 years ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆69Oct 15, 2024Updated last year
- CVPR2023: Few-Shot Learning with Visual Distribution Calibration and Cross-Modal Distribution Alignment☆15May 19, 2023Updated 2 years ago
- Future Technologies Conference 2025 - MULTIMODAL EMOTION RECOGNITION AND SENTIMENT ANALYSIS IN MULTI-PARTY CONVERSATION CONTEXTS☆13Sep 12, 2024Updated last year
- The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"☆25May 18, 2023Updated 2 years ago