☆39Nov 22, 2024Updated last year
Alternatives and similar repositories for Multimodal-Fusion-with-Attention-Bottlenecks
Users that are interested in Multimodal-Fusion-with-Attention-Bottlenecks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Variational Information Bottleneck (DVIB) in PyTorch.☆10Apr 25, 2020Updated 5 years ago
- ☆28Aug 22, 2024Updated last year
- Multimodal sentiment analysis using transformer encoders and fusion across text, audio, and visual features on the CMU-MOSEI dataset usin…☆13Jun 4, 2025Updated 10 months ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆21Apr 3, 2024Updated 2 years ago
- [PRCV-2023, IEEE TMM-2025] Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification☆12Dec 20, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- About PyTorch implementation for ‘’Robust Multi-View Clustering with Noisy Correspondence‘’ (TKDE 2024)☆11Aug 2, 2024Updated last year
- AVI-R Package (formerly DIVA IO): A robust reader for AVI video files☆12Dec 21, 2020Updated 5 years ago
- The code for the WACV24 paper: AU-Aware Dynamic 3D Face Reconstruction from Videos with Transformer☆16Nov 6, 2023Updated 2 years ago
- The official implementation of "Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled …☆13Nov 4, 2021Updated 4 years ago
- SW components and demos for visual kinship recognition. An emphasis is put on the FIW dataset-- data loaders, benchmarks, results in summ…☆17Mar 13, 2023Updated 3 years ago
- Official code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" [AAAI2025]☆63Feb 13, 2025Updated last year
- The code of CVPR2024 "S^2MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering "☆11Apr 3, 2024Updated 2 years ago
- Official repository for Robust Multimodal Large Language Models Against Modality Conflict☆20Jul 9, 2025Updated 9 months ago
- ☆11Oct 29, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆16Feb 17, 2025Updated last year
- 超市销售数据分析练习(R课程)☆10Oct 10, 2021Updated 4 years ago
- ☆12Oct 11, 2024Updated last year
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆24Jun 9, 2025Updated 10 months ago
- ☆11Jul 18, 2022Updated 3 years ago
- ☆15Mar 11, 2023Updated 3 years ago
- a library of works related to Large Language Models (LLMs) based Agent Hallucination☆53Oct 30, 2025Updated 5 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆20Jan 27, 2025Updated last year
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆57Apr 20, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆17Jun 9, 2022Updated 3 years ago
- Transfer Learning☆10Aug 3, 2018Updated 7 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Jun 20, 2023Updated 2 years ago
- AuxFormer: Robust Approach to Audiovisual Emotion Recognition☆14Mar 14, 2023Updated 3 years ago
- PyTorch implementation of the models described in the IEEE ICASSP 2022 paper "Is cross-attention preferable to self-attention for multi-m…☆65Mar 29, 2025Updated last year
- Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image Translation (AAAI'24)☆14Jun 16, 2024Updated last year
- Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)☆32Jan 24, 2024Updated 2 years ago
- Strom 实时风控统计☆21Nov 30, 2017Updated 8 years ago
- Official PyTorch Implementation.☆14Mar 30, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…☆23Jul 28, 2025Updated 8 months ago
- A curated list of balanced multimodal learning methods.☆167Mar 26, 2026Updated 3 weeks ago
- Incomplete Multi-view Clustering via Diffusion Contrastive Generation☆27Mar 22, 2026Updated 3 weeks ago
- [TPAMI 2023] This is an official implementation for "Vicinity Vision Transformer".☆22Jun 15, 2023Updated 2 years ago
- Code Release for "Minimum Class Confusion for Versatile Domain Adaptation"(ECCV2020)☆55Aug 2, 2020Updated 5 years ago
- Variational Information Bottleneck☆16Nov 26, 2018Updated 7 years ago
- Deep Safe Multi-view Clustering: Reducing the Risk of Clustering Performance Degradation Caused by View Increase☆13Jul 4, 2022Updated 3 years ago