☆40Nov 22, 2024Updated last year
Alternatives and similar repositories for Multimodal-Fusion-with-Attention-Bottlenecks
Users that are interested in Multimodal-Fusion-with-Attention-Bottlenecks are comparing it to the libraries listed below.
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆18Jun 21, 2023Updated 2 years ago
- ☆28Aug 22, 2024Updated last year
- Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition☆25Jul 12, 2022Updated 3 years ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆22Apr 3, 2024Updated 2 years ago
- [TAFFC 2024] The official implementation of paper: From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Rec…☆123Oct 28, 2025Updated 6 months ago
- PyTorch implementation for "Robust Multi-View Clustering with Noisy Correspondence" (TKDE 2024)☆11Aug 2, 2024Updated last year
- The official implementation of "Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled …☆13Nov 4, 2021Updated 4 years ago
- Official code repo of SimMLM [ICCV 2025]☆27Dec 1, 2025Updated 5 months ago
- The code of CVPR2024 "S^2MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering"☆11Apr 3, 2024Updated 2 years ago
- ☆13Nov 15, 2024Updated last year
- Official repository for Robust Multimodal Large Language Models Against Modality Conflict☆20Jul 9, 2025Updated 10 months ago
- ☆18Feb 17, 2025Updated last year
- Supermarket sales data analysis exercise (R course)☆10Oct 10, 2021Updated 4 years ago
- ☆12Oct 11, 2024Updated last year
- ☆11Jul 18, 2022Updated 3 years ago
- Code for Semantically Robust Unpaired Image Translation for Data with Unmatched Semantics Statistics (SRUNIT), ICCV 2021☆11Feb 10, 2022Updated 4 years ago
- ☆15Mar 11, 2023Updated 3 years ago
- a library of works related to Large Language Models (LLMs) based Agent Hallucination☆54Oct 30, 2025Updated 6 months ago
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆57Apr 20, 2023Updated 3 years ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆21Jan 27, 2025Updated last year
- ☆17Jun 9, 2022Updated 3 years ago
- Adapt Capsule Network for Named Entity Recognition Task☆10Jun 12, 2019Updated 6 years ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆48Nov 27, 2022Updated 3 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Jun 20, 2023Updated 2 years ago
- AuxFormer: Robust Approach to Audiovisual Emotion Recognition☆14Mar 14, 2023Updated 3 years ago
- PyTorch implementation of the models described in the IEEE ICASSP 2022 paper "Is cross-attention preferable to self-attention for multi-m…☆65Mar 29, 2025Updated last year
- Continuous wavelet transform☆17Apr 10, 2016Updated 10 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 4 years ago
- [ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…☆23Jul 28, 2025Updated 9 months ago
- A curated list of balanced multimodal learning methods.☆168Mar 26, 2026Updated last month
- [ACM MM 2024] Pytorch Code for the paper "Robust Variational Contrastive Learning for Partially View-unaligned Clustering"☆15Feb 7, 2026Updated 3 months ago
- This repository provides the implementation of the ICLR 2025 Multi-View Permutation of Variational Auto-Encoders (MVP) method for handlin…☆31Feb 21, 2025Updated last year
- Incomplete Multi-view Clustering via Diffusion Contrastive Generation☆29Mar 22, 2026Updated last month
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- Code Release for "Minimum Class Confusion for Versatile Domain Adaptation"(ECCV2020)☆55Aug 2, 2020Updated 5 years ago
- Variational Information Bottleneck☆16Nov 26, 2018Updated 7 years ago
- ☆14Feb 11, 2022Updated 4 years ago
- Medical Knowledge-Based Network For Patient-oriented Visual Question Answering☆19Feb 25, 2023Updated 3 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year