NMS05/Multimodal-Fusion-with-Attention-Bottlenecks

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NMS05/Multimodal-Fusion-with-Attention-Bottlenecks)

NMS05 / Multimodal-Fusion-with-Attention-Bottlenecks

☆42

Alternatives and similar repositories for Multimodal-Fusion-with-Attention-Bottlenecks

Users that are interested in Multimodal-Fusion-with-Attention-Bottlenecks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

viniciusguigo / pytorch_dvib
View on GitHub
Deep Variational Information Bottleneck (DVIB) in PyTorch.
☆10Apr 25, 2020Updated 6 years ago
SwordfallYeung / LogMonitor
View on GitHub
利用kafka+storm+mysql/redis构建日志监控系统
☆13May 6, 2018Updated 8 years ago
AlbertOh90 / Soft-VQ-VAE
View on GitHub
☆20May 28, 2019Updated 7 years ago
nku-zhichengzhang / TSL300
View on GitHub
[ACM MM 2022] This is the official implementation of "Temporal Sentiment Localization: Listen and Look in Untrimmed Videos"
☆18Feb 14, 2025Updated last year
PaulH97 / Sen12Landslides
View on GitHub
Official code repository of Sen12Landslides
☆29May 19, 2026Updated 2 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
HealthX-Lab / TextSAM-EUS
View on GitHub
[ICCV 2025 CVAMD] Official implementation of TextSAM-EUS
☆26Jan 28, 2026Updated 6 months ago
eeyhsong / NICE-LLM
View on GitHub
[TNNLS 2025] Language-guided contrastive learning for M/EEG-based image recognition.
☆36Dec 2, 2024Updated last year
nerovalerius / registration_3d
View on GitHub
Point cloud registration of two intel D435i 3D cameras using the Iterative-Closest-Point algorithm.
☆13May 31, 2021Updated 5 years ago
imanlab / action_conditioned_tactile_prediction
View on GitHub
☆10May 12, 2023Updated 3 years ago
AV-Odyssey / AV-Odyssey
View on GitHub
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31Dec 23, 2024Updated last year
DataoceanAI / CNVSRC2023Baseline
View on GitHub
Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)
☆23Apr 27, 2024Updated 2 years ago
kkew3 / cse291g-sv2p
View on GitHub
CDNA and SV2P reimplementation and improvement for class project
☆11Mar 18, 2019Updated 7 years ago
longzhen520 / S2MVTC
View on GitHub
The code of CVPR2024 "S^2MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering "
☆11Apr 3, 2024Updated 2 years ago
siahuat0727 / MGNet
View on GitHub
The official implementation of "Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled …
☆13Nov 4, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mispchallenge / MISP-ICME-AVSR
View on GitHub
☆17Jan 1, 2024Updated 2 years ago
visionjo / pykinship
View on GitHub
SW components and demos for visual kinship recognition. An emphasis is put on the FIW dataset-- data loaders, benchmarks, results in summ…
☆17Mar 13, 2023Updated 3 years ago
sunyuan-cs / 2024-TKDE-RMCNC
View on GitHub
About PyTorch implementation for ‘’Robust Multi-View Clustering with Noisy Correspondence‘’ (TKDE 2024)
☆11Aug 2, 2024Updated last year
XavierCHEN34 / UniReal
View on GitHub
☆30Jun 30, 2025Updated last year
haifangong / VQAMix
View on GitHub
[IEEE TMI'22] VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering
☆16Oct 9, 2022Updated 3 years ago
kuangcy1998 / AU-D3DFace
View on GitHub
The code for the WACV24 paper: AU-Aware Dynamic 3D Face Reconstruction from Videos with Transformer
☆17Nov 6, 2023Updated 2 years ago
zijwang / talkdown
View on GitHub
Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."
☆10Jan 26, 2020Updated 6 years ago
ms-dot-k / AVSR
View on GitHub
PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…
☆23Apr 3, 2024Updated 2 years ago
Awenbocc / CPCR
View on GitHub
☆15Mar 11, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TencentAILabHealthcare / SETMIL
View on GitHub
☆11Jul 18, 2022Updated 4 years ago
SeanJia / SRUNIT
View on GitHub
Code for Semantically Robust Unpaired Image Translation for Data with Unmatched Semantics Statistics (SRUNIT), ICCV 2021
☆11Feb 10, 2022Updated 4 years ago
ydk122024 / CCIM
View on GitHub
[CVPR2023] Context De-confounded Emotion Recognition
☆18Jul 23, 2023Updated 3 years ago
porterjenkins / region-encoder
View on GitHub
Repository for the paper "Unsupervised Representation Learning of Spatial Data via Multimodal Embedding"
☆12Dec 5, 2019Updated 6 years ago
Yangxin666 / STGAE
View on GitHub
Code repo for Spatio-Temporal Denoising Graph Autoencoder (STD-GAE)
☆12Sep 6, 2022Updated 3 years ago
YorkUCVIL / Static-Dynamic-Interpretability
View on GitHub
☆17Jun 9, 2022Updated 4 years ago
menon92 / WaveletCNN
View on GitHub
Wavelet CNN, Texture Classification in Keras
☆54Mar 3, 2022Updated 4 years ago
technion-cs-nlp / irm-for-nli
View on GitHub
☆11Jun 2, 2022Updated 4 years ago
Thinklab-SJTU / predictive-consistency-learning
View on GitHub
[ICML 2025] Generative Modeling Reinvents Supervised Learning: Label Repurposing with Predictive Consistency Learning
☆15Jul 14, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
bruceyo / TSMF
View on GitHub
Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition
☆25Jul 12, 2022Updated 4 years ago
yasar-rehman / L-DAWA
View on GitHub
This is the official repository of L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representat…
☆12May 20, 2024Updated 2 years ago
MikaStars39 / StableMask
View on GitHub
PyTorch implementation of StableMask (ICML'24)
☆15Jun 27, 2024Updated 2 years ago
hbing-l / PoSynDA
View on GitHub
[ACM MM 2023] PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation
☆12Aug 28, 2023Updated 2 years ago
tinglyfeng / figure_for_data_analysis
View on GitHub
☆10Apr 15, 2023Updated 3 years ago
XLearning-SCU / 2024-ICLR-READ
View on GitHub
Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".
☆54Dec 24, 2024Updated last year
xychen2022 / VersatileSegmentation
View on GitHub
☆10May 14, 2024Updated 2 years ago