(ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
☆15 · Apr 29, 2025 · Updated 10 months ago
Alternatives and similar repositories for cav2vec
Users who are interested in cav2vec are comparing it to the libraries listed below
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition ☆13 · Oct 22, 2024 · Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆41 · Aug 29, 2024 · Updated last year
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin… ☆21 · Apr 3, 2024 · Updated last year
- ☆17 · Jan 1, 2024 · Updated 2 years ago
- ☆19 · Apr 18, 2024 · Updated last year
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023) ☆21 · Oct 28, 2023 · Updated 2 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 · Feb 24, 2025 · Updated last year
- Audio-Visual Speech Recognition ☆20 · Jul 7, 2025 · Updated 7 months ago
- WildVSR ☆21 · Dec 13, 2023 · Updated 2 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an… ☆35 · Jun 20, 2023 · Updated 2 years ago
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models