[CVPR25] Official Implementation of CAV-MAE Sync
☆30Apr 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for cav-mae-sync
Users that are interested in cav-mae-sync are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unsupervised word segmentation and clustering of speech☆13Feb 17, 2017Updated 9 years ago
- Temperature Schedules for self-supervised contrastive methods on long-tail data (ICLR'23)☆18Apr 25, 2023Updated 2 years ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024)☆16Mar 17, 2025Updated last year
- WildVSR☆22Dec 13, 2023Updated 2 years ago
- ☆23Dec 5, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Mar 24, 2024Updated 2 years ago
- Uncertainty-Guided Pseudo-Labelling with Model Averaging☆11Mar 17, 2026Updated last month
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated last year
- awesome-audio-visual-robustness☆11Jan 27, 2024Updated 2 years ago
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]☆22Oct 27, 2024Updated last year
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Generative Regional Editing (GRE) Benchmark☆19Sep 10, 2024Updated last year
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆54Mar 30, 2022Updated 4 years ago
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.☆24Aug 19, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆28Nov 29, 2023Updated 2 years ago
- ☆31Jun 19, 2025Updated 10 months ago
- Official repository for the MMFM challenge☆25Jun 18, 2024Updated last year
- Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)☆90Jul 25, 2024Updated last year
- First neural GPT aligned with text and speech. Welcome to join us to make better foundation model in neural modality.☆14Oct 30, 2024Updated last year
- DO with Terraform and Ansible☆11Jun 5, 2018Updated 7 years ago
- ICCV 2021☆34May 11, 2022Updated 3 years ago
- A reconstruction framework for materializing subjective experiences from brain signals☆14Jan 18, 2025Updated last year
- Aligning First, Then Fusing: A Novel Weakly-Supervised Multimodal Violence Detection Method☆22Oct 2, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆290Mar 20, 2024Updated 2 years ago
- Using GAN to create synthetic and partially synthetic EEG data to augment training sets for motor imagery interaction tasks☆13Aug 27, 2019Updated 6 years ago
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"☆26Jun 8, 2025Updated 10 months ago
- The DistanceMetrics package is a comprehensive Python library designed to compute a wide variety of distance metrics between two vectors,…☆16Sep 25, 2025Updated 6 months ago
- Conditional EEG diffusion model☆17Apr 5, 2024Updated 2 years ago
- ☆24Feb 3, 2026Updated 2 months ago
- Codebase for publication "Neural decoding from stereotactic EEG: accounting for electrode variability across subjects" @ NeurIPS (2024)☆19Jun 11, 2025Updated 10 months ago
- Official Implementation of "Interpretable 3D Neural Object Volumes for Robust Conceptual Reasoning." ICLR 2026.☆30Feb 3, 2026Updated 2 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition☆54May 14, 2024Updated last year
- A framework for building speech-enabled websites.☆10Jul 10, 2015Updated 10 years ago
- Code for Learning to Learn Language from Narrated Video☆33Oct 3, 2023Updated 2 years ago
- Celeb-DF++: A Large-scale Challenging Video DeepFake Benchmark for Generalizable Forensics☆29Jul 29, 2025Updated 8 months ago
- WavSpA: Wavelet Space Attention for Enhancing Transformer's Long Sequence Learning☆12Feb 24, 2024Updated 2 years ago
- ☆37Jul 31, 2025Updated 8 months ago
- A novel contrastive split-latent permutation autoencoder (CSLP-AE) framework☆15Nov 20, 2024Updated last year