prajwalkr / transpotterLinks
Official implementation of Transpotter, published in BMVC 2021
☆16Updated 2 years ago
Alternatives and similar repositories for transpotter
Users that are interested in transpotter are comparing it to the libraries listed below
Sorting:
- Audio Only Speech Enhancement using Unet☆9Updated 4 years ago
- ☆30Updated 2 years ago
- ☆29Updated 3 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated 2 years ago
- ☆17Updated 7 months ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Updated 3 years ago
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Updated last year
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆41Updated 2 years ago
- ☆45Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆17Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆11Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 4 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆12Updated 8 months ago
- ☆65Updated 9 months ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12Updated last year
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆20Updated 11 months ago
- Seeing Wake Words: Audio-visual Keyword Spotting☆65Updated 4 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10Updated 5 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆34Updated 2 years ago
- Self-supervised Speaker Diarization Interspeech 2022 Implementation☆8Updated 8 months ago
- ☆43Updated 2 years ago
- [INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion"☆13Updated last month
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆87Updated 3 years ago
- ☆38Updated 7 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆19Updated 9 months ago