zaocan666 / DyViSEView external linksLinks
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12Jul 5, 2022Updated 3 years ago
Alternatives and similar repositories for DyViSE
Users that are interested in DyViSE are comparing it to the libraries listed below
Sorting:
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- ☆14Jul 11, 2022Updated 3 years ago
- ☆32Jun 26, 2023Updated 2 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆59Jan 24, 2024Updated 2 years ago
- SafeEar是由浙大和清华共同开发的一种深度伪声探测模型。这是我撰写的模型推理脚本。我不确定它是否正确,目前我还是初学者,如有问题请原谅我并指出,谢谢!☆15May 16, 2025Updated 8 months ago
- ☆16Dec 17, 2024Updated last year
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 4 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 8 months ago
- ☆24Feb 20, 2024Updated last year
- ICASSP 2021 accepted paper☆20May 20, 2021Updated 4 years ago
- ☆20Dec 29, 2024Updated last year
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- ☆49Nov 24, 2022Updated 3 years ago
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆18Jul 11, 2022Updated 3 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆165Mar 23, 2025Updated 10 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆29Feb 26, 2023Updated 2 years ago
- ☆25Nov 23, 2021Updated 4 years ago
- Code for paper Learning Audio-Visual Dereverberation☆30Aug 10, 2022Updated 3 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Production first, nn-based on-device signal processing toolkit.☆65May 30, 2023Updated 2 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆69Jul 8, 2021Updated 4 years ago
- Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)☆31Jul 21, 2021Updated 4 years ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Jul 24, 2023Updated 2 years ago
- This project was built during the competition of Smart India Hackathon 2020. In This I am using a Android device's Camera to detect Garba…☆11Apr 5, 2023Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago
- ☆70Sep 13, 2024Updated last year
- ☆30Jul 21, 2022Updated 3 years ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆66May 3, 2022Updated 3 years ago
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆72Oct 20, 2020Updated 5 years ago
- SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING☆42Apr 5, 2023Updated 2 years ago
- Official implementation for AVGN☆40Mar 24, 2023Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆35Mar 18, 2023Updated 2 years ago
- ☆37Mar 30, 2021Updated 4 years ago
- ☆12Sep 25, 2023Updated 2 years ago