☆49Nov 24, 2022Updated 3 years ago
Alternatives and similar repositories for AVA-AVD
Users that are interested in AVA-AVD are comparing it to the libraries listed below
Sorting:
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆59Jan 24, 2024Updated 2 years ago
- ☆67Sep 13, 2022Updated 3 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆109Jun 19, 2023Updated 2 years ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- ☆19Apr 18, 2024Updated last year
- ☆13Oct 25, 2024Updated last year
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆167Mar 23, 2025Updated 11 months ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 11 months ago
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'☆453Oct 23, 2023Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆78Oct 18, 2022Updated 3 years ago
- ☆92Apr 24, 2025Updated 10 months ago
- ☆14Feb 9, 2023Updated 3 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- ☆32Jun 26, 2023Updated 2 years ago