[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
☆63Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for MSDWILD
Users that are interested in MSDWILD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Feb 19, 2026Updated 2 months ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆59May 29, 2023Updated 2 years ago
- ☆51Nov 24, 2022Updated 3 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆172Mar 23, 2025Updated last year
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- ☆42Nov 22, 2024Updated last year
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'☆467Oct 23, 2023Updated 2 years ago
- ☆64Jun 28, 2023Updated 2 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 11 months ago
- Visualization tools for audio-only and multi-modal speaker diarization dataset☆13Oct 27, 2023Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆164Jul 26, 2022Updated 3 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- ☆20Mar 20, 2026Updated last month
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 9 months ago
- ☆13Oct 25, 2024Updated last year
- Diarization scoring tools.☆263Apr 8, 2026Updated last month
- ☆59Mar 28, 2025Updated last year