mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆39Updated last year
Alternatives and similar repositories for w2v2_audioFrameClassification:
Users that are interested in w2v2_audioFrameClassification are comparing it to the libraries listed below
- ☆43Updated 2 years ago
- ☆57Updated 10 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆76Updated 9 months ago
- Clustering-based methods for overlapping diarization☆77Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆86Updated 3 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆25Updated 10 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆56Updated 5 months ago
- A simple package for Guided source separation (GSS)☆117Updated 9 months ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆26Updated 3 months ago
- ☆31Updated 11 months ago