microsoft / NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
☆40Updated this week
Related projects: ⓘ
- ☆63Updated last year
- A simple package for Guided source separation (GSS)☆104Updated 3 months ago
- Clustering-based methods for overlapping diarization☆68Updated 8 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆42Updated this week
- ☆27Updated 5 months ago
- ☆47Updated 4 months ago
- Discriminative Training of VBx Diarization☆17Updated 7 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆26Updated last year
- ☆47Updated 3 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆66Updated 3 weeks ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆24Updated 3 months ago
- UTokyo-SaruLab MOS Prediction System☆49Updated this week
- ☆48Updated 11 months ago
- A list of papers for child ASR☆24Updated 5 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆37Updated 3 months ago
- Pytorch implementation of subband decomposition☆88Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆61Updated 4 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆78Updated 11 months ago
- ☆31Updated 3 years ago
- ConMamba for Automatic Speech Recognition☆38Updated last month
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆91Updated last year
- The VoxTube dataset official repository☆60Updated 7 months ago
- Reference-aware automatic speech evaluation toolkit☆95Updated 6 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated 6 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆54Updated 3 weeks ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆54Updated last year
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆12Updated 3 months ago
- Official repository of NeXt-TDNN for speaker verification☆48Updated 5 months ago
- ☆70Updated 3 weeks ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 2 years ago