usc-sail / mica-speech-activity-detection
Robust Speech Activity Detection (SAD) in movie audio
☆25Updated 3 years ago
Related projects
Alternatives and complementary repositories for mica-speech-activity-detection
- Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Constrained Permutation Invariant Training, Speech Separation☆42Updated 3 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- SpEx+(tied) source code☆75Updated last year
- An unofficial PyTorch implementation of Google's VoiceFilter☆97Updated last year
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- Components loss for neural networks in mask-based speech enhancement☆33Updated 3 years ago
- PyTorch implementation of RPNSD☆60Updated 4 months ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Updated 3 years ago
- MultiSV: scripts for data preparation☆25Updated 4 months ago
- ☆29Updated 2 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆70Updated 5 years ago
- TensorFlow 2 implementation of Speech Separation Methods☆24Updated 4 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆17Updated 2 years ago
- STOI loss function in PyTorch☆87Updated last month
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 2 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆48Updated 2 years ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆71Updated 3 years ago
- Develops a speaker recognition model based on i-vectors using the TIMIT database☆16Updated 5 years ago
- A PyTorch implementation of MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network☆61Updated 3 years ago
- ☆59Updated 4 years ago
- Code for adding reverberation to audio☆25Updated last year
- Implementation of "Efficient Keyword Spotting Using Dilated Convolutions and Gating"☆35Updated 4 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Updated 2 years ago
- Contains code for Deep Self-Supervised Hierarchical Clustering for Speaker Diarization☆16Updated 2 years ago