EricWilbanks / faseAlignLinks
Command line tool for forced-alignment of Spanish speech data
☆13Updated 2 years ago
Alternatives and similar repositories for faseAlign
Users that are interested in faseAlign are comparing it to the libraries listed below
Sorting:
- Simple Python package for fast DER computation☆33Updated 2 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Updated last year
- ☆56Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- PyTorch implementation of RPNSD☆60Updated last year
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Discriminative Training of VBx Diarization☆25Updated 9 months ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 5 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- ☆23Updated last year
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆67Updated 2 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆53Updated 5 months ago
- ☆35Updated 4 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆21Updated 2 years ago
- ☆48Updated 5 years ago
- ☆40Updated 3 years ago
- Discriminative Condition-Aware PLDA☆44Updated 11 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆108Updated 9 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆31Updated 2 years ago
- ☆53Updated 7 months ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆43Updated 4 years ago
- Implementation of audio degradation processes☆103Updated 9 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆29Updated 2 years ago
- ☆54Updated last year
- A simple package for Guided source separation (GSS)☆125Updated last year
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆70Updated 4 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆42Updated 2 years ago