EricWilbanks / faseAlign
Command line tool for forced-alignment of Spanish speech data
☆13Updated 2 years ago
Alternatives and similar repositories for faseAlign
Users who are interested in faseAlign are comparing it to the libraries listed below.
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- ☆58Updated last year
- Simple Python package for fast DER computation☆35Updated 2 years ago
- ☆40Updated 3 years ago
- ☆37Updated 4 years ago
- Alignment files of LibriTTS.☆64Updated 5 years ago
- Implementation of audio degradation processes☆103Updated 9 years ago
- Discriminative Training of VBx Diarization☆26Updated last year
- A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 4 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆76Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training☆41Updated 4 years ago
- PyTorch implementation of RPNSD☆60Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆59Updated 8 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆45Updated 8 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆116Updated this week
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Updated 4 years ago
- A simple package for Guided source separation (GSS)☆128Updated last year
- Yin pitch estimator in PyTorch☆117Updated 2 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Updated last year
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- Keras-based Python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- multilingual speech aligner☆77Updated last year
- Clustering-based methods for overlapping diarization☆81Updated last year
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆79Updated 3 months ago
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆18Updated 5 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆51Updated last year
- ☆54Updated last year
- MOS score prediction by fine-tuned wav2vec2.0 model☆167Updated 2 years ago