EricWilbanks / faseAlign
Command line tool for forced-alignment of Spanish speech data
☆13Updated 2 years ago
Alternatives and similar repositories for faseAlign:
Users who are interested in faseAlign are comparing it to the repositories listed below.
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 9 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆30Updated last year
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- Python wrappers for Kaldi's Levenshtein distance and alignment code.☆63Updated last year
- Dataset for the ICASSP 2021 paper "Multilingual Phonetic Dataset for Low Resource Speech Recognition"☆40Updated last year
- Phonetically-Oriented Word Error Rate☆34Updated 5 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆27Updated 6 months ago
- A list of papers for child ASR☆38Updated 5 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated last month
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 5 years ago
- Discriminative Training of VBx Diarization☆23Updated 6 months ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆42Updated 4 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- A PyTorch implementation of "MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network"☆60Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated 8 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- Multilingual speech aligner☆72Updated last year