EricWilbanks / faseAlign
Command line tool for forced-alignment of Spanish speech data
☆12Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for faseAlign
- ☆40Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆34Updated last year
- ☆28Updated 3 weeks ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆66Updated 7 months ago
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago
- A list of papers for child ASR☆26Updated last month
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆31Updated 4 months ago
- ☆27Updated 3 years ago
- Yin pitch estimator in PyTorch☆115Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆61Updated 7 months ago
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- ☆48Updated 5 months ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Simple Python package for fast DER computation☆32Updated last year
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- multilingual speech aligner☆71Updated 11 months ago
- Deep Articulatory Synthesis and Inversion☆43Updated 8 months ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 2 years ago
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆18Updated 4 years ago
- ☆27Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆26Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆43Updated 3 weeks ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 4 years ago
- ☆13Updated this week
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated last year
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆46Updated 4 months ago